Data Warehouse Architecture

Lesson 5 of 32 in the free Data Warehousing and Data Mining (Elective-II) notes on Siksha Sarovar, written by Rohit Jangra.

---

5. Data Warehouse Architecture

A Data Warehouse architecture describes the overall system from the source data to the end-user tools. It typically consists of three main tiers.

5.1 Tier 1: The Bottom Tier (Data Source & ETL)

This tier covers the operational source systems where data resides before it enters the warehouse, together with the ETL process that moves it in.

  • Sources: Relational Databases (Oracle, SQL Server), Mainframes (VSAM), Flat files, ERP systems (SAP), CRM systems (Salesforce).
  • ETL Process: This is the critical bridge between the source systems and the warehouse. It has three steps:
      • Extract: Reading data from source systems.
      • Transform: Cleaning, formatting, and integrating data (e.g., converting currency, fixing typos, calculating derived fields).
      • Load: Writing the transformed data into the Data Warehouse tables.
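
The three ETL steps can be sketched in a few lines of Python. This is only a minimal illustration, not a production pipeline: the sample CSV data, the exchange rate, the misspelling lookup, and the `sales` table are all assumptions made up for the example, and an in-memory SQLite database stands in for the warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical source extract: a flat file with inconsistent city
# spellings and amounts in two currencies.
SOURCE_CSV = """order_id,city,amount,currency,quantity
1,Dehli,1000,INR,2
2,Mumbai,10,USD,1
3,delhi ,500,INR,5
"""

# Assumed fixed exchange rates into INR, for illustration only.
RATES_TO_INR = {"INR": 1.0, "USD": 80.0}
# Assumed lookup table of known misspellings.
CITY_FIXES = {"dehli": "Delhi"}


def extract(text):
    """Extract: read raw rows from the source (here, CSV text)."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: fix typos, convert currency, calculate a derived field."""
    cleaned = []
    for row in rows:
        city = row["city"].strip()
        city = CITY_FIXES.get(city.lower(), city.title())
        amount_inr = float(row["amount"]) * RATES_TO_INR[row["currency"]]
        quantity = int(row["quantity"])
        cleaned.append({
            "order_id": int(row["order_id"]),
            "city": city,
            "amount_inr": amount_inr,
            "unit_price_inr": amount_inr / quantity,  # derived field
        })
    return cleaned


def load(conn, rows):
    """Load: write the transformed rows into a warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales ("
        "order_id INTEGER PRIMARY KEY, city TEXT, "
        "amount_inr REAL, unit_price_inr REAL)"
    )
    conn.executemany(
        "INSERT INTO sales VALUES "
        "(:order_id, :city, :amount_inr, :unit_price_inr)",
        rows,
    )
    conn.commit()


conn = sqlite3.connect(":memory:")  # stand-in for the warehouse
load(conn, transform(extract(SOURCE_CSV)))
loaded = conn.execute(
    "SELECT city, amount_inr, unit_price_inr FROM sales ORDER BY order_id"
).fetchall()
print(loaded)
```

Keeping each step in its own function mirrors the Extract, Transform, Load split: the source format or the cleaning rules can change without touching the loading code.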

5.2 Tier 2: The Middle Tier (The Data Warehouse Server)

This is the central storage component.

  • Database Server: Usually a Relational Database Management System (RDBMS) optimized for OLAP.
  • Storage Structures: Contains the dimension and fact tables organized into schemas (Star, Snowflake).
  • Metadata Repository: Keeps track of the data definitions and mappings.
  • OLAP Engine: Performs complex calculations and aggregations quickly.
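
To make the middle tier more concrete, here is a small sketch of a star schema with one fact table and two dimension tables, followed by an OLAP-style aggregation. The table names, columns, and sample rows are hypothetical, and SQLite is used only because it ships with Python; a real warehouse would normally run on a server-grade RDBMS. Listing the tables from SQLite's built-in `sqlite_master` catalog is a very loose analogue of a metadata repository, not a substitute for one.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Star schema: one central fact table referencing two dimension tables.
conn.executescript("""
CREATE TABLE dim_product (
    product_id INTEGER PRIMARY KEY, name TEXT, category TEXT
);
CREATE TABLE dim_date (
    date_id INTEGER PRIMARY KEY, year INTEGER, quarter TEXT
);
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    date_id    INTEGER REFERENCES dim_date(date_id),
    units      INTEGER,
    revenue    REAL
);
""")

# Hypothetical sample data.
conn.executemany(
    "INSERT INTO dim_product VALUES (?, ?, ?)",
    [(1, "Laptop", "Electronics"),
     (2, "Phone", "Electronics"),
     (3, "Desk", "Furniture")],
)
conn.executemany(
    "INSERT INTO dim_date VALUES (?, ?, ?)",
    [(1, 2024, "Q1"), (2, 2024, "Q2")],
)
conn.executemany(
    "INSERT INTO fact_sales VALUES (?, ?, ?, ?)",
    [(1, 1, 2, 1000.0), (2, 1, 5, 1500.0), (3, 1, 1, 200.0),
     (1, 2, 1, 500.0), (3, 2, 4, 800.0)],
)

# Metadata lookup: which tables does the schema define?
tables = [row[0] for row in conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
)]

# OLAP-style roll-up: total units and revenue per category and quarter.
rollup = conn.execute("""
    SELECT p.category, d.quarter, SUM(f.units), SUM(f.revenue)
    FROM fact_sales f
    JOIN dim_product p ON p.product_id = f.product_id
    JOIN dim_date d ON d.date_id = f.date_id
    GROUP BY p.category, d.quarter
    ORDER BY p.category, d.quarter
""").fetchall()

print(tables)
for row in rollup:
    print(row)
```

The fact table holds only keys and numeric measures, while the descriptive attributes (category, quarter) live in the dimensions, which is what lets the same facts be aggregated along different dimensions.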

5.3 Tier 3: The Top Tier (Front-End Tools)

This layer provides the interface for the end-user.

  • Query Tools: Tools for writing and running ad hoc SQL queries against the warehouse.
  • Reporting Tools: Crystal Reports, SSRS.
  • Data Visualization: Tableau, Power BI, QlikView.
  • Data Mining Tools: Used to discover hidden patterns in the data.
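
As a rough idea of what "discovering hidden patterns" can mean, the sketch below counts which pairs of items are frequently bought together. The transactions and the support threshold are invented for illustration, and the brute-force pair counting shown here is a simplification, not a full association-rule algorithm such as Apriori.

```python
from collections import Counter
from itertools import combinations

# Hypothetical market-basket data: each set is one customer transaction.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "eggs"},
]

# Assumed threshold: a pair must appear in at least 50% of transactions.
MIN_SUPPORT = 0.5

# Count every unordered pair of items that occurs in the same basket.
pair_counts = Counter()
for basket in transactions:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# Support = fraction of all transactions that contain the pair.
frequent_pairs = {
    pair: count / len(transactions)
    for pair, count in pair_counts.items()
    if count / len(transactions) >= MIN_SUPPORT
}

print(frequent_pairs)
```

In a real deployment the transactions would come from the warehouse's fact tables rather than a hard-coded list.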