Skip to content

Unlock the power of data with our comprehensive Talend project aimed at constructing a robust (DWH) from the renowned Northwind dataset. Divided into two pivotal phases, this project seamlessly integrates data from the Northwind Access Database and the Transactional Database in SQL Server.

Notifications You must be signed in to change notification settings

3amory99/Building-Northwind-DWH-Using-Talend

Repository files navigation

Building Northwind Data Warehouse Using Talend

Overview

Unlock the power of data with our comprehensive Talend project aimed at constructing a robust Data Warehouse (DWH) from the renowned Northwind dataset. Divided into two pivotal phases, this project seamlessly integrates data from the Northwind Access Database and the Transactional Database (Northwind) in SQL Server.

Staging Area

In the Staging Area, data is ingested and prepared for further processing. Two primary sources are utilized:

  1. Northwind Access Database:

    • This database serves as a key source of data for the project.
    • Talend is used to extract data from the Access database, transforming and cleaning it for compatibility with the data warehouse schema.
  2. Transactional Database (Northwind) in SQL Server:

    • The SQL Server database provides additional transactional data for a more comprehensive data warehouse.
    • Talend is employed to extract relevant data, ensuring consistency and conformity with the overall project requirements.
  • Customer

    customer

  • Employee

    employee

  • Product

    product

  • Order

    order

  • Order Details

    order_details

Data Warehouse

The second part of the project involves building the Data Warehouse. This involves:

  1. Schema Design:

    • Designing an effective and scalable data warehouse schema to accommodate the requirements of the project.
    • Ensuring that the schema supports efficient querying and reporting.
  2. ETL Processes:

    • Developing Extract, Transform, Load (ETL) processes using Talend to populate the Data Warehouse.
    • Transforming data from the Staging Area to fit the warehouse schema.
    • Handling any necessary data cleansing and enrichment.
  3. Optimization:

    • Implementing optimization techniques to enhance the performance of the warehouse.
    • Indexing, partitioning, and other strategies are considered for efficient data retrieval.
  • Customer Dimension

    dim_customer

  • Employee Dimension

    dim_employee

  • Product Dimension

    dim_product

  • Date Dimension

    dim_date

  • Fact Table 1

    fact_1

  • Fact Table 2

    fact2

  • Schema

    schema

Getting Started

To replicate this project, follow these steps:

  1. Clone the Repository:
    git clone https://github.com/your-username/Building-Northwind-DWH-Using-Talend.git
    

About

Unlock the power of data with our comprehensive Talend project aimed at constructing a robust (DWH) from the renowned Northwind dataset. Divided into two pivotal phases, this project seamlessly integrates data from the Northwind Access Database and the Transactional Database in SQL Server.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages