This is the final assignment from the Data Management course from the Big Data & Analytics Masters @ EAE class of 2021. This project:
- Extracts data from the famous Microsoft SQL demo database Adventureworks
- Uses Date Nager API to specify holidays into the date dimension
- Uses a simple weather csv to specify northern and southern hemisphere season into the date dimension
- Transforms the data using Pentaho Kettle, as ETL tool
- Loads the data into My SQL as RDBMS for the OLAP database of the Sales and Purchases datamarts
Professors:
Team:
- Adrian Hagen
- Jon Dale
- Mohamed Ashmawy
- Mostafa Ezz
- Joseph Higaki
Implementation of SCD (Slowly Changing Dimension) type 2: add new row on history-worth-keeping attribute changes.
Use SCD type 2 appropriate version, for Product, Customer and Vendor Dimensions