yugakanse/Motor-Vehicle-collisions-Crashes

Project Title: Designing Advanced Data Architectures for Business Intelligence - Motor Vehicle Collisions/Crashes Analysis

Introduction: This project designs and implements advanced data architectures for analyzing motor vehicle collision data from three major cities: New York, Chicago, and Austin. The data is obtained from the respective Department of Transportation portals of each city. The project covers data extraction, transformation, and loading (ETL), dimensional modeling, and visualization, using tools such as Alteryx, Talend, Azure SQL Server/MySQL/SQL Server, Tableau, and Power BI.

Project Details:

  • Data Sources:
    • Motor Vehicle Collisions - Crashes | NYC Open Data (cityofnewyork.us)
    • Austin Crash Report Data - Crash Level Records | Open Data | City of Austin Texas
    • Traffic Crashes - Crashes | City of Chicago | Data Portal

Project Objectives:

  1. Determine the total number of accidents in each city.
  2. Present accident data effectively on a dashboard.
  3. Identify areas within each city with the highest number of accidents.
  4. Analyze accidents resulting in injuries.
  5. Investigate pedestrian involvement in accidents.
  6. Determine peak times for accidents (seasonality).
  7. Analyze injuries and fatalities among motorists.
  8. Identify areas with the highest fatality rates.
  9. Conduct time-based analysis of accidents.
  10. Analyze factors contributing to accidents.
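
Once the dimensional model exists, objectives such as 1, 4, and 6 reduce to aggregate queries over a fact table. The following is a minimal sketch using Python's built-in `sqlite3` module. The table names (`fact_crash`, `dim_city`), their columns, and the sample rows are all hypothetical and only illustrate the shape of such queries; the project's actual model and database may differ.

```python
import sqlite3

# Hypothetical star schema: table and column names are illustrative
# assumptions, not the project's actual model.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_city (
    city_sk   INTEGER PRIMARY KEY,
    city_name TEXT NOT NULL
);
CREATE TABLE fact_crash (
    crash_sk        INTEGER PRIMARY KEY,
    city_sk         INTEGER NOT NULL REFERENCES dim_city(city_sk),
    crash_hour      INTEGER NOT NULL,   -- 0-23, derived from the crash timestamp
    persons_injured INTEGER NOT NULL,
    persons_killed  INTEGER NOT NULL
);
""")
conn.executemany(
    "INSERT INTO dim_city VALUES (?, ?)",
    [(1, "New York"), (2, "Chicago"), (3, "Austin")],
)
# Made-up sample rows, present only so the queries can run.
conn.executemany(
    "INSERT INTO fact_crash (city_sk, crash_hour, persons_injured, persons_killed) "
    "VALUES (?, ?, ?, ?)",
    [(1, 8, 1, 0), (1, 17, 0, 0), (1, 17, 2, 1),
     (2, 17, 0, 0), (2, 9, 1, 0), (3, 23, 0, 1)],
)


def accidents_per_city(conn):
    """Objectives 1 and 4: total crashes and crashes with injuries, per city."""
    return conn.execute("""
        SELECT c.city_name,
               COUNT(*)                   AS total_crashes,
               SUM(f.persons_injured > 0) AS crashes_with_injury
        FROM fact_crash f
        JOIN dim_city c ON c.city_sk = f.city_sk
        GROUP BY c.city_name
        ORDER BY total_crashes DESC, c.city_name
    """).fetchall()


def peak_hours(conn, top_n):
    """Objective 6: the hours of day with the most crashes across all cities."""
    return conn.execute("""
        SELECT crash_hour, COUNT(*) AS crashes
        FROM fact_crash
        GROUP BY crash_hour
        ORDER BY crashes DESC, crash_hour
        LIMIT ?
    """, (top_n,)).fetchall()


print(accidents_per_city(conn))
print(peak_hours(conn, 1))
```

The same queries could feed a Tableau or Power BI dashboard directly, or be rewritten for MySQL or SQL Server with minor syntax changes (for example, replacing the boolean `SUM` with a `CASE` expression).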

Project Timeline:

  • Part 1:
    • Tasks:
      • Data profiling using Alteryx or ydata-profiling
      • Analysis document
      • Data staging (Staging tables)
      • ETL jobs using Talend
      • Incorporation of standard practices
      • Dimensional modeling (Facts and Dimensions)
  • Part 2:
    • Tasks:
      • Staging to Integration
      • Validation of dimensional data
      • Query dimensional data model for business questions
  • Part 3:
    • Tasks:
      • Visualization using Tableau and Power BI
      • Report publication (optional)
      • Submission of screenshots and source workbooks
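
To make the data profiling task in Part 1 concrete, here is a small sketch of the kind of per-column summary a profiling tool produces: row counts, null counts, and distinct counts. It uses only the Python standard library as a stand-in and is not how Alteryx or ydata-profiling work internally. The column names and sample rows are made up, and the function assumes well-formed rows (no extra fields beyond the header).

```python
import csv
import io


def profile_rows(rows):
    """Return per-column row, null, and distinct counts for an iterable of dict rows.

    Empty or whitespace-only strings are counted as nulls.
    """
    stats = {}
    for row in rows:
        for col, value in row.items():
            s = stats.setdefault(col, {"rows": 0, "nulls": 0, "values": set()})
            s["rows"] += 1
            if value is None or value.strip() == "":
                s["nulls"] += 1
            else:
                s["values"].add(value.strip())
    return {
        col: {"rows": s["rows"], "nulls": s["nulls"], "distinct": len(s["values"])}
        for col, s in stats.items()
    }


# Made-up sample standing in for a city's crash export file.
sample_csv = io.StringIO(
    "crash_id,crash_date,borough\n"
    "1,2023-01-05,BROOKLYN\n"
    "2,2023-01-05,\n"
    "3,,QUEENS\n"
)
profile = profile_rows(csv.DictReader(sample_csv))
for column, summary in profile.items():
    print(column, summary)
```

The `rows` figure from such a profile can also serve as the baseline when reconciling staging table row counts against the source files.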

Project Deliverables:

Project Notes:

  • Configure at least one dimension as a Type 2 Slowly Changing Dimension (SCD2)
  • Handle null values appropriately
  • Maintain Source DIM table and audit columns
  • Ensure row counts in the loaded tables match the row counts of the source files
  • Submit as a team, with one person responsible for submission
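
The SCD2 and null-handling notes above can be sketched as follows. This is an illustrative example using Python's built-in `sqlite3` module, not the project's Talend job. The dimension `dim_location`, its columns, the `"Unknown"` default for nulls, and the sample values are all assumptions chosen for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical dimension with SCD2 tracking columns and one audit column.
conn.execute("""
CREATE TABLE dim_location (
    location_sk    INTEGER PRIMARY KEY AUTOINCREMENT,  -- surrogate key
    location_id    TEXT NOT NULL,                      -- natural key from the source
    district       TEXT NOT NULL,                      -- tracked attribute
    source_system  TEXT NOT NULL,                      -- audit column
    effective_from TEXT NOT NULL,
    effective_to   TEXT,                               -- NULL while the row is current
    is_current     INTEGER NOT NULL
)""")


def apply_scd2(conn, location_id, district, source_system, load_date):
    """Apply one incoming record using Type 2 logic.

    Returns "inserted", "changed", or "unchanged".
    """
    # Null handling: replace missing or blank values with a placeholder (assumed default).
    district = (district or "").strip() or "Unknown"
    current = conn.execute(
        "SELECT location_sk, district FROM dim_location "
        "WHERE location_id = ? AND is_current = 1",
        (location_id,),
    ).fetchone()
    if current and current[1] == district:
        return "unchanged"
    if current:
        # Expire the existing version instead of overwriting it.
        conn.execute(
            "UPDATE dim_location SET effective_to = ?, is_current = 0 "
            "WHERE location_sk = ?",
            (load_date, current[0]),
        )
    conn.execute(
        "INSERT INTO dim_location "
        "(location_id, district, source_system, effective_from, effective_to, is_current) "
        "VALUES (?, ?, ?, ?, NULL, 1)",
        (location_id, district, source_system, load_date),
    )
    return "changed" if current else "inserted"


results = [
    apply_scd2(conn, "L1", "North", "NYC", "2024-01-01"),
    apply_scd2(conn, "L1", "North", "NYC", "2024-02-01"),
    apply_scd2(conn, "L1", "Central", "NYC", "2024-03-01"),
    apply_scd2(conn, "L2", None, "CHI", "2024-03-01"),
]
history = conn.execute(
    "SELECT district, effective_from, effective_to, is_current "
    "FROM dim_location WHERE location_id = 'L1' ORDER BY location_sk"
).fetchall()
print(results)
print(history)
```

Expiring the old row and inserting a new one preserves history, so facts loaded before the change can still join to the version of the dimension that was valid at the time.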

Project Support:

  • Reach out for any clarification or assistance required.
  • Utilize provided templates for mapping documents if needed.
