Skip to content
@DataForgeOpenAIHub

Color of the Roofs

Welcome to DataForge OpenAI Hub 🚀

DataForge OpenAI Hub is dedicated to building practical, real-world solutions leveraging the power of machine learning, MLOps, and data analytics pipelines. Explore our two main projects below!


Repositories Overview

1. End-to-End Data Pipeline: Steam Sales Analysis [Steam-Sales-Analysis] 🎮

This repository implements an end-to-end ETL pipeline for analyzing Steam sales data. It retrieves, processes, and stores gaming metadata and sales data, offering insightful trends and performance analysis.

  • ETL Pipeline: Automates the retrieval, validation, and processing of data from SteamSpy and Steam APIs.
  • Data Storage: Stored in a MySQL database hosted on Aiven Cloud.
  • Visualization: Interactive Tableau dashboards provide insights into gaming trends.
  • Automation: Fully automated using CLI-based orchestration.
  • Open-source contribution: Deployed and maintained a PyPI package/library (open-source contribution).

📂 Technologies Used: Python, MySQL, Tableau, Steam API, Typer CLI, ETL

🔗 View the repo


2. End-to-End ML Project: Credit Card Fraud Detection [mlops-credit-card-fraud-detection-end-to-end] 🛡️

This repository presents a comprehensive machine learning project that tackles credit card fraud detection using MLOps best practices. Highlights include:

  • Ensemble Models: Combines multiple algorithms for better accuracy in detecting fraudulent transactions.
  • Data & Model Versioning: Managed with DVC to ensure consistent and reproducible results.
  • CI/CD Pipelines: Implemented with GitHub Actions to automate the workflow.
  • Deployment: Model deployment to production with robust monitoring.

📂 Technologies Used: Python, Jupyter, Scikit-learn, DVC, GitHub Actions, Docker

🔗 View the repo


Getting Started

Clone the respective repository and follow the setup instructions provided in each project.

# Clone the Data Pipeline project
git clone https://github.com/DataForgeOpenAIHub/Steam-Sales-Analysis.git

# Clone the ML project
git clone https://github.com/DataForgeOpenAIHub/mlops-credit-card-fraud-detection-end-to-end.git

Popular repositories Loading

  1. mlops-credit-card-fraud-detection-end-to-end mlops-credit-card-fraud-detection-end-to-end Public

    End to End Machine Learning MLOps Project for Credit Card Fraud Detection using Ensemble Models, Data and Model Versioning through DVC, Github Actions, and Deployment

    Jupyter Notebook 1 1

  2. Steam-Sales-Analysis Steam-Sales-Analysis Public

    This repository features an ETL pipeline for retrieving, processing, validating, and ingesting game metadata and sales data from SteamSpy and Steam APIs. Data is stored in a MySQL database on Aiven…

    Jupyter Notebook

  3. .github .github Public

    GitHub profile of this organization.

Repositories

Showing 3 of 3 repositories
  • mlops-credit-card-fraud-detection-end-to-end Public

    End to End Machine Learning MLOps Project for Credit Card Fraud Detection using Ensemble Models, Data and Model Versioning through DVC, Github Actions, and Deployment

    DataForgeOpenAIHub/mlops-credit-card-fraud-detection-end-to-end’s past year of commit activity
    Jupyter Notebook 1 MIT 1 0 0 Updated Dec 26, 2024
  • .github Public

    GitHub profile of this organization.

    DataForgeOpenAIHub/.github’s past year of commit activity
    0 0 0 0 Updated Nov 5, 2024
  • Steam-Sales-Analysis Public

    This repository features an ETL pipeline for retrieving, processing, validating, and ingesting game metadata and sales data from SteamSpy and Steam APIs. Data is stored in a MySQL database on Aiven Cloud and visualized using Tableau dashboards for insightful analysis of gaming trends and sales performance.

    DataForgeOpenAIHub/Steam-Sales-Analysis’s past year of commit activity
    Jupyter Notebook 0 MIT 0 0 0 Updated Oct 22, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…