etl-pipelines

This repo contains the DAGs that run on my local Airflow environment. I use the local environment to test my DAGs before deploying them to virtual machines via Kubernetes.

python airflow automation orchestration dags etl-pipelines

Updated Oct 30, 2022
Python

siddarthaThentu / Disaster-Response-Pipeline

A deployed machine learning model that automatically classifies incoming disaster messages into 36 related categories. Developed as part of Udacity's Data Science Nanodegree program.

bootstrap flask machine-learning plotly python3 data-analytics hyperparameter-optimization feature-engineering ensemble-models ml-pipelines etl-pipelines

Updated Jun 10, 2021
Python

Guilherme-B / baboon

JSON-driven ETL pipeline framework prototype

json dag bonobo etl-pipelines

Updated Mar 25, 2020
Python

juniors90 / PymaciesArg

An extension that registers all pharmacies in Argentina.

python datascience argentina pharmacy etl-framework etl-pipeline etl-job pharmacies pypi-package etl-automation etl-pipelines

Updated Oct 16, 2022
Python

extralo / loom

Weaving together different threads (services such as image/audio conversion, ETL services, etc.) to enable the World Wide Flow.

etl-framework etl-pipelines flow-architectures

Updated Dec 26, 2023
JavaScript

speedbits / LimitlessETL

A Python- and Spark-based ETL framework. It operates within speed limits (that is, the framework and its standards) yet offers boundless possibilities.

etl etl-framework etl-pipeline etl-job etl-pipelines

Updated Apr 1, 2024
Python

SayamAlt / Formula-1-Data-Ingestion-Transformation---ETL-Pipeline

This project demonstrates a complete ETL pipeline for Formula 1 racing data using Azure Databricks, Delta Lake, and Azure Data Factory. It covers data ingestion, transformation with PySpark and Spark SQL, data governance with Unity Catalog, and visualization through Power BI. Designed to showcase real-world data engineering workflows in Azure.

data-transformation data-engineering spark-streaming data-ingestion spark-sql spark-mllib microsoft-azure databricks-notebooks azure-databricks delta-lake workflow-orchestration etl-pipelines azure-data-lake-storage-gen2

Updated Nov 14, 2024
Python

etl-pipelines

Here are 16 public repositories matching this topic...

yobix-ai / extractous

patterns-app / patterns-devkit

level-vc / useful

Chek0rrdn / DataEngineer_ETL

abrahamkoloboe27 / Airflow-Pipeline-Dashboard-Compagnie-Aerienne

ChristianRCanlas / ChristianRCanlas.github.io

angelxd84130 / Airflow-ETL

prneidhardt / Apache-Data-Pipeline

EmmanuelEzenwere / AutoDATA-prep

omar-elmaria / airflow_local

siddarthaThentu / Disaster-Response-Pipeline

Guilherme-B / baboon

juniors90 / PymaciesArg

extralo / loom

speedbits / LimitlessETL

SayamAlt / Formula-1-Data-Ingestion-Transformation---ETL-Pipeline
