A streamlined data pipeline that extracts cryptocurrency trade data (BTC, SOL, ETH, BNB, USDT, USDC) from Binance, orchestrates the process with Apache Airflow, and stores the data on Amazon EC2 for processing and on Amazon S3 for archiving.
- Python
- Apache Airflow
- Binance API key and secret
- Amazon EC2 Instance
- Amazon S3 Bucket
- Binance Trade Data Extraction: Uses the Binance API to fetch cryptocurrency trade data.
- Airflow Orchestration: An Apache Airflow DAG schedules and manages the data extraction process.
- Amazon EC2 Storage: Stores processed data on an Amazon EC2 instance.
- Amazon S3 Archiving: Archives and backs up data on Amazon S3, so storage scales with data volume.
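
The extraction and archiving steps listed above could look roughly like the sketch below. It is a minimal illustration, not the project's actual code: the trading pairs in `SYMBOLS`, the helper names (`trades_url`, `fetch_recent_trades`, `trades_to_csv`, `s3_key`), and the S3 key layout are all assumptions. It calls Binance's public recent-trades endpoint (`GET /api/v3/trades`), which does not need the API key and secret; the real pipeline may use an authenticated endpoint instead.

```python
import csv
import io
import json
from datetime import datetime, timezone
from urllib.parse import urlencode
from urllib.request import urlopen

BASE_URL = "https://api.binance.com"  # Binance Spot REST base URL

# Assumed trading pairs; the README names assets, not pairs.
SYMBOLS = ["BTCUSDT", "SOLUSDT", "ETHUSDT", "BNBUSDT", "USDCUSDT"]

# Subset of the fields returned for each trade by /api/v3/trades.
FIELDS = ["id", "price", "qty", "quoteQty", "time", "isBuyerMaker"]


def trades_url(symbol, limit=500):
    """Build the recent-trades URL for one symbol."""
    query = urlencode({"symbol": symbol, "limit": limit})
    return f"{BASE_URL}/api/v3/trades?{query}"


def fetch_recent_trades(symbol, limit=500):
    """Fetch recent trades as a list of dicts (performs a network call)."""
    with urlopen(trades_url(symbol, limit), timeout=10) as resp:
        return json.load(resp)


def trades_to_csv(trades):
    """Serialize trade dicts to CSV text, keeping only FIELDS."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=FIELDS, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(trades)
    return buf.getvalue()


def s3_key(symbol, run_time):
    """Hypothetical S3 object key: one prefix per symbol and day."""
    return f"trades/{symbol}/{run_time:%Y-%m-%d}/{run_time:%H%M%S}.csv"
```

In an Airflow DAG, a task could loop over `SYMBOLS`, call `fetch_recent_trades`, write the output of `trades_to_csv` to a file on the EC2 instance, and then upload that file to the bucket under `s3_key(symbol, datetime.now(timezone.utc))`, for example with boto3's `upload_file`.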