Skip to content
This repository has been archived by the owner on Mar 25, 2024. It is now read-only.

Latest commit

 

History

History
51 lines (34 loc) · 1.11 KB

README.md

File metadata and controls

51 lines (34 loc) · 1.11 KB

Data-Streaming-Pipeline

A step by step guide to building a highly scalable data streaming pipeline in Python. Article is avialable here

Architecture

image

Installation

  1. Install the Redis and run it locally.

  2. Clone the repository.

    git clone https://github.com/iam-mhaseeb/Data-Streaming-Pipeline.git
    
  3. Install the requirements.

    pip install -r requirements.txt
    
  4. You are good to go!

Quick start

  1. Start the producer quotes_spider:
    cd producer
    scrapy crawl quotes
    
  2. Start the consumer quotes_consumer:
    cd consumer
    python quotes_consumer.py
    
  3. See data flowing between pipeline as soon as data is generated from the producer.

Authors

Licensing

The project is MIT Licenced.