Skip to content

Latest commit

 

History

History
61 lines (41 loc) · 1.44 KB

README.md

File metadata and controls

61 lines (41 loc) · 1.44 KB

Instacart Scraper

Simple spider and workflow system for scraping Instacart's (US) default store for a given user.

Table of Contents

How to run

First, you have to setup your environment variables. Samples (.env.sample and .db.env.sample) were provided.

Then, execute (Docker and Docker Compose are required):

$ make run

Done 😄

Note: Recaptcha solving may fail. Retries are already in place, but in rare cases they are insufficient. In these cases, you try and run again.

How to check collected data

If you want to run a query in the database, execute:

$ POSTGRES_USER=<YOUR-POSTGRES-USER> QUERY=<YOUR-QUERY> make sql-query

However, to make things easier, a shortcut to make a SELECT * on all tables is available through:

$ POSTGRES_USER=<YOUR-POSTGRES-USER> make sql-select-all

How to destroy the created environment

This project uses Docker. To destroy created images, volumes, etc., execute:

$ make destroy

For developers

To setup you developer environment, create a virtualenv and execute:

$ make dev-setup

This project uses pre-commit for managing code formatting and pip-tools to manage dependencies.