A Python web crawler framework built on Beautiful Soup for HTML parsing and mechanize for browser emulation. It is used with R to analyze apartment price data by neighborhood in the Boston area, collected from renthop.com.
- You will need Python 3 installed on the local system
- Create a virtual environment using
python3 -m venv /path/to/env
- Activate the virtual environment using
source /path/to/env/bin/activate
- Clone the repository from GitHub using
git clone <repository_url>
- Move into the main project folder using
cd apt-data-crawler
- Install the requirements using
pip install -r requirements.txt
- Run the apartment crawler script with
python renthop-crawler.py
The apt-data-crawler app relies on the HTML structure of the renthop.com website, which may change at any time, so the crawler may stop working if the site is updated. Thank you for reading.
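Because the crawler depends on markup that can change, it may help to parse defensively. The following is a minimal sketch using Beautiful Soup, not the code in renthop-crawler.py. The class names (`listing`, `price`, `neighborhood`), the `parse_listings` helper, and the sample HTML are all hypothetical, since the actual renthop.com markup is not described here. It also assumes whole-dollar prices.

```python
from bs4 import BeautifulSoup

# Hypothetical class names: the real renthop.com markup is not shown in
# this README, so treat these as placeholders to adapt.
LISTING_CLASS = "listing"
PRICE_CLASS = "price"
NEIGHBORHOOD_CLASS = "neighborhood"


def parse_listings(html):
    """Return (listings, skipped) extracted from an HTML string.

    Listings missing an expected element are counted in `skipped`
    instead of raising, so a markup change shows up as a high skip
    count rather than a crash.
    """
    soup = BeautifulSoup(html, "html.parser")
    listings = []
    skipped = 0
    for card in soup.find_all("div", class_=LISTING_CLASS):
        price_tag = card.find(class_=PRICE_CLASS)
        hood_tag = card.find(class_=NEIGHBORHOOD_CLASS)
        if price_tag is None or hood_tag is None:
            skipped += 1
            continue
        # Keep only the digits, e.g. "$3,200" -> "3200"
        # (assumes whole-dollar prices).
        digits = "".join(ch for ch in price_tag.get_text() if ch.isdigit())
        if not digits:
            skipped += 1
            continue
        listings.append(
            {
                "neighborhood": hood_tag.get_text(strip=True),
                "price": int(digits),
            }
        )
    return listings, skipped


# Made-up sample markup for illustration only.
SAMPLE_HTML = """
<div class="listing">
  <span class="neighborhood">Back Bay</span>
  <span class="price">$3,200</span>
</div>
<div class="listing">
  <span class="neighborhood">Allston</span>
</div>
"""

listings, skipped = parse_listings(SAMPLE_HTML)
print(listings, skipped)
```

If the skip count suddenly rises between runs, that is a reasonable signal that the site's structure has changed and the class names need updating.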
Michael McCarthy <michemcc@outlook.com>
https://www.linkedin.com/in/michemcc/