Smichoff Scraper

This project is a web scraper that gathers climbing route data from the WallOnSight website. For each route it extracts the holds, difficulty, sector, line number, setter, creation date, character, height, and planned minimum durability. The data is saved to a JSON file for archival purposes.

Disclaimer

This project was created solely for personal, non-commercial use. The data is scraped from the WallOnSight website. The scraper uses only public data and does not access any unauthorized information. In case of any issues or concerns regarding this scraper, please contact me, and I will promptly take it down.

Contact

For any inquiries or issues, please reach out to me at nikinayzer@gmail.com.

Features

Scrapes route information from the WallOnSight website.
Extracts detailed information from individual route pages, including height and planned minimum durability.
Handles pagination to gather data from all available pages.
Archives routes that have been removed from the website by marking them with an archive flag.
Avoids duplicating existing routes by checking against previously saved data.
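
The pagination, archiving, and de-duplication behaviour listed above could be sketched roughly as follows. This is a minimal illustration, not the code in the repository: `collect_all_pages`, `merge_routes`, and the `fetch_page` callable are hypothetical names, and the sketch assumes that an empty page marks the end of the listing and that `id` uniquely identifies a route.

```python
from typing import Callable, Dict, List

Route = Dict[str, object]


def collect_all_pages(fetch_page: Callable[[int], List[Route]],
                      max_pages: int = 1000) -> List[Route]:
    """Call fetch_page(1), fetch_page(2), ... until a page comes back empty.

    fetch_page is a hypothetical callable that returns the routes found on
    one listing page; a real scraper would do the HTTP request and parsing.
    """
    routes: List[Route] = []
    for page in range(1, max_pages + 1):
        batch = fetch_page(page)
        if not batch:  # assumption: an empty page means we ran past the end
            break
        routes.extend(batch)
    return routes


def merge_routes(saved: List[Route], scraped: List[Route]) -> List[Route]:
    """Combine previously saved routes with freshly scraped ones.

    - Saved routes still on the website are kept unchanged (no duplicates).
    - Saved routes missing from the new scrape get "archive": True.
    - Routes seen for the first time are appended with "archive": False.
    """
    scraped_ids = {r["id"] for r in scraped}
    merged: List[Route] = []
    for route in saved:
        kept = dict(route)  # copy so the caller's data is not mutated
        if kept["id"] not in scraped_ids:
            kept["archive"] = True
        merged.append(kept)
    known_ids = {r["id"] for r in saved}
    for route in scraped:
        if route["id"] not in known_ids:
            merged.append({**route, "archive": False})
            known_ids.add(route["id"])
    return merged
```

Passing the page fetcher in as a callable keeps the merge logic testable without network access.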

Usage

Clone the repository:

git clone https://github.com/Nikinayzer/smichoff-scraper.git
cd smichoff-scraper

Install the required packages:
```
pip install requests beautifulsoup4
```
Run the scraper:
```
python web_scraper.py
```
View the results: The scraped data is saved in the routes.json file in the project directory.
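
Once the file exists, it can be inspected with the standard library. The sketch below assumes `routes.json` holds a list of objects with an `archive` flag; `summarize_routes` is a hypothetical helper, not part of the repository.

```python
import json
from pathlib import Path


def summarize_routes(path="routes.json"):
    """Return (active, archived) route counts from the saved JSON file."""
    routes = json.loads(Path(path).read_text(encoding="utf-8"))
    # A missing or false "archive" flag is treated as an active route
    archived = sum(1 for r in routes if r.get("archive"))
    return len(routes) - archived, archived
```

Calling `summarize_routes()` from the project directory returns the two counts as a tuple.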

File Structure

web_scraper.py: Main script to scrape data from the WallOnSight website.
routes.json: JSON file where the scraped data is saved.

Example JSON Output

[
    {
        "id": "7351",
        "holds": "Blue Holds",
        "name": "Route Name",
        "difficulty": "5.10",
        "sector": "A (big wall)",
        "line_number": "1",
        "setter": "Setter Name",
        "creation_date": "07.05.2024",
        "character": "Technical",
        "height": "25m",
        "planned_until": "15.08.2024",
        "archive": false
    },
    ...
]
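
Every value except `archive` is stored as a string, so dates and heights need converting before they can be compared or sorted. A minimal sketch, assuming the `DD.MM.YYYY` date format and the `m` suffix shown in the example hold for every record; `parse_route` is a hypothetical helper.

```python
from datetime import datetime


def parse_route(route):
    """Return a copy of route with dates and height converted to typed values."""
    parsed = dict(route)
    for key in ("creation_date", "planned_until"):
        # Assumed format: day.month.year, e.g. "07.05.2024"
        parsed[key] = datetime.strptime(route[key], "%d.%m.%Y").date()
    # Assumed format: a number followed by "m", e.g. "25m"
    parsed["height"] = float(route["height"].rstrip("m"))
    return parsed


example = {
    "id": "7351",
    "creation_date": "07.05.2024",
    "height": "25m",
    "planned_until": "15.08.2024",
    "archive": False,
}
route = parse_route(example)
# Planned lifetime of the route as a timedelta
lifetime = route["planned_until"] - route["creation_date"]
```

Records that deviate from these assumed formats would raise a `ValueError`, which is usually preferable to silently storing a wrong value.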
