Scrape Trustpilot utilising Smartproxy's Web Scraping API
glom
BeautifulSoup
Once you have an active Web Scraping API subscription, you can set your password in the dashboard under Web Scraping API > Authentication method. You can also try sending a request right from the dashboard: open the Web Scraping API > API playground tab and click Send Request. An example curl request is generated on the right.
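For readers who prefer Python to curl, here is a minimal sketch of how such a request could be assembled. It assumes the API accepts HTTP Basic authentication with your username and password; the endpoint URL and the `target`/`url` payload fields are assumptions as well, so copy the real values from the curl example in the API playground. The request is only built here, not sent.

```python
import base64
import json
import urllib.request

# Assumed endpoint; replace it with the one shown in the API playground.
API_URL = "https://scrape.smartproxy.com/v1/tasks"


def build_request(username, password, target_url):
    """Build (but do not send) a POST request that uses HTTP Basic auth."""
    token = base64.b64encode(f"{username}:{password}".encode()).decode()
    # Assumed payload fields; check them against the generated curl example.
    payload = json.dumps({"target": "universal", "url": target_url}).encode()
    return urllib.request.Request(
        API_URL,
        data=payload,
        method="POST",
        headers={
            "Authorization": f"Basic {token}",
            "Content-Type": "application/json",
        },
    )


request = build_request(
    "USERNAME", "PASSWORD", "https://www.trustpilot.com/review/example.com"
)

# To actually send it, uncomment the lines below:
# with urllib.request.urlopen(request) as response:
#     print(response.read().decode())
```

Replace `USERNAME` and `PASSWORD` with the credentials from the Authentication method page before sending.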
Parser type | Example location | Download |
---|---|---|
HTML to JSON | Trustpilot_parser.py | curl https://raw.githubusercontent.com/Smartproxy/trustpilot_python_scraper/blob/main/Trustpilot_parser.py > Trustpilot_parser.py |
JSON to JSON | trustpilot_json_parser_glom.py | curl https://raw.githubusercontent.com/Smartproxy/trustpilot_python_scraper/blob/main/trustpilot_json_parser_glom.py > trustpilot_json_parser_glom.py |
This Python script extracts review text, review dates, and star ratings straight from the HTML of the Trustpilot website and saves them to a JSON file.
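The general idea of HTML to JSON extraction can be sketched as follows. This is not the code from `Trustpilot_parser.py`: it uses Python's built-in `html.parser` as a stand-in for BeautifulSoup so that it runs without extra dependencies, and the sample markup, class names, and `data-rating` attribute are hypothetical. Trustpilot's real page structure will differ.

```python
import json
from html.parser import HTMLParser

# Hypothetical markup; Trustpilot's real tags and attributes will differ.
SAMPLE_HTML = """
<article class="review">
  <div class="rating" data-rating="5"></div>
  <time datetime="2023-01-05">Jan 5, 2023</time>
  <p class="review-text">Great service, fast delivery.</p>
</article>
<article class="review">
  <div class="rating" data-rating="2"></div>
  <time datetime="2023-02-11">Feb 11, 2023</time>
  <p class="review-text">Support was slow to respond.</p>
</article>
"""


class ReviewParser(HTMLParser):
    """Collect text, date and rating for each <article class="review">."""

    def __init__(self):
        super().__init__()
        self.reviews = []
        self._in_text = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "article" and attrs.get("class") == "review":
            # A new review starts; later tags fill in its fields.
            self.reviews.append({"text": "", "date": None, "rating": None})
        elif not self.reviews:
            return
        elif tag == "div" and "data-rating" in attrs:
            self.reviews[-1]["rating"] = int(attrs["data-rating"])
        elif tag == "time":
            self.reviews[-1]["date"] = attrs.get("datetime")
        elif tag == "p" and attrs.get("class") == "review-text":
            self._in_text = True

    def handle_endtag(self, tag):
        if tag == "p":
            self._in_text = False

    def handle_data(self, data):
        # Only keep character data found inside the review text paragraph.
        if self._in_text:
            self.reviews[-1]["text"] += data


def parse_reviews(html):
    parser = ReviewParser()
    parser.feed(html)
    return parser.reviews


reviews = parse_reviews(SAMPLE_HTML)
print(json.dumps(reviews, indent=2))
```

To save the result to a file instead of printing it, pass an open file handle to `json.dump(reviews, handle, indent=2)`.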
This Python script extracts dozens of data points from a JSON document that gets loaded when you visit Trustpilot. Data points include, but are not limited to, business information (name, URL, website, location, etc.), reviews (text, rating, name, positive/negative), and ratings.
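A JSON to JSON parser of this kind mostly amounts to picking values out of a deeply nested document by path and placing them into a flatter one. The sketch below illustrates that with a small `get_path` helper written for this example; glom offers dotted-path lookups of the same kind, but the helper here is a stdlib stand-in and not glom's API. The input keys and the "rating of 4 or more counts as positive" rule are hypothetical.

```python
import json
from functools import reduce

# Hypothetical shape; the JSON that Trustpilot loads will use other keys.
RAW = {
    "businessUnit": {
        "displayName": "Example Shop",
        "websiteUrl": "https://example.com",
    },
    "reviews": [
        {"text": "Great service", "rating": 5, "consumer": {"displayName": "Ann"}},
        {"text": "Slow support", "rating": 2, "consumer": {"displayName": "Bob"}},
    ],
}


def get_path(data, path, default=None):
    """Follow a dotted path such as 'a.b.c' through nested dicts."""
    try:
        return reduce(lambda node, key: node[key], path.split("."), data)
    except (KeyError, TypeError):
        return default


def reshape(raw):
    """Map the nested input onto a flatter output document."""
    return {
        "name": get_path(raw, "businessUnit.displayName"),
        "website": get_path(raw, "businessUnit.websiteUrl"),
        "reviews": [
            {
                "author": get_path(review, "consumer.displayName"),
                "text": get_path(review, "text"),
                "rating": get_path(review, "rating"),
                # Assumed rule: a rating of 4 or 5 counts as positive.
                "positive": (get_path(review, "rating") or 0) >= 4,
            }
            for review in get_path(raw, "reviews", [])
        ],
    }


result = reshape(RAW)
print(json.dumps(result, indent=2))
```

Missing keys resolve to the default instead of raising, which helps when some reviews lack optional fields.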