Skip to content

This Python-based Amazon Scraper is designed to efficiently extract detailed product data from Amazon's product pages. The tool leverages powerful libraries like BeautifulSoup4 and csv, along with the Scrapingant API to simulate browser behavior and bypass Amazon’s anti-scraping algorithms.

Notifications You must be signed in to change notification settings

mr-mudgal/Amazon-Scrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Amazon-Scrapper

This is an Amazon Scrapper using Python, using scrapingant_client, and BeautifulSoup4 and csv modules in order to request the product pages, and extract the data from them respectively. Data extracted are:

  • Product URL
  • Product Name
  • Product Price
  • Product Description
  • Ratings
  • Number of Reviews
  • ASIN
  • Manufacturer

It extract 20 product listing pages. It then extract each single product page.

Initailly the data is stored in a dictionary variable, and then it is converted, and written into a file in CSV format.

It uses Scrapingant API as amazon have algorithm to detect the scrapping script, and makes its service unavailable in response to a scrapping script. API help us to simulate a real browser behaviour, hence bypassing the amazon algorithms, and allowing us to extract the data.

The whole software is divided into two parts, the first part extract products from product listing page, with their url, name, price, rating and number of review detail. In part 2, we extract every single product page, to obtain its description, asin and manufacturer.

Output File:

About

This Python-based Amazon Scraper is designed to efficiently extract detailed product data from Amazon's product pages. The tool leverages powerful libraries like BeautifulSoup4 and csv, along with the Scrapingant API to simulate browser behavior and bypass Amazon’s anti-scraping algorithms.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages