- pandas: Used for data storage and manipulation.
- BeautifulSoup: Used for interpreting the HTML document.
- requests: Used to communicate with the web page.
- sqlite3: Used to create the database instance.

This project focuses on web scraping using the `requests` and `BeautifulSoup` libraries to extract the contents of a web page. The primary objective is to analyze the HTML code of a webpage, identify relevant information, and extract it for further use.
- Utilize the `requests` library to fetch the HTML content of a specified web page.
- Use `BeautifulSoup` to parse and analyze the HTML code of the fetched web page.
- Identify and extract relevant information from the HTML code based on specified criteria.
- Save the extracted information in the required format (CSV).
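
The objectives above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the target of the extraction (table rows made of `<td>` cells), the column names, the `SAMPLE_HTML` snippet, and the helper names `fetch_html`, `extract_rows`, and `save_csv` are all assumptions chosen for the example. A real page would need its own selection criteria.

```python
import pandas as pd
import requests
from bs4 import BeautifulSoup

# Hypothetical stand-in for a fetched page, so the sketch runs offline.
SAMPLE_HTML = """
<table>
  <tr><th>Name</th><th>Value</th></tr>
  <tr><td>alpha</td><td>1</td></tr>
  <tr><td>beta</td><td>2</td></tr>
</table>
"""


def fetch_html(url: str) -> str:
    """Download a page with requests and return its HTML text."""
    response = requests.get(url, timeout=10)
    response.raise_for_status()  # raise on 4xx/5xx instead of parsing an error page
    return response.text


def extract_rows(html: str, columns: list[str]) -> pd.DataFrame:
    """Parse the HTML and collect the <td> text of each table row."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for tr in soup.find_all("tr"):
        cells = [td.get_text(strip=True) for td in tr.find_all("td")]
        if cells:  # header rows contain only <th>, so they are skipped here
            rows.append(cells)
    return pd.DataFrame(rows, columns=columns)


def save_csv(frame: pd.DataFrame, path: str) -> None:
    """Write the extracted data to a CSV file without the index column."""
    frame.to_csv(path, index=False)


if __name__ == "__main__":
    # Assumed column names; swap SAMPLE_HTML for fetch_html(url) on a live page.
    frame = extract_rows(SAMPLE_HTML, ["name", "value"])
    print(frame.to_csv(index=False))
```

Keeping the fetch, parse, and save steps in separate functions means the extraction logic can be checked against a saved HTML snippet without touching the network. All extracted values are strings at this point; any type conversion would be a later step in pandas.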