
A project made for practicing web scraping with Python


Shyuu7/WebScraping


A project in which I practiced web scraping using Python and BeautifulSoup. I had to scrape event data from a real site (I chose https://www.turismo.gov.br/agenda-eventos/views/calendario.php, an official government site) and then save that data into a SQL database. When I found out the website had some dynamic elements that I also needed to access, I automated the process with Selenium, which allowed me to simulate the mouse clicks necessary to make those elements visible.
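The flow described above could look roughly like the sketch below. It is not the project's actual code: the markup in `SAMPLE_HTML`, the CSS selectors, and the function names `render_page` and `parse_events` are all assumptions made for illustration, since the real page structure is not shown here. Selenium is only used to click the elements that hide content and hand the final HTML to BeautifulSoup.

```python
from bs4 import BeautifulSoup

# Hypothetical markup and sample values; the real site's HTML will differ.
SAMPLE_HTML = """
<div class="event">
  <h3 class="event-name">Festival de Inverno</h3>
  <span class="event-date">2024-07-12</span>
  <span class="event-location">Ouro Preto, MG</span>
</div>
<div class="event">
  <h3 class="event-name">Feira do Livro</h3>
  <span class="event-date">2024-11-01</span>
  <span class="event-location">Porto Alegre, RS</span>
</div>
"""


def render_page(url, toggle_selector):
    """Open the page in a browser, click every toggle, return the final HTML."""
    # Imported here so parse_events() works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome()
    try:
        driver.implicitly_wait(10)  # give dynamic elements time to appear
        driver.get(url)
        # toggle_selector is a hypothetical CSS selector for the clickable items.
        for toggle in driver.find_elements(By.CSS_SELECTOR, toggle_selector):
            toggle.click()
        return driver.page_source
    finally:
        driver.quit()


def parse_events(html):
    """Extract one dict per event card from the given HTML."""
    soup = BeautifulSoup(html, "html.parser")
    events = []
    for card in soup.select("div.event"):
        events.append({
            "name": card.select_one(".event-name").get_text(strip=True),
            "date": card.select_one(".event-date").get_text(strip=True),
            "location": card.select_one(".event-location").get_text(strip=True),
        })
    return events


# Parsing works on any HTML string, so it can be tried without a browser.
events = parse_events(SAMPLE_HTML)
```

Against the live site, the HTML would come from something like `render_page(url, ".some-toggle")` instead of `SAMPLE_HTML`, with the selector adjusted to whatever the page actually uses.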

The site lists all cultural events through the end of the current year, which added up to around 400 events.

The script only runs if one or more database tables are empty. So if a fresh set of data is needed, we only need to create a new table or delete an existing one.
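A minimal sketch of that guard, assuming SQLite through Python's built-in `sqlite3` module (the README only says "SQL database"). The table names and the helper `needs_scraping` are hypothetical:

```python
import sqlite3

# Hypothetical table names; the project's real schema is not shown here.
TABLES = ("events", "locations", "metadata")


def create_tables(conn):
    """Create the tables if they are missing, so the emptiness check can run."""
    for table in TABLES:
        conn.execute(
            f"CREATE TABLE IF NOT EXISTS {table} "
            "(id INTEGER PRIMARY KEY, value TEXT)"
        )


def needs_scraping(conn):
    """Return True if at least one table has no rows."""
    for table in TABLES:
        row = conn.execute(f"SELECT 1 FROM {table} LIMIT 1").fetchone()
        if row is None:
            return True
    return False


conn = sqlite3.connect(":memory:")
create_tables(conn)
before = needs_scraping(conn)  # all tables are empty at this point

for table in TABLES:
    conn.execute(f"INSERT INTO {table} (value) VALUES (?)", ("sample",))
after = needs_scraping(conn)  # every table now has a row
```

With a check like this, dropping or emptying any one table is enough to make the next run scrape again.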

By the end of the script, I had 3 tables full of data such as event names, dates, locations and metadata, which then allowed me to run some SQL queries.
