This project is built using:
• Streamlit
• Snscrape
• Pandas
• Pymongo
• Datetime (Python standard library)
Twitter Scraper
Below is the dashboard, built with Streamlit, for scraping Twitter data.
The inputs to the dashboard are:
- Keyword / Hashtag to be scraped
- From date
- To date
- Maximum tweet count
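The four inputs above can be checked before any scraping starts. The following is only a minimal sketch: the function name `validate_inputs` and the specific rules (non-empty keyword, From date not later than To date, count of at least 1) are assumptions for illustration, not necessarily the checks the dashboard performs.

```python
from datetime import date
from typing import List


def validate_inputs(keyword: str, start: date, end: date, max_tweets: int) -> List[str]:
    """Return a list of problems; an empty list means the inputs look usable.

    The rules below are illustrative assumptions, not the project's exact checks.
    """
    problems = []
    if not keyword.strip():
        problems.append("Keyword or hashtag must not be empty.")
    if start > end:
        problems.append("From date must not be later than To date.")
    if max_tweets < 1:
        problems.append("Maximum tweet count must be at least 1.")
    return problems


if __name__ == "__main__":
    # Example: a reversed date range is reported as a problem.
    for message in validate_inputs("#python", date(2023, 2, 1), date(2023, 1, 1), 100):
        print(message)
```

In a Streamlit app, each returned message could be shown with `st.error` before the scrape button does any work.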
Output 1: The scraped data is shown as a table according to the inputs above. The start and end dates are validated and displayed on the dashboard.
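One possible way to turn the inputs into table rows with Snscrape is sketched below. The helper names `build_query` and `scrape_rows` and the chosen columns are hypothetical; the sketch assumes the `since:` and `until:` search operators and Snscrape's `TwitterSearchScraper`. Whether Snscrape can still reach Twitter depends on the installed version and on the current state of the site, so treat this as an outline rather than the project's actual code.

```python
import itertools
from datetime import date


def build_query(keyword: str, start: date, end: date) -> str:
    """Combine the keyword and date range into a Twitter search query string."""
    return f"{keyword} since:{start.isoformat()} until:{end.isoformat()}"


def scrape_rows(keyword: str, start: date, end: date, max_tweets: int) -> list:
    """Collect up to max_tweets tweets as a list of dictionaries (one per row)."""
    # Imported lazily so build_query stays usable without snscrape installed.
    import snscrape.modules.twitter as sntwitter

    scraper = sntwitter.TwitterSearchScraper(build_query(keyword, start, end))
    rows = []
    # islice stops the generator once the maximum tweet count is reached.
    for tweet in itertools.islice(scraper.get_items(), max_tweets):
        rows.append({
            "date": tweet.date,
            "id": tweet.id,
            "url": tweet.url,
            "user": tweet.user.username,
            # Newer snscrape releases expose rawContent; older ones use content.
            "content": getattr(tweet, "rawContent", None) or getattr(tweet, "content", None),
        })
    return rows
```

The resulting list can be passed to `pandas.DataFrame(rows)` and displayed with `st.dataframe`.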
Output 2: Two download options are provided to download the data in CSV and JSON format.
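The two download formats can be produced from the same list of rows. The sketch below deliberately uses only the standard library (`csv`, `json`) so it runs without extra packages; the project itself uses Pandas, where `df.to_csv(index=False)` and `df.to_json(orient="records")` would be the closer equivalents. The function names are hypothetical.

```python
import csv
import io
import json


def rows_to_csv(rows: list) -> str:
    """Serialize a list of dictionaries to CSV text, using the first row's keys as the header."""
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def rows_to_json(rows: list) -> str:
    """Serialize the rows to JSON text; default=str converts dates and other non-JSON types."""
    return json.dumps(rows, default=str, indent=2)


if __name__ == "__main__":
    sample = [{"id": 1, "content": "hello"}]
    print(rows_to_csv(sample))
    print(rows_to_json(sample))
```

Either string can then be handed to `st.download_button` as its `data` argument, with a matching `file_name` and `mime` type.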
Output 3: An Upload to MongoDB option saves all the scraped data for future reference. NOTE: To upload the scraped data to MongoDB, first download and install MongoDB Community Server.
Output 4: The scraped data is uploaded to MongoDB under the twitter_scraping database and the scraped_data collection. It is stored as a single dictionary document inside the collection, along with the hashtag or keyword used to scrape the data and the current timestamp.
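A single-document upload of this kind might look like the sketch below. The database and collection names come from the description above; the field names (`keyword`, `scraped_at`, `scraped_data`), the function names, and the local connection URI are assumptions made for illustration.

```python
from datetime import datetime, timezone


def build_document(keyword: str, rows: list) -> dict:
    """Wrap the scraped rows in one dictionary with the keyword and a timestamp."""
    return {
        "keyword": keyword,                        # hypothetical field name
        "scraped_at": datetime.now(timezone.utc),  # hypothetical field name
        "scraped_data": rows,                      # hypothetical field name
    }


def upload_document(document: dict, uri: str = "mongodb://localhost:27017"):
    """Insert the document into twitter_scraping.scraped_data and return its id.

    The default URI assumes a MongoDB Community Server running locally.
    """
    # Imported lazily so build_document works without pymongo installed.
    from pymongo import MongoClient

    client = MongoClient(uri)
    try:
        collection = client["twitter_scraping"]["scraped_data"]
        return collection.insert_one(document).inserted_id
    finally:
        client.close()
```

Because everything is stored in one document, very large scrapes could run into MongoDB's per-document size limit; inserting one document per tweet is a common alternative.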
Installations required
Pip
• pip install streamlit
• pip install snscrape
• pip install pandas
• datetime is part of the Python standard library and needs no installation
• pip install pymongo
MongoDB
• MongoDB Compass
https://www.mongodb.com/docs/compass/current/install/
• MongoDB Community Server
https://www.mongodb.com/try/download/community
Run Streamlit
streamlit run main.py