- Website Used for Scraping : http://fbookshelf.herokuapp.com/
- Scraped Data file : scraping_hackathon_dataFinal.csv
- Libraries Used : beautifulsoap4, pandas, numpy, matplotlib
- Frameworks Used : RASA(For Chatbot), Django(For REST API)
- Replace Encoding from 'windows-1252' to 'utf-8'
- Changes during conversion i.e replace
maxVotedSet.NumbeofVotes = maxVotedSet.NumbeofVotes.replace({',': ''}, regex=True) => maxVotedSet.NumbeofVotes = maxVotedSet.NumbeofVotes.replace(',', '', regex=True)
pagesSet.NumberofPages = pagesSet.NumberofPages.replace({None:0.0}, regex=True) => pagesSet.NumberofPages = pagesSet.NumberofPages.replace('None',0.0, regex=True)