NLP and topic depictions for forum data
This project aims to improve the Piazza forum by providing an auto-generated trending page and FAQ.
- Reddit extraction: see experiment.py for how the Reddit data files were extracted
- Topic extraction: see machine_learning.py for how the topics are extracted
- Dash visualization: see dash_main.py for how the results are visualized in a locally deployed Dash app
Clone this repository to your local machine and run dash_main.py. This starts a Flask app server (lazy loading) on your machine; open the port specified (default is 8050) in your browser.
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.
- Python 3
- pandas
- sklearn
- dash
pip install pandas scikit-learn dash
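To confirm the prerequisites are available before running the app, a small check like the following can help. It is a sketch, not part of the repository: `missing_modules` and `REQUIRED_MODULES` are hypothetical names, and the list uses import names (for example `sklearn`), which can differ from the pip package names.

```python
import importlib.util

# Import names taken from the prerequisites list above (assumed, not read from the repo).
REQUIRED_MODULES = ["pandas", "sklearn", "dash"]


def missing_modules(modules):
    """Return the top-level module names that cannot be found by this interpreter."""
    return [name for name in modules if importlib.util.find_spec(name) is None]
```

Calling `missing_modules(REQUIRED_MODULES)` returns an empty list when everything is installed, and otherwise the names that still need to be installed.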
To get your application running, you will need to:
- Clone this repository
- Install relevant libraries (refer to prerequisites)
- Run dash_main.py with Python 3
- Access the server (localhost:8050)
To deploy this project, you may want to use Heroku (https://www.heroku.com/) as your cloud hosting platform.
- Dash - The data visualization framework used
- Flask - The web library used
- Pandas - Data structure and analysis tools used
- Vincent Tatan - Initial work
- Ranon Sim - Teammate
This project was started by an individual programmer. Feel free to use it; no license is applied.
- Hat tip to anyone whose code was used
- Inspiration
- etc