Owner: Hoang-Dung Nguyen (Nguyễn Hoàng Dũng). Fresh Graduate from Ho Chi Minh City University of Technology, from Computer Science, but are in love with Data Science.
Hope that the set of projects will inspire you guys who are also having passion with Data Science. I know this is a vast field, and I am sometimes dizzy as well when learning this and knowing that something I have learned would not benefit me in getting a job (I am still somehow unemployed, so please give me a job to embrace myself), but I believe the journey to become a Data Scientist will grow gradually in me.
The repository is divided into topics, and each topic's material is contained within a folder. There are 4 aspects covered across the topics, and I will index it in this README.md
file.
Since it is a set of mini-projects, there is no or just a little overlap between projects. However, it should be a must to combine ideas I have learned in these projects to make a capstone project (which I believe should have been conducted after every 20 mini-projects).
Skills: Data Analysis, Mathematics, Statistics, Pandas, Numpy, Scipy
- Topic 1 - Predict Hotel Cancellations
- Topic 2 - Predict Student Performance From Game Play (from Kaggle)
- Topic 3 - Electric Moped Reviews (for DataCamp DS Associate Certificate)
- Topic 4 - Portfolio Risk Management
- Topic 11 - Recommendation Systems
Skills: Big Data Processing, MLOps
- Topic 5 - Crawling News with Scrapy
- Topic 6 - Crawling Dynamic Website with Selenium
- Topic 7 - Big Data Processing with PySpark
- Topic 12 - MLFlow
- Topic 13 - LangChain
Skills: PyTorch, Tensorflow, Python programming
- Topic 8 - Graph Neural Networks
- Topic 9 - Generative Adversarial Networks
- Topic 10 - BERT and Large Language Models (using Hugging Face)
Skills: Writing and Understanding