Skip to content

dfighter1312/data-science-collection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

58 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

My Data Science Collection

Owner: Hoang-Dung Nguyen (Nguyễn Hoàng Dũng). Fresh Graduate from Ho Chi Minh City University of Technology, from Computer Science, but are in love with Data Science.

Hope that the set of projects will inspire you guys who are also having passion with Data Science. I know this is a vast field, and I am sometimes dizzy as well when learning this and knowing that something I have learned would not benefit me in getting a job (I am still somehow unemployed, so please give me a job to embrace myself), but I believe the journey to become a Data Scientist will grow gradually in me.

Structure of the repository

The repository is divided into topics, and each topic's material is contained within a folder. There are 4 aspects covered across the topics, and I will index it in this README.md file.

Since it is a set of mini-projects, there is no or just a little overlap between projects. However, it should be a must to combine ideas I have learned in these projects to make a capstone project (which I believe should have been conducted after every 20 mini-projects).

Exploratory Data Analysis

Skills: Data Analysis, Mathematics, Statistics, Pandas, Numpy, Scipy

  1. Topic 1 - Predict Hotel Cancellations
  2. Topic 2 - Predict Student Performance From Game Play (from Kaggle)
  3. Topic 3 - Electric Moped Reviews (for DataCamp DS Associate Certificate)
  4. Topic 4 - Portfolio Risk Management
  5. Topic 11 - Recommendation Systems

Data Engineering

Skills: Big Data Processing, MLOps

  1. Topic 5 - Crawling News with Scrapy
  2. Topic 6 - Crawling Dynamic Website with Selenium
  3. Topic 7 - Big Data Processing with PySpark
  4. Topic 12 - MLFlow
  5. Topic 13 - LangChain

Machine Learning Algorithms

Skills: PyTorch, Tensorflow, Python programming

  1. Topic 8 - Graph Neural Networks
  2. Topic 9 - Generative Adversarial Networks
  3. Topic 10 - BERT and Large Language Models (using Hugging Face)

Research Topics

Skills: Writing and Understanding

  1. Topic 14 - Background and Considerations for LLMOps

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published