TU-Data-Science

This repository contains the labs and assignments for two courses at Taibah University.

Data Mining:

  • lab1: Writing mathematical formulas using Markdown cells in Jupyter Notebook
  • lab2: Python basics
  • lab3: Python OOP
  • lab4: NumPy arrays
  • lab5: EDA - working on dataset - data visualization
  • lab6: PCA
  • lab7: K-Means clustering algorithm
  • lab8: Exercise: calculate the confidence
  • lab9: Bernoulli naive Bayes - Gaussian naive Bayes
  • lab10: Linear regression - logistic regression
  • project: Complete clustering analysis on the "Facebook Live Sellers in Thailand" dataset

Information Retrieval:

  • lab1: Exercises: tokenization - frequency count
  • lab2: Tokenization methods - dealing with stop words
  • lab3: Regular expressions
  • lab4: Stemming
  • lab5: Similarity measures
  • lab6: Unigram inverted index
  • lab7: Positional index
  • lab8: Compute TF-IDF scores in Python