Skip to content

This notebook contains the usage of Pyspark to build machine learning classifiers (note that almost ml_algorithm supported by Pyspark are used in this notebook)

Notifications You must be signed in to change notification settings

ghanmi-hamza/Machine-learning-with-PySpark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Machine learning Using Pyspark

This is a helpful notebook which contains:
  • Creating a Spark application.
  • Using Spark sql for manipulating the dataframe
  • Using data processing steps (feature encoding, scaling, selection ...)
  • Training and Testing ml-models
In This notebook I tested almost machine learning algorithms:
  • DECISION TREE
  • Deep Learning Multilayer Perceptron
  • NAIVE BAYES
  • LOGISTIC REGRESSION
  • One-vs-Rest
  • RANDOM FOREST

Note that this notebook was made in google colab so you don't need to install any package just run cells in google colab + I used keystroke data and Touch data

Helpful notebook, Yeah!

About

This notebook contains the usage of Pyspark to build machine learning classifiers (note that almost ml_algorithm supported by Pyspark are used in this notebook)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published