usman87626/PySpark-2.4.5-Notebook

PySpark-2.4.5-Notebook

This notebook is intended for learning PySpark, from beginner to advanced level.

SPARKIFY_DATASET:

The dataset is a .json file that records timestamped events for the following actions performed on a digital music service:

  1. Playing a song
  2. Logging in
  3. Listening to an advertisement
  4. Downgrading a subscription
  5. Cancelling a subscription

There are 3 different sizes of the dataset available:

  1. mini_sparkify_event_data.json: the smallest instance of the dataset (125 MB)
  2. medium-sparkify-event-data.json: a medium-sized instance of the dataset (237 MB)
  3. sparkify_event_data.json: the full dataset (12 GB)
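
Loading one of these files into PySpark could look roughly like the sketch below. It is not taken from the notebook: the `data` directory, the `dataset_path` and `load_events` helpers, and the `"mini"`/`"medium"`/`"full"` keys are hypothetical names chosen for illustration, and it assumes the file holds one JSON object per line, which is what `spark.read.json` expects by default.

```python
from pathlib import Path

# File names as listed above; the short keys are illustrative, not part of the dataset.
DATASETS = {
    "mini": "mini_sparkify_event_data.json",
    "medium": "medium-sparkify-event-data.json",
    "full": "sparkify_event_data.json",
}


def dataset_path(size, data_dir="data"):
    """Return the path of the chosen dataset inside data_dir (assumed location)."""
    if size not in DATASETS:
        raise ValueError(f"unknown size {size!r}; expected one of {sorted(DATASETS)}")
    return Path(data_dir) / DATASETS[size]


def load_events(spark, size="mini", data_dir="data"):
    """Read the event log into a DataFrame; spark is an existing SparkSession."""
    return spark.read.json(str(dataset_path(size, data_dir)))


if __name__ == "__main__":
    # Imported here so the path helper can be used without a Spark installation.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("sparkify-load").getOrCreate()
    events = load_events(spark, "mini")
    events.printSchema()   # inspect the columns Spark inferred from the JSON
    print(events.count())  # number of events in the file
    spark.stop()
```

Starting with the mini file keeps schema inference and first experiments fast on a single machine; the same call should work for the larger files once the code behaves as expected.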

Download Links for Datasets:
