Big_Data_SPARK_AWS_EMR_CIS-9760_Analyzing_IMDB_Datasets

I provisioned a Spark cluster on AWS EMR, connected it to a Jupyter Notebook, and ran a series of queries (in Python, using the DataFrame API or Spark SQL) that answer a few simple questions about the available IMDB data.
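As a rough illustration of the two query styles mentioned above, here is a minimal PySpark sketch. It is not the project's actual notebook code: the S3 path, the file names (`title.basics.tsv`, `title.ratings.tsv`), the column names, and the helper names `top_rated_sql` and `run` are all assumptions made for the example, based on the layout of the public IMDb dataset files.

```python
# Sketch only: the S3 path, file names, column names, and helper names below
# are assumptions for illustration, not taken from this project.


def top_rated_sql(min_votes: int, limit: int = 10) -> str:
    """Build a Spark SQL query over temp views named `basics` and `ratings`."""
    return (
        "SELECT b.primaryTitle, r.averageRating, r.numVotes "
        "FROM basics b JOIN ratings r ON b.tconst = r.tconst "
        f"WHERE b.titleType = 'movie' AND r.numVotes >= {int(min_votes)} "
        f"ORDER BY r.averageRating DESC LIMIT {int(limit)}"
    )


def run(data_root: str) -> None:
    # Imported here so the query builder above works without Spark installed.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    # On an EMR notebook a session usually exists already; getOrCreate reuses it.
    spark = SparkSession.builder.appName("imdb-sketch").getOrCreate()

    def read_tsv(name: str):
        # IMDb files are tab-separated and mark missing values with \N.
        return (
            spark.read.option("sep", "\t")
            .option("header", True)
            .option("nullValue", "\\N")
            .csv(f"{data_root}/{name}")
        )

    basics = read_tsv("title.basics.tsv")
    ratings = (
        read_tsv("title.ratings.tsv")
        .withColumn("averageRating", F.col("averageRating").cast("double"))
        .withColumn("numVotes", F.col("numVotes").cast("int"))
    )

    # DataFrame API: how many titles are there of each type?
    basics.groupBy("titleType").count().orderBy(F.desc("count")).show()

    # Spark SQL: register temp views, then run the query built above.
    basics.createOrReplaceTempView("basics")
    ratings.createOrReplaceTempView("ratings")
    spark.sql(top_rated_sql(min_votes=10000)).show(truncate=False)


if __name__ == "__main__":
    run("s3://your-bucket/imdb")  # hypothetical bucket path
```

In a notebook attached to the cluster, calling `run(...)` with the real data location would print both results. The same question can generally be expressed in either style, so the choice between the DataFrame API and Spark SQL is mostly a matter of readability.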