Big_Data_SPARK_AWS_EMR_CIS-9760_Analyzing_IMDB_Datasets

I provisioned a Spark cluster on AWS EMR, connected it to a Jupyter Notebook, and then ran a series of queries (in Python, using the DataFrame API or Spark SQL) that answer a few simple questions about the available IMDB data.