Spark sessions help beginning students dissect the first larger projects of the curriculum.
Updated Nov 1, 2022 - Shell
In this section, we perform customer segmentation on the Flo dataset using PySpark.
Uses Apache Spark and PySpark to analyze a movie dataset. Tasks include data exploration, identifying top-rated movies, training a linear regression model, and experimenting with Airflow.