- Batch Processing using Apache Spark and Python3 for data exploration
- Dataset was downloded from https://www.kaggle.com/
- Focusing on Pyspark SQL libraries
- from pyspark.sql.types import BooleanType
- from pyspark.sql.functions import udf
- from pyspark.sql import functions as F
- from pyspark.sql import SparkSession
- from pyspark.sql import Window
-
Notifications
You must be signed in to change notification settings - Fork 1
essien1990/Apache-Spark
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Batch Processing using Apache Spark and Python for data exploration
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published