Skip to content

Latest commit

 

History

History
26 lines (20 loc) · 772 Bytes

README.md

File metadata and controls

26 lines (20 loc) · 772 Bytes

pyspark-pictures

Learn the pyspark API through pictures and simple examples

RDD Example:

example image

# flatMap
x = sc.parallelize([1,2,3])
y = x.flatMap(lambda x: (x, 100*x, x**2))
print(x.collect())
print(y.collect())

[1, 2, 3]
[1, 100, 1, 2, 200, 4, 3, 300, 9]

References

pyspark API

Contribute

Contributors are welcome
Original images are here, download to pdf, convert to svg with: genSVD (pdf2svg)