Skip to content

byuibigdata/pyspark-pictures

 
 

Repository files navigation

pyspark-pictures

Learn the pyspark API through pictures and simple examples

RDD Example:

example image

# flatMap
x = sc.parallelize([1,2,3])
y = x.flatMap(lambda x: (x, 100*x, x**2))
print(x.collect())
print(y.collect())

[1, 2, 3]
[1, 100, 1, 2, 200, 4, 3, 300, 9]

References

pyspark API

Contribute

Contributors are welcome
Original images are here, download to pdf, convert to svg with: genSVD (pdf2svg)

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.8%
  • Shell 0.2%