My solutions for the assignments of the course Supercomputing for Big Data (ET4310) in TU Delft. There were three parts implemented in Spark:
- Using Spark for In-Memory Computation.
- Perform specific data analytics on Kevin Bacon to identify the actors (both male and female) linked to him by the six degrees of separation.
- Implementation of DNA analysis pipeline