In this series of posts, I take a fresh look at Apache Spark and investigate its
applicability to a smaller problem (which in time may grow into a “true” big data problem).
The companion Github project
contains the sample code and installation instructions.
The series starts by introducing Spark and the bus time table case study.