Apache Spark is a fast and general-purpose cluster computing system, initially developed as AMPLab / UC Berkley as part of the Berkeley Data Analytics Stack (BDAS), http://en.wikipedia.org/wiki/UC_Berkeley. It provides high-level APIs for the following programming languages that make large, concurrent parallel jobs easy to write and deploy [12:11]:
Link to latest information
The URLs as any reference to Apache Spark may change in future versions.
The core element of Spark is Resilient Distributed Dataset (RDD), which is a collection of elements ...