Big Data Tools and Pipelines

Ideas and resources related to data tools.

Running Spark on Alluxio with S3

Calvin Jia presents an in-depth overview of Alluxio and its role in the big data ecosystem. In this segment, he reviews examples that show how Alluxio complements Spark and S3, to enable fast data access.