Video description
Alluxio is the solution of choice for big companies who need to manage data at multi-petabyte scale. In this course, PMC member Calvin Jia offers a full-blown Alluxio tour to any data scientist, developer or system administrator looking to improve the performance of their workloads, develop applications with Alluxio, or deploy and manage Alluxio clusters.
He offers a high level view (why Alluxio was developed, the problems it solves, who uses it, etc.) as well as a hands-on practicum. You'll set-up your own deployment (locally and in a cluster) using a compute framework on top of Alluxio, connecting it to multiple persistent data stores while preserving one namespace. Take this course and you'll come away knowing the benefits Alluxio brings to big data stacks.
- Understand the features and benefits of Alluxio and master the basics of how to use it
- Discover why companies like Intel, Baidu, and Alibaba use Alluxio for their big data needs
- Learn how the storage unification layer bridges computation frameworks and storage systems
- Gain practical experience deploying Alluxio in local and cluster modes
- Learn how to use Alluxio tools like the command line and the web UI
- Explore the Alluxio open source ecosystem and learn who the players are
Publisher resources
Table of contents
-
Introduction
- About Alluxio And The Course 00:03:38
- About The Author 00:01:24
-
Using Alluxio Locally
- Downloading Alluxio 00:03:03
- Starting The System Locally 00:05:09
- Interacting Via The Shell 00:02:45
- Browsing The Web UI 00:03:53
-
Examples With Alluxio
- Setting Up Alluxio With Spark And S3 00:06:15
- Running Spark on Alluxio with S3 00:05:29
- Using Alluxio With Unified Namespace 00:06:05
-
Deploying Alluxio On A Cluster
- Deploying Alluxio In AWS 00:07:49
- Conclusion
Product information
- Title: Introduction to Alluxio
- Author(s):
- Release date: June 2016
- Publisher(s): Infinite Skills
- ISBN: 9781771376006
You might also like
book
Expert Hadoop® Administration
The Comprehensive, Up-to-Date Apache Hadoop Administration Handbook and Reference “Sam Alapati has worked with production Hadoop …
article
Three Ways to Sell Value in B2B Markets
As customers face pressure to reduce costs while maintaining profitability, value-based selling (VBS) has become critical …
article
Run Llama-2 Models Locally with llama.cpp
Llama is Meta’s answer to the growing demand for LLMs. Unlike its well-known technological relative, ChatGPT, …
article
Use GitHub Copilot: Additional Tips
Using GitHub Copilot can feel like magic. The tool automatically fills out entire blocks of code--but …