Patrick Wendell

Spark 1.1 and Beyond!

Date: This event took place live on October 02 2014

Presented by: Patrick Wendell

Duration: Approximately 60 minutes.

Cost: Free

Questions? Please send email to




This webcast is no longer available for viewing.

Description:

Hosted By: Ben Lorica

In this webcast, Patrick Wendell from Databricks will be speaking about Spark's new 1.1 release. This release includes significant extensions to Spark's SQL, MLlib and Streaming libraries. It also adds several performance and robustness improvements to Spark's core engine. Patrick will also cover Spark internals and other more advanced concepts regarding Spark's internal execution to explain what has changed. This talk will focus on providing lower level details to help users who are performance-testing or debugging Spark, or trying out new Spark applications.

About Patrick Wendell

Patrick Wendell is an engineer at Databricks as well as a Spark Committer and PMC member. In the Spark project, Patrick has acted as release manager for several Spark releases, including Spark 1.0 and 1.1.

Patrick also maintains several subsystems of Spark's core engine. Before helping start Databricks, Patrick obtained an M.S. in Computer Science at UC Berkeley. His research focused on low latency scheduling for large scale analytics workloads. He holds a B.S.E in Computer Science from Princeton University.

Twitter: @pwendell

About Ben Lorica

Ben Lorica is the Chief Data Scientist and Director of Content Strategy for Data at O'Reilly Media, Inc.. He has applied Business Intelligence, Data Mining, Machine Learning and Statistical Analysis in a variety of settings including Direct Marketing, Consumer and Market Research, Targeted Advertising, Text Mining, and Financial Engineering. His background includes stints with an investment management company, internet startups, and financial services. He is an advisor to Databricks.