Apache Spark 1.3 and Spark’s New DataFrame API

Date: This event took place live on March 25, 2015.

Presented by: Patrick Wendell

Duration: Approximately 60 minutes.

Cost: Free

Questions? Please send email to




This webcast is no longer available for viewing.

Description:

Hosted by: Ben Lorica

In this webcast, Patrick Wendell from Databricks will speak about the new Spark 1.3 release. Spark 1.3 brings extensions to all of Spark's major components (SQL, MLlib, Streaming) along with a new cross-cutting DataFrame API. The talk will outline what's new in Spark 1.3 and provide a deep dive on the DataFrame feature. We'll leave plenty of time for Q&A about the release or about Spark in general.

About Patrick Wendell

Patrick Wendell is an engineer at Databricks as well as a Spark committer and PMC member. In the Spark project, Patrick has acted as release manager for several Spark releases, including Spark 1.0. Patrick also maintains several subsystems of Spark's core engine. Before helping start Databricks, Patrick obtained an M.S. in Computer Science at UC Berkeley. His research focused on low-latency scheduling for large-scale analytics workloads. He holds a B.S.E. in Computer Science from Princeton University.

Twitter: @pwendell

About Ben Lorica

Ben Lorica is the Chief Data Scientist and Director of Content Strategy for Data at O'Reilly Media, Inc. He has applied business intelligence, data mining, machine learning, and statistical analysis in a variety of settings, including direct marketing, consumer and market research, targeted advertising, text mining, and financial engineering. His background includes stints with an investment management company, internet startups, and financial services. He is an advisor to Databricks.

Twitter: @bigdata
