O'Reilly logo

Apache Spark for Data Science Cookbook by Padma Priya Chitturi

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 10. Working with SparkR

In this chapter, we'll cover the following recipes:

  • Introduction
  • Installing R
  • Interactive analysis with the SparkR shell
  • Creating a SparkR standalone application from RStudio
  • Creating SparkR DataFrames
  • SparkR DataFrame operations
  • Applying user-defined functions in SparkR
  • Running SQL queries from SparkR and caching DataFrames
  • Machine learning with SparkR

Introduction

R is a flexible, open source, and powerful statistical programming language. It is preferred by many professional statisticians and researchers in a variety of fields. It has extensive statistical and graphical capabilities. R combines the aspects of functional and object-oriented programming. One of the key features of R is implicit looping, which yields compact, ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required