O'Reilly logo

Pro Spark Streaming: The Zen of Real-Time Analytics Using Apache Spark by Zubair Nabi

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Index

A

  1. addAccumulator method

  2. addInPlace method

  3. Alternating least square (ALS)

  4. awaitTermination()method

B

  1. Batch processing

  2. Big Data systems, Spark

    1. acyclic graph

    2. canonical word-count

    3. MapReduce programming model

    4. Samza messages

    5. sensor network

    6. SQL to NoSQL

    7. stream-processing system

    8. Web 2.0 applications

    9. local Execution

    10. .sbt file

    11. standalone cluster mode

    12. YARN

C

  1. cache() function

  2. Call data record (CDR)

  3. Case-class method

  4. Cassandra Query Language (CQL)

  5. ChiSqSelector

  6. Chi-square selection

  7. Clickstream Dataset

  8. Collaborative filtering

  9. compute() method

  10. createCombiner function

  11. Custom receiver

    1. HttpInputDStream

    2. receiver interface method

D

  1. Data frame

    1. avoid shuffling

    2. cache aggressively

    3. MLlib

    4. persistence

    5. query transformation

      1. action

      2. aggregation expression

      3. cube operation

      4. DataFrameNaFunctions

      5. DataFrameStatFunctions ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required