book

Analytics for the Internet of Things (IoT)

by Andrew Minteer

July 2017

Beginner to intermediate

378 pages

10h 26m

English

Packt Publishing

Read now

Unlock full access

Content preview from Analytics for the Internet of Things (IoT)

Thinking about a single machine versus a cluster of machines

Designing distributed computing analytics requires that you think what can be run in parallel and what has to be run one step after another. Running computations in parallel is where a lot of the speed advantage comes from in cluster computing systems such as Spark. But it does require a little different thinking.

Think in terms of how to split up an analytics job into actions that can be run either record by record or on a small subset of records, without needing to know what is going on elsewhere in the full dataset. A simple example is a word count exercise.

Imagine you have millions of rows of survey results and need to analyze the survey question: "Comment about the insightfulness ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Big Data Analytics for Internet of Things

Publisher Resources

ISBN: 9781787120730Supplemental Content

Analytics for the Internet of Things (IoT)

by Andrew Minteer

Thinking about a single machine versus a cluster of machines

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

You might also like

Big Data Analytics for Internet of Things

IoT and Analytics Condition Based Maintenance

Hands-On Industrial Internet of Things

Internet of Things

Publisher Resources