Chapter 2

Exploring Data Engineering Pipelines and Infrastructure

IN THIS CHAPTER

check Defining big data

check Looking at some sources of big data

check Distinguishing between data science and data engineering

check Hammering down on Hadoop

check Exploring solutions for big data problems

check Checking out a real-world data engineering project

There’s a lot of hype around big data these days, but most people don’t really know or understand what it is or how they can use it to improve their lives and livelihoods. This chapter defines the term big data, explains where big data comes from and how it’s used, and outlines the roles that data engineers and data scientists play in the big data ecosystem. In this chapter, I introduce the fundamental big data concepts that you need in order to start generating your own ideas and plans on how to leverage big data and data science to improve your lifestyle and business workflow ...

Get Data Science For Dummies, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.