May 2016
Beginner
320 pages
10h 39m
English
Chapter 1. Data science in a big data world
Figure 1.1. An Excel table is an example of structured data.
Figure 1.2. Email is simultaneously an example of unstructured data and natural language data.
Figure 1.3. Example of machine-generated data
Figure 1.4. Friends in a social network are an example of graph-based data.
Figure 1.5. The data science process
Figure 1.6. Big data technologies can be classified into a few main components.
Figure 1.7. The end result: the average salary by job description
Figure 1.8. Hortonworks Sandbox running within VirtualBox
Figure 1.9. The Hortonworks Sandbox welcome screen available at http://127.0.0.1:8000