© Mark Wickham 2018
Mark Wickham, Practical Java Machine Learning, https://doi.org/10.1007/978-1-4842-3951-3_2

2. Data: The Fuel for Machine Learning

Mark Wickham
Irving, TX, USA
 
Machine learning is all about data. This chapter will explore the many aspects of data with the goal of meeting the following objectives:
  • Review the data explosion and three megatrends that are making this machine learning revolution possible.

  • Introduce the importance of data and reprogramming yourself to think like a data scientist.

  • Review different categories of data.

  • Review various data formats, including CSV, ARFF, and JSON.

  • Use the OpenOffice Calc program to prepare CSV data.

  • Find and use publicly available data.

  • Introduce techniques for creating your own ...
