 Acquire a real-world set of tools for building enterprise-level data science applications
 Surpass the barriers of other languages in data science and learn to create useful object-oriented code
 Make extensive use of Java-compliant big data tools such as Apache Spark and Hadoop
This book is for Java developers who are looking to perform data analysis in a production environment. Those who wish to implement data analysis in their big data applications will find this book helpful.
 Start from simple analytic tasks on big data
 Get into more complex tasks with predictive analytics on big data using machine learning
 Learn real-time analytic tasks
 Understand the concepts with examples and case studies
 Prepare and refine data for analysis
 Create charts in order to understand the data
 See various real-world datasets
This book covers case studies such as sentiment analysis on a tweet dataset, recommendations on a MovieLens dataset, customer segmentation on an e-commerce dataset, and graph analysis on an actual flights dataset.
This book is an end-to-end guide to implementing analytics on big data with Java. Java is the de facto language for major big data environments, including Hadoop. This book will teach you how to perform analytics on big data with production-friendly Java. The book is divided into two sections. The first part is an introduction that will help readers get acquainted with big data environments, whereas the second part contains an in-depth discussion of the concepts in analytics on big data. It will take you from data analysis and data visualization to the core concepts and advantages of machine learning, real-life usage of regression and classification using Naïve Bayes, a deep discussion of the concepts of clustering, and a review of simple neural networks on big data using Deeplearning4j or plain Java Spark code. This is a must-have book for Java developers who want to start learning big data analytics and use it in the real world.
The approach of this book is to deliver practical learning modules in manageable portions. Each chapter is a self-contained unit on one concept in big data analytics. The book builds competency in big data analytics step by step. The examples use real-world case studies to illustrate real applications and show how to use the techniques covered. The examples and case studies are presented with both theory and code.
Big Data Analytics with Java
1. Big Data Analytics with Java

Why data analytics on big data?
 Big data for analytics
 Distributed computing on Hadoop
 HDFS concepts

Apache Spark
 Concepts
 Transformations
 Actions
 Spark Java API
 Spark samples using Java 8
 Loading data
 Data operations – cleansing and munging
 Analyzing data – count, projection, grouping, aggregation, and max/min
 Actions on RDDs
 Paired RDDs
 Saving data
 Collecting and printing results
 Executing Spark programs on Hadoop
 Apache Spark subprojects
 Spark machine learning modules
 Mahout – a popular Java ML library
 Deeplearning4j – a deep learning library
 Summary

 2. First Steps in Data Analysis
 3. Data Visualization

4. Basics of Machine Learning

What is machine learning?
 Real-life examples of machine learning
 Type of machine learning
 Steps for machine learning problems
 Choosing the machine learning model
 Summary

 5. Regression on Big Data
 6. Naive Bayes and Sentiment Analysis
 7. Decision Trees
 8. Ensembling on Big Data
 9. Recommendation Systems
 10. Clustering and Customer Segmentation on Big Data
 11. Massive Graphs on Big Data
 12. Real-Time Analytics on Big Data
 13. Deep Learning Using Big Data
 Index
 Title: Big Data Analytics with Java
 Author(s):
 Release date: July 2017
 Publisher(s): Packt Publishing
 ISBN: 9781787288980
