Book description
Learn the basics of analytics on big data using Java, machine learning and other big data tools
About This Book
Acquire a real-world set of tools for building enterprise-level data science applications
Overcome the barrier of other languages in data science and learn to create useful object-oriented code
Make extensive use of Java-compatible big data tools such as Apache Spark and Hadoop
Who This Book Is For
This book is for Java developers who want to perform data analysis in a production environment. Those who wish to implement data analysis in their big data applications will also find this book helpful.
What You Will Learn
Start from simple analytic tasks on big data
Get into more complex tasks with predictive analytics on big data using machine learning
Learn real-time analytic tasks
Understand the concepts with examples and case studies
Prepare and refine data for analysis
Create charts in order to understand the data
See various real-world datasets
In Detail
This book covers case studies such as sentiment analysis on a tweet dataset, recommendations on the MovieLens dataset, customer segmentation on an e-commerce dataset, and graph analysis on an actual flights dataset.
This book is an end-to-end guide to implementing analytics on big data with Java. Java is the de facto language for major big data environments, including Hadoop. This book will teach you how to perform analytics on big data with production-friendly Java. The book is divided into two sections. The first part is an introduction that will help readers get acquainted with big data environments, whereas the second part contains an in-depth discussion of the concepts in analytics on big data. It will take you from data analysis and data visualization to the core concepts and advantages of machine learning, real-life usage of regression and classification using Naïve Bayes, a deep discussion of the concepts of clustering, and a review of simple neural networks on big data using Deeplearning4j or plain Java Spark code. This is a must-have book for Java developers who want to start learning big data analytics and use it in the real world.
Style and approach
The approach of this book is to deliver practical learning modules in manageable chunks. Each chapter is a self-contained unit covering one concept in big data analytics. The book builds competency in big data analytics step by step. Examples based on real-world case studies illustrate real applications and show how to use the techniques discussed. The examples and case studies are presented using both theory and code.
Table of contents
- Big Data Analytics with Java
- Table of Contents
- Big Data Analytics with Java
- Credits
- About the Author
- About the Reviewers
- www.PacktPub.com
- Customer Feedback
- Preface
- 1. Big Data Analytics with Java
- Why data analytics on big data?
- Big data for analytics
- Distributed computing on Hadoop
- HDFS concepts
- Apache Spark
- Concepts
- Transformations
- Actions
- Spark Java API
- Spark samples using Java 8
- Loading data
- Data operations – cleansing and munging
- Analyzing data – count, projection, grouping, aggregation, and max/min
- Actions on RDDs
- Paired RDDs
- Saving data
- Collecting and printing results
- Executing Spark programs on Hadoop
- Apache Spark sub-projects
- Spark machine learning modules
- Mahout – a popular Java ML library
- Deeplearning4j – a deep learning library
- Summary
- 2. First Steps in Data Analysis
- 3. Data Visualization
- 4. Basics of Machine Learning
- What is machine learning?
- Real-life examples of machine learning
- Type of machine learning
- Steps for machine learning problems
- Choosing the machine learning model
- Summary
- 5. Regression on Big Data
- 6. Naive Bayes and Sentiment Analysis
- 7. Decision Trees
- 8. Ensembling on Big Data
- 9. Recommendation Systems
- 10. Clustering and Customer Segmentation on Big Data
- 11. Massive Graphs on Big Data
- 12. Real-Time Analytics on Big Data
- 13. Deep Learning Using Big Data
- Index
Product information
- Title: Big Data Analytics with Java
- Author(s):
- Release date: July 2017
- Publisher(s): Packt Publishing
- ISBN: 9781787288980