Big Data Analytics with Java

by RAJAT MEHTA
July 2017
Beginner to intermediate
418 pages
9h 46m
English
Packt Publishing

Dataset

For our case study on customer segmentation using clustering, we will use a dataset for a UK online retail store from the UCI repository of datasets. The store has shared its data with UCI, and the dataset is freely available on the UCI website. The data consists of the transactions that different customers made on the online store. The transactions come from several countries, and the dataset is reasonably large (thousands of rows). Let's go through the attributes of the dataset:

Attribute name | Description
Invoice number | Invoice number; a number uniquely assigned to each transaction
Stock code | Product (item) code; a 5-digit integral number uniquely assigned to each distinct product
Description | Product item name ...
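To make the attributes listed so far concrete, here is a minimal sketch in plain Java that reads one transaction line into a small value class. It is not the book's own code. The class name `RetailRowSketch`, the nested `RetailRow` type, the comma-separated file layout, and the column order (invoice number, stock code, description) are all assumptions made for illustration, and the sample line is invented rather than taken from the dataset.

```java
import java.util.ArrayList;
import java.util.List;

public class RetailRowSketch {

    /** Hypothetical holder for the three attributes listed so far. */
    public static final class RetailRow {
        public final String invoiceNumber; // uniquely assigned to each transaction
        public final String stockCode;     // product (item) code
        public final String description;   // product item name

        public RetailRow(String invoiceNumber, String stockCode, String description) {
            this.invoiceNumber = invoiceNumber;
            this.stockCode = stockCode;
            this.description = description;
        }
    }

    /**
     * Splits one line on commas while keeping commas inside double quotes.
     * Simplification: quote characters are dropped, and escaped quotes ("")
     * are not reproduced in the output.
     */
    public static List<String> splitCsv(String line) {
        List<String> fields = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        boolean inQuotes = false;
        for (int i = 0; i < line.length(); i++) {
            char c = line.charAt(i);
            if (c == '"') {
                inQuotes = !inQuotes;
            } else if (c == ',' && !inQuotes) {
                fields.add(current.toString());
                current.setLength(0);
            } else {
                current.append(c);
            }
        }
        fields.add(current.toString());
        return fields;
    }

    /** Assumes the first three columns are invoice number, stock code, description. */
    public static RetailRow parse(String line) {
        List<String> f = splitCsv(line);
        if (f.size() < 3) {
            throw new IllegalArgumentException("Expected at least 3 columns: " + line);
        }
        return new RetailRow(f.get(0).trim(), f.get(1).trim(), f.get(2).trim());
    }

    public static void main(String[] args) {
        // Invented sample line, not a real row from the dataset.
        RetailRow row = parse("10001,12345,\"Sample item, large\"");
        System.out.println(row.invoiceNumber + " | " + row.stockCode + " | " + row.description);
    }
}
```

The fields are kept as strings so that no assumption is made about how the remaining attributes, or any irregular codes, are formatted. Columns beyond the first three are ignored.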

ISBN: 9781787288980