Data Analysis with Open Source Tools

by Philipp K. Janert

Released November 2010

Publisher(s): O'Reilly Media, Inc.

ISBN: 9780596802356

Book description

Collecting data is relatively easy, but turning raw information into something useful requires that you know how to extract precisely what you need. With this insightful book, intermediate to experienced programmers interested in data analysis will learn techniques for working with data in a business environment. You'll learn how to look at data to discover what it contains, how to capture those ideas in conceptual models, and then feed your understanding back into the organization through business plans, metrics dashboards, and other applications.

Along the way, you'll experiment with concepts through hands-on workshops at the end of each chapter. Above all, you'll learn how to think about the results you want to achieve -- rather than rely on tools to think for you.

Use graphics to describe data with one, two, or dozens of variables
Develop conceptual models using back-of-the-envelope calculations, as well as scaling and probability arguments
Mine data with computationally intensive methods such as simulation and clustering
Make your conclusions understandable through reports, dashboards, and other metrics programs
Understand financial calculations, including the time-value of money
Use dimensionality reduction techniques or predictive analytics to conquer challenging data analysis situations
Become familiar with different open source programming environments for data analysis

"Finally, a concise reference for understanding how to conquer piles of data."--Austin King, Senior Web Developer, Mozilla

"An indispensable text for aspiring data scientists."--Michael E. Driscoll, CEO/Founder, Dataspora
