Skip to Content
Dealing With Real-World Data
on-demand course

Dealing With Real-World Data

with Angie Ma, Gary Willis, Alessandra Stagliano
August 2017
Intermediate
41m
English
Infinite Skills

Overview

This course covers a subject central to the practice of data science and machine learning: the tricky and often overlooked problem of how to deal with real-world data. It provides an overview of the things data scientists think about when gaining access to a data set. You'll learn about data types, data exploration, the curse of dimensionality, PCA, model evaluation, and more, in this pragmatic introduction to the terminology and concepts surrounding data and machine learning. Learners with a basic working knowledge of mathematics will be able to enjoy the course and immediately start working on machine learning problems.

  • Learn to handle the many types of data used in real-world machine learning projects
  • Explore topics like data exploration, the curse of dimensionality, and PCA
  • Understand how to evaluate models and why this is important
  • Learn how to use — and enjoy free access to — the SherlockML data science platform
  • Develop the skills required for the machine learning job market where demand outstrips supply

Angie Ma, Gary Willis, and Alessandra Stagliano are data scientists with ASI Data Science, a London based AI/machine learning solutions firm. Angie co-founded ASI and is also the founder of Data Science Lab London, one of the biggest communities of data scientists and data engineers in Europe, with over 2,500 members. Angie holds a PhD in physics from London's University College, Gary Willis holds a PhD in statistical physics from London's Imperial College, and Alessandra Stagliano holds a PhD in computer science from the University of Genoa. Collectively, the group has worked on over 150 commercial AI/machine learning projects.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Watch now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Database Design Best Practices: Building Scalable Data Solutions to Withstand the Test of Time

Database Design Best Practices: Building Scalable Data Solutions to Withstand the Test of Time

Edward Pollack
Software Architecture Superstream Series: Architecture Meets Data

Software Architecture Superstream Series: Architecture Meets Data

Neal Ford, Mark Richards, Pramod Sadalage, Zhamak Dehghani

Publisher Resources

ISBN: 9781492023876