October 2017
Intermediate to advanced
1159 pages
26h 10m
English
In this section, we'll review where to find additional resources for learning, discussing, presenting, or sharpening our data science skills.
One of the most well-known repositories of machine learning datasets is hosted by the University of California, Irvine. The UCI repository contains over 300 datasets covering a wide variety of challenges, including poker, movies, wine quality, activity recognition, stocks, taxi service trajectories, advertisements, and many others. Each dataset is usually equipped with a research paper where the dataset was used, which can give you a hint on how to start and what is the prediction baseline.
The UCI machine learning repository can be accessed at https://archive.ics.uci.edu ...