Getting a dataset for machine learning
While R has a built-in dataset, the sample size and field of application is limited. Apart from generating data within a simulation, another approach is to obtain data from external data repositories. A famous data repository is the UCI machine learning repository, which contains both artificial and real datasets. This recipe introduces how to get a sample dataset from the UCI machine learning repository.
Ensure that you have completed the previous recipes by installing R on your operating system.
How to do it...
Perform the following steps to retrieve data for machine learning:
- Access the UCI machine learning repository: http://archive.ics.uci.edu/ml/.
- Click on View ALL Data Sets ...