Importing and preprocessing data

As already discussed, Azure ML Studio is a complete ML tool that takes care of every step in the ML model development process. The only input needed is a raw dataset in a format understood by ML Studio; if the original data format is not recognized, then a file conversion is required, using either an external tool or the custom script modules in ML Studio. For raw files, the data formats currently recognized by ML Studio are CSV, TSV, ARFF, SvmLight, and R objects. Datasets can also be saved in zipped format to save storage space and bandwidth.

Datasets can be imported to ML Studio in two ways: by uploading a local file from the user's computer, or using cloud storage in Azure. To import data from your local ...

Get Hands-On Machine Learning with Azure now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.