Importing and preprocessing data

As already discussed, Azure ML Studio is a complete ML tool that takes care of every step in the ML model development process. The only input needed is a raw dataset in a format understood by ML Studio; if the original data format is not recognized, then a file conversion is required, using either an external tool or the custom script modules in ML Studio. For raw files, the data formats currently recognized by ML Studio are CSV, TSV, ARFF, SvmLight, and R objects. Datasets can also be saved in zipped format to save storage space and bandwidth.

Datasets can be imported to ML Studio in two ways: by uploading a local file from the user's computer, or using cloud storage in Azure. To import data from your local ...

Get Hands-On Machine Learning with Azure now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.