Effective Amazon Machine Learning by Alexis Perrier

Examining data statistics

When Amazon ML created the datasource, it ran a basic statistical analysis of the variables. For each variable, it computed whichever of the following apply to that variable's type:

  • Correlation of each attribute to the target
  • Number of missing values
  • Number of invalid values
  • Distribution of numeric variables, shown as a histogram and a box plot
  • Range, mean, and median for numeric variables
  • Most and least frequent categories for categorical variables
  • Word counts for text variables
  • Percentage of true values for binary variables
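To get a feel for what several of these statistics measure, they can be reproduced locally on a small sample. The following is a minimal sketch using only the Python standard library. It is not Amazon ML's implementation: the function names are hypothetical, missing values are assumed to be represented as None, and Pearson correlation is assumed as the correlation measure for illustration.

```python
from collections import Counter
from math import sqrt
from statistics import mean, median


def numeric_summary(values):
    """Missing count, range, mean, and median; None marks a missing value."""
    present = [v for v in values if v is not None]
    return {
        "missing": len(values) - len(present),
        "min": min(present),
        "max": max(present),
        "mean": mean(present),
        "median": median(present),
    }


def categorical_summary(values):
    """Most and least frequent categories, ignoring missing values."""
    counts = Counter(v for v in values if v is not None).most_common()
    return {"most_frequent": counts[0][0], "least_frequent": counts[-1][0]}


def percent_true(values):
    """Percentage of truthy values among the non-missing entries."""
    present = [v for v in values if v is not None]
    return 100.0 * sum(1 for v in present if v) / len(present)


def pearson(xs, ys):
    """Pearson correlation over rows where both values are present.

    Assumption: Pearson is used here only as an example of a
    correlation between an attribute and the target.
    """
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    mx = mean(x for x, _ in pairs)
    my = mean(y for _, y in pairs)
    cov = sum((x - mx) * (y - my) for x, y in pairs)
    sx = sqrt(sum((x - mx) ** 2 for x, _ in pairs))
    sy = sqrt(sum((y - my) ** 2 for _, y in pairs))
    return cov / (sx * sy)
```

For example, numeric_summary([22, 38, None, 35, 54, 2]) reports one missing value and a range of 2 to 54. Comparing such local results with the figures on the data summary page is a quick sanity check that the datasource schema assigned the expected type to each variable.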

Go to the Datasource dashboard and click the datasource you just created to open its data summary page. The left-side menu lets you access data statistics for the target and the different attributes, ...