Basic Statistics and Data Mining for Data Science

Video description

Data science is an ever-evolving field, with exponentially growing popularity. Data science includes techniques and theories extracted from the fields of statistics, computer science, and most importantly machine learning, databases, and visualization.

This video course consists of step-by-step introductions to analyze data and the basics of statistics. The first chapter focuses on the steps to analyze data and which summary statistics are relevant given the type of data you are summarizing. The second chapter continues by focusing on summarizing individual variables and specifically some of the reasons users need to summarize variables. This chapter also illustrates several procedures, such as how to run and interpret frequencies and how to create various graphs. The third chapter introduces the idea of inferential statistics, probability, and hypothesis testing.

The rest of the chapters show you how to perform and interpret the results of basic statistical analyses (chi-square, independent and paired sample t-tests, one-way ANOVA, post-hoc tests, and bivariate correlations) and graphical displays (clustered bar charts, error bar charts, and scatterplots). You will also learn when to use different statistical techniques, how to set up different analyses, and how to interpret the results.

What You Will Learn

  • Get familiar with the basics of analyzing data
  • Exploring the importance of summarizing individual variables
  • Use inferential statistics
  • Know when to perform the Chi-Square test
  • Differentiate between independent and paired samples t-tests
  • Understand when to use a one-way ANOVA and post-hoc tests
  • Get well-versed with correlations


This course is for developers who are interested in entering the field of data science and are looking for a guide to the statistical concepts.

About The Author

Jesus Salcedo: Jesus Salcedo has a PhD in psychometrics from Fordham University. He is an independent statistical consultant and has been using SPSS products for over 20 years. He is a former SPSS Curriculum Team Lead and Senior Education Specialist who has written numerous SPSS training courses and trained thousands of users.

Publisher resources

Download Example Code

Product information

  • Title: Basic Statistics and Data Mining for Data Science
  • Author(s): Jesus Salcedo
  • Release date: December 2017
  • Publisher(s): Packt Publishing
  • ISBN: 9781788476782