Data Science On Your First Day with Python

by Alfredo Deza, Noah Gift
October 2022
Beginner
46m
English
Pragmatic AI Labs

Overview

Be productive day one

In this video course, you will learn how to start your first day as a data scientist.

Topics covered include:

  • creating a GitHub repository
  • using a Colab notebook
  • describing data using df.describe()
  • plotting data with seaborn, lmplot, and distplot
  • comparing cumulative COVID-19 deaths by state in a plot
  • merging a pandas DataFrame with election and sugar consumption data
  • exporting a CSV file and uploading the result from Colab to GitHub
  • continuous integration of a Jupyter Notebook with GitHub Actions
  • creating a Makefile and using GitHub Actions to test Jupyter via the nbval plugin
  • using a GitHub status badge for the Jupyter Notebook test run pass/fail status
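Several of the pandas topics listed above (describing data, merging DataFrames, exporting CSV) can be sketched in a few lines. This is a minimal illustration, not the course's own notebook: the column names and values are hypothetical toy data, and the CSV is written to an in-memory buffer rather than uploaded anywhere.

```python
import io

import pandas as pd

# Hypothetical toy data; these values are placeholders, not course data.
cases = pd.DataFrame({"state": ["A", "B", "C"], "deaths": [10, 20, 30]})
other = pd.DataFrame({"state": ["A", "B", "D"], "sugar": [1.5, 2.5, 3.5]})

# Summary statistics (count, mean, std, min, quartiles, max) for numeric columns.
summary = cases.describe()

# Inner merge keeps only the states present in both DataFrames.
merged = cases.merge(other, on="state", how="inner")

# Export to CSV text; pass a file path to to_csv() to write a file instead.
buffer = io.StringIO()
merged.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
```

In a Colab notebook, the same calls would typically follow a `pd.read_csv(...)` step that loads the real dataset.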

Key Moments:

  • 02:02 Data Science Project Structure Overview
  • 05:15 Create GitHub Repo
  • 07:28 Launch GitHub Codespaces
  • 11:34 Using Colab Notebook
  • 12:47 Using TOC in Colab Notebook
  • 14:43 Saving notebook from Colab to GitHub
  • 19:52 Ingesting CSV files into Colab
  • 22:31 Describing Data using df.describe()
  • 24:30 Plotting data with seaborn
  • 28:30 lmplot
  • 29:30 Comparing cumulative COVID-19 deaths by state in a plot
  • 32:40 Merging pandas DataFrame with election and Sugar Consumption data
  • 37:38 Exporting CSV file and uploading the result from Colab to GitHub
  • 39:27 Continuous Integration of Jupyter Notebook with GitHub Actions
  • 40:44 Create Makefile
  • 44:32 Using GitHub Actions to test Jupyter via nbval plugin
  • 45:43 Using GitHub Status Badge for Jupyter Notebook test run pass
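The notebook-testing step named above relies on nbval, a pytest plugin that re-runs a notebook and compares cell outputs. The sketch below only shows the underlying command wrapped in Python; the helper names `nbval_command` and `run_nbval` and the notebook filename are hypothetical, and the course itself presumably drives this from a Makefile and a GitHub Actions workflow rather than from a script like this.

```python
import subprocess
import sys


def nbval_command(notebook_path, lax=False):
    """Build the pytest command that checks a notebook with the nbval plugin.

    --nbval compares every cell's stored output against a fresh run;
    --nbval-lax only checks that cells execute without error unless a cell
    is explicitly marked for output comparison.
    """
    flag = "--nbval-lax" if lax else "--nbval"
    return [sys.executable, "-m", "pytest", flag, notebook_path]


def run_nbval(notebook_path, lax=False):
    """Run the check and return pytest's exit code (0 means all cells passed)."""
    return subprocess.run(nbval_command(notebook_path, lax)).returncode


# Example call (requires pytest and nbval to be installed, and the
# placeholder notebook file to exist):
# exit_code = run_nbval("notebook.ipynb")
```

A CI job can fail the build whenever the returned exit code is non-zero, which is what a pass/fail status badge would then reflect.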

Learning Objectives

  • Create a GitHub repo and launch it in a Codespaces instance
  • Create a Jupyter Notebook and save it to a Colab instance
  • Ingest data from a CSV file into a Jupyter Notebook
  • Perform exploratory data analysis (EDA)
  • Build and test your data science project with GitHub Actions and nbval

Additional Popular Resources

  • Data Science Notebook

Publisher Resources

ISBN: 01222022VIDEOPAIML