Let's first do some import statements:
# import packages we need for exploratory data analysis (EDA)# to store tabular dataimport pandas as pd# to do some mathimport numpy as np # a popular data visualization toolimport matplotlib.pyplot as plt # another popular data visualization toolimport seaborn as sns# allows the notebook to render graphics%matplotlib inline # a popular data visualization themeplt.style.use('fivethirtyeight')
And then, let's import our first dataset, which will explore salaries of different job titles in San Francisco. This dataset is available publicly and so you are encouraged to play around with it as much as you want:
# load in the data set# https://data.sfgov.org/City-Management-and-Ethics/Salary-Ranges-by-Job-Classification/7h4w-reyq ...