December 2018
Beginner to intermediate
684 pages
21h 9m
English
The download of the content of one or more html tables works as follows, for instance for the constituents of the S&P500 index from Wikipedia:
sp_url = 'https://en.wikipedia.org/wiki/List_of_S%26P_500_companies'sp = pd.read_html(sp_url, header=0)[0] # returns a list for each tablesp.info()RangeIndex: 505 entries, 0 to 504 Data columns (total 9 columns): Ticker symbol 505 non-null object Security 505 non-null object SEC filings 505 non-null object GICS Sector 505 non-null object GICS Sub Industry 505 non-null object Location 505 non-null object Date first added[3][4] 398 non-null object CIK 505 non-null int64 Founded 139 non-null object