book

Mastering Python for Finance - Second Edition

by James Ma Weiming

April 2019

Intermediate to advanced

426 pages

11h 13m

English

Packt Publishing

Read now

Unlock full access

Content preview from Mastering Python for Finance - Second Edition

Scaling and splitting the data

We are interested in using the earliest five years of pricing data for training our model, and the most recent year of 2018 for testing our predictions. Run the following codes to split our df dataset:

In [ ]:    df_train = df['2017':'2013']    df_test = df['2018']

The df_train and df_test variables contain our training and testing data respectively.

An important step in data preprocessing is to normalize the dataset. This will transform input feature values to a mean of zero and a variance of one. Normalization helps to avoid biases during training due to the different scales of input features.

The MinMaxScaler function of the sklearn module helps to transform each feature into a range between -1 and ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Publisher Resources

ISBN: 9781789346466Supplemental Content

Mastering Python for Finance - Second Edition

by James Ma Weiming

Scaling and splitting the data

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

You might also like

Python for Finance - Second Edition

Python for Finance, 2nd Edition

Python for Finance

Python for Finance: Investment Fundamentals and Data Analytics

Publisher Resources