book

Python for Finance

Name: Python for Finance
Author: Yves Hilpisch
ISBN: 9781491945285

by Yves Hilpisch

December 2014

Intermediate to advanced

606 pages

13h 46m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Preface
Conventions Used in This BookUsing Code ExamplesSafari® Books OnlineHow to Contact UsAcknowledgments
I. Python and Finance
1. Why Python for Finance?
What Is Python?Brief History of PythonThe Python EcosystemPython User SpectrumThe Scientific StackTechnology in FinanceTechnology SpendingTechnology as EnablerTechnology and Talent as Barriers to EntryEver-Increasing Speeds, Frequencies, Data VolumesThe Rise of Real-Time AnalyticsPython for FinanceFinance and Python SyntaxEfficiency and Productivity Through PythonShorter time-to-resultsEnsuring high performanceFrom Prototyping to ProductionConclusionsFurther Reading
2. Infrastructure and Tools
Python DeploymentAnacondaPython Quant PlatformToolsPythonIPythonFrom shell to browserBasic usageMarkdown and LaTeXMagic commandsSystem shell commandsSpyderConclusionsFurther Reading
3. Introductory Examples
Implied VolatilitiesMonte Carlo SimulationPure PythonVectorization with NumPyFull Vectorization with Log Euler SchemeGraphical AnalysisTechnical AnalysisConclusionsFurther Reading
II. Financial Analytics and Development
4. Data Types and Structures
Basic Data TypesIntegersFloatsStringsBasic Data StructuresTuplesListsExcursion: Control StructuresExcursion: Functional ProgrammingDictsSetsNumPy Data StructuresArrays with Python ListsRegular NumPy ArraysStructured ArraysVectorization of CodeBasic VectorizationMemory LayoutConclusionsFurther Reading
5. Data Visualization
Two-Dimensional PlottingOne-Dimensional Data SetTwo-Dimensional Data SetOther Plot StylesFinancial Plots3D PlottingConclusionsFurther Reading
6. Financial Time Series
pandas BasicsFirst Steps with DataFrame ClassSecond Steps with DataFrame ClassBasic AnalyticsSeries ClassGroupBy OperationsFinancial DataRegression AnalysisHigh-Frequency DataConclusionsFurther Reading
7. Input/Output Operations
Basic I/O with PythonWriting Objects to DiskReading and Writing Text FilesSQL DatabasesWriting and Reading NumPy ArraysI/O with pandasSQL DatabaseFrom SQL to pandasData as CSV FileData as Excel FileFast I/O with PyTablesWorking with TablesWorking with Compressed TablesWorking with ArraysOut-of-Memory ComputationsConclusionsFurther Reading

8. Performance Python
Python Paradigms and PerformanceMemory Layout and PerformanceParallel ComputingThe Monte Carlo AlgorithmThe Sequential CalculationThe Parallel CalculationPerformance ComparisonmultiprocessingDynamic CompilingIntroductory ExampleBinomial Option PricingStatic Compiling with CythonGeneration of Random Numbers on GPUsConclusionsFurther Reading
9. Mathematical Tools
ApproximationRegressionMonomials as basis functionsIndividual basis functionsNoisy dataUnsorted dataMultiple dimensionsInterpolationConvex OptimizationGlobal OptimizationLocal OptimizationConstrained OptimizationIntegrationNumerical IntegrationIntegration by SimulationSymbolic ComputationBasicsEquationsIntegrationDifferentiationConclusionsFurther Reading
10. Stochastics
Random NumbersSimulationRandom VariablesStochastic ProcessesGeometric Brownian motionSquare-root diffusionStochastic volatilityJump diffusionVariance ReductionValuationEuropean OptionsAmerican OptionsRisk MeasuresValue-at-RiskCredit Value AdjustmentsConclusionsFurther Reading
11. Statistics
Normality TestsBenchmark CaseReal-World DataPortfolio OptimizationThe DataThe Basic TheoryPortfolio OptimizationsEfficient FrontierCapital Market LinePrincipal Component AnalysisThe DAX Index and Its 30 StocksApplying PCAConstructing a PCA IndexBayesian RegressionBayes’s FormulaPyMC3Introductory ExampleReal DataConclusionsFurther Reading
12. Excel Integration
Basic Spreadsheet InteractionGenerating Workbooks (.xls)Generating Workbooks (.xslx)Reading from WorkbooksUsing OpenPyxlUsing pandas for Reading and WritingScripting Excel with PythonInstalling DataNitroWorking with DataNitroScripting with DataNitroPlotting with DataNitroUser-defined functionsxlwingsConclusionsFurther Reading
13. Object Orientation and Graphical User Interfaces
Object OrientationBasics of Python ClassesSimple Short Rate ClassCash Flow Series ClassGraphical User InterfacesShort Rate Class with GUIUpdating of ValuesCash Flow Series Class with GUIConclusionsFurther Reading
14. Web Integration
Web BasicsftplibhttpliburllibWeb PlottingStatic PlotsInteractive PlotsReal-Time PlotsReal-time FX dataReal-time stock price quotesRapid Web ApplicationsTraders’ Chat RoomData ModelingThe Python CodeImports and database preliminariesCore functionalityTemplatingStylingWeb ServicesThe Financial ModelThe ImplementationConclusionsFurther Reading
III. Derivatives Analytics Library
15. Valuation Framework
Fundamental Theorem of Asset PricingA Simple ExampleThe General ResultsRisk-Neutral DiscountingModeling and Handling DatesConstant Short RateMarket EnvironmentsConclusionsFurther Reading
16. Simulation of Financial Models
Random Number GenerationGeneric Simulation ClassGeometric Brownian MotionThe Simulation ClassA Use CaseJump DiffusionThe Simulation ClassA Use CaseSquare-Root DiffusionThe Simulation ClassA Use CaseConclusionsFurther Reading
17. Derivatives Valuation
Generic Valuation ClassEuropean ExerciseThe Valuation ClassA Use CaseAmerican ExerciseLeast-Squares Monte CarloThe Valuation ClassA Use CaseConclusionsFurther Reading
18. Portfolio Valuation
Derivatives PositionsThe ClassA Use CaseDerivatives PortfoliosThe ClassA Use CaseConclusionsFurther Reading
19. Volatility Options
The VSTOXX DataVSTOXX Index DataVSTOXX Futures DataVSTOXX Options DataModel CalibrationRelevant Market DataOption ModelingCalibration ProcedureAmerican Options on the VSTOXXModeling Option PositionsThe Options PortfolioConclusionsFurther Reading
A. Selected Best Practices
Python SyntaxDocumentationUnit Testing
B. Call Option Class
C. Dates and Times
PythonNumPypandas
Index
About the Author
Colophon
Copyright

Content preview from Python for Finance

Chapter 7. Input/Output Operations

It is a capital mistake to theorize before one has data.
— Sherlock Holmes

As a general rule, the majority of data, be it in a finance context or any other application area, is stored on hard disk drives (HDDs) or some other form of permanent storage device, like solid state disks (SSDs) or hybrid disk drives. Storage capacities have been steadily increasing over the years, while costs per storage unit (e.g., megabytes) have been steadily falling.

At the same time, stored data volumes have been increasing at a much faster pace than the typical random access memory (RAM) available even in the largest machines. This makes it necessary not only to store data to disk for permanent storage, but also to compensate for lack of sufficient RAM by swapping data from RAM to disk and back.

Input/output (I/O) operations are therefore generally very important tasks when it comes to finance applications and data-intensive applications in general. Often they represent the bottleneck for performance-critical computations, since I/O operations cannot in general shuffle data fast enough to the RAM^[28] and from the RAM to the disk. In a sense, CPUs are often “starving” due to slow I/O operations.

Although the majority of today’s financial and corporate analytics efforts are confronted with “big” data (e.g., of petascale size), single analytics tasks generally use data (sub)sets that fall in the “mid” data category. A recent study concluded:

Our measurements as well as ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781491945360Errata

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Python for Finance

by Yves Hilpisch

Chapter 7. Input/Output Operations

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.