Chapter 4. Handling large data on a single computer

This chapter covers

  • Working with large data sets on a single computer
  • Working with Python libraries suitable for larger data sets
  • Understanding the importance of choosing correct algorithms and data structures
  • Understanding how you can adapt algorithms to work inside databases

What if you had so much data that it seems to outgrow you, and your techniques no longer seem to suffice? What do you do, surrender or adapt?

Luckily you chose to adapt, because you’re still reading. This chapter introduces you to techniques and tools to handle larger data sets that are still manageable by a single computer if you adopt the right techniques.

This chapter gives you the tools to perform the classifications ...

Get Introducing Data Science now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.