Chapter 2. Importing (Much) More Data

You and your team at ElectricHarmony now face the challenge of importing a massive amount of data, which will better represent your total user base, to provide recommendations for users.

You try the ingestion method you used in Chapter 1, but it slows down as the tracks dataset grows from a hundred thousand rows to a million. The slowdown often ends in the dreaded “spinning wheel of death,” making you wonder whether your database choice is effective at scale.

Your concerns are valid. You need to answer key questions from your team and stakeholders, such as:

  • Can you ingest all the necessary data into the graph in less than 24 hours if the business produces a certain amount of data in a day?

  • Can the system serve recommendations in near real time and keep up with the data produced by other systems every second?

  • In case of a disaster, how long would recovery ...
