The data dimension
To categorize the data dimension of the problem, we look at its volume, velocity, and variety (the 3Vs), which are defined as follows:
Volume: The volume is the expected size of the data that the algorithm will process.
Velocity: The velocity is the expected rate of new data generation when the algorithm is used. It can be zero.
Variety: The variety quantifies how many different types of data the designed algorithm is expected to deal with.
The following figure shows the 3Vs of the data in more detail. The center of this diagram shows the simplest possible data, with a small volume and low variety and velocity. As we move away from the center, the complexity of the data increases. It can increase in one or more of the ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access