Understanding big data
Big data is a term for the challenges we face because of the exponential growth of data, usually described in terms of the V problems: volume, velocity, and variety. These challenges can be subdivided into the following phases:
- Capture
- Storage
- Search
- Sharing
- Analytics
- Visualization
Big data systems are technologies that can process and analyze data that poses the volume, velocity, and variety problems discussed earlier. Technologies that solve big data problems should use the following architectural strategies:
- Distributed computing system
- Massively parallel processing (MPP)
- NoSQL (Not only SQL)
- Analytical database
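The idea shared by the first two strategies (split the data, work on each partition independently, then merge the partial results) can be sketched in a few lines. This is only an illustration under assumptions: the function names `partition`, `count_words`, and `parallel_word_count` are hypothetical, and local threads stand in for the separate machines a real distributed system would use.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def partition(records, n_parts):
    """Split records into n_parts chunks, assigning items round-robin."""
    return [records[i::n_parts] for i in range(n_parts)]


def count_words(chunk):
    """Per-partition work: count the words in this chunk only."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts


def parallel_word_count(records, n_parts=3):
    """Process each partition independently, then merge the partial counts."""
    chunks = partition(records, n_parts)
    # Threads are a stand-in here; a cluster would send each chunk to a node.
    with ThreadPoolExecutor(max_workers=n_parts) as pool:
        partials = list(pool.map(count_words, chunks))
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


lines = ["big data volume", "big data velocity", "big data variety"]
result = parallel_word_count(lines)
```

Because each partition is counted without reference to the others, the per-partition step could run anywhere; only the final merge needs to see all partial results.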
The structure is as follows:
Big data systems ...