Architecting the Industrial Internet by Carla Romano, Robert Stackowiak, Shyam Nath

Hadoop file systems

As Internet search engines emerged, the need to search unstructured, variable, high-volume data increased dramatically. Unlike the data held in enterprise systems, text data, photographs, social network data, and time-series measurements rarely need referential integrity enforcement, deduplication, or other data management functions. The Hadoop Distributed File System (HDFS) provides inexpensive storage and search capabilities for very large volumes of highly variable, high-volume, high-velocity, and high-variety data (the 4 Vs of big data).

As described in Chapter 6, Defining the Data and Analytics Architecture, the Hadoop storage platform distributes ...
