Architectural guidance
As evidenced in the previous sections, there are a plethora of options available for Data Consumption; choosing the right tool depends primarily on the use case you are attempting to implement using the Data Lake. We also see that the market is flooded with umpteen tools that make decision making very difficult.
Data Discovery
We have seen, in the previous sections, that Data Lake exposes a queryable interface to data consumers to discover the data. Simple visualizations such as a histogram or tag cloud can provide an intuitive understanding of the data. The following figure depicts the key aspects that are to be considered while choosing the right tools and technologies for Data Discovery:
Get Data Lake Development with Big Data now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.