O'Reilly logo

Architecting the Industrial Internet by Carla Romano, Robert Stackowiak, Shyam Nath

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Management considerations for data lakes

It’s easy to dump data into a data lake without a clear idea of what it will be used for or with the intention of using it later. Without some level of control, you can easily end up with a data swamp in which its difficult to manage or find relevant data, or worse, a data graveyard where data is stored but never used. A data lake needs a centralized index to keep track of data and information, and any different versions of it, and where it came from. It can also be useful to score the information as to how useful or accurate it is, and for which uses and applications and it's suitable how long it will be relevant or useful, with data governance to enforce retention and disposition policies.

Security ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required