September 2017
Intermediate to advanced
360 pages
9h 43m
English
It’s easy to dump data into a data lake without a clear idea of what it will be used for or with the intention of using it later. Without some level of control, you can easily end up with a data swamp in which its difficult to manage or find relevant data, or worse, a data graveyard where data is stored but never used. A data lake needs a centralized index to keep track of data and information, and any different versions of it, and where it came from. It can also be useful to score the information as to how useful or accurate it is, and for which uses and applications and it's suitable how long it will be relevant or useful, with data governance to enforce retention and disposition policies.
Security ...