Book description
This IBM® Redpaper publication provides a comprehensive overview of the IBM Spectrum® Discover metadata management software platform. We give a detailed explanation of how the product creates, collects, and analyzes metadata.
Several in-depth use cases are used that show examples of analytics, governance, and optimization. We also provide step-by-step information to install and set up the IBM Spectrum Discover trial environment.
More than 80% of all data that is collected by organizations is not in a standard relational database. Instead, it is trapped in unstructured documents, social media posts, machine logs, and so on. Many organizations face significant challenges to manage this deluge of unstructured data such as:
- Pinpointing and activating relevant data for large-scale analytics
- Lacking the fine-grained visibility that is needed to map data to business priorities
- Removing redundant, obsolete, and trivial (ROT) data
- Identifying and classifying sensitive data
IBM Spectrum Discover is a modern metadata management software that provides data insight for petabyte-scale file and Object Storage, storage on premises, and in the cloud. This software enables organizations to make better business decisions and gain and maintain a competitive advantage.IBM Spectrum Discover provides a rich metadata layer that enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of unstructured data. It improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research.
Table of contents
- Front cover
- Notices
- Preface
- Chapter 1. IBM Spectrum Discover overview
- Chapter 2. Metadata essentials
-
Chapter 3. Sample use cases
- 3.1 Storage optimization
-
3.2 Data governance
- 3.2.1 Use case scenario
- 3.2.2 Data stewardship with IBM Spectrum Discover
- 3.2.3 Documenting the various PII components
- 3.2.4 Identifying regular expressions for the PII components
- 3.2.5 Creating tags to identify files or objects that include PII
- 3.2.6 Creating policies to identify files or objects that include PII
- 3.2.7 Defining and scheduling regular reports for governance
- 3.2.8 Summary
- 3.3 Healthcare and life sciences use cases
- 3.4 Summary
- Chapter 4. Deep inspection and the AI pipeline
- Appendix A. Installing and setting up IBM Spectrum Discover
- Related publications
- Back cover
Product information
- Title: IBM Spectrum Discover: Metadata Management for Deep Insight of Unstructured Storage
- Author(s):
- Release date: October 2019
- Publisher(s): IBM Redbooks
- ISBN: 9780738457864
You might also like
book
Creating a Data-Driven Organization
What do you need to become a data-driven organization? Far more than having big data or …
audiobook
How to Do Nothing
A galvanizing critique of the forces vying for our attention-and our personal information-that redefines what we …
book
PACS and Imaging Informatics: Basic Principles and Applications, Second Edition
The definitive guide to PACS — now with more clinically applicable material In recent years, the …
video
Full Stack Web Development Mastery Course - Novice to Expert
Full stack development refers to the development of both frontend (client-side) and backend (server-side) portions of …