Book description
This IBM® Redpaper publication provides a comprehensive overview of the IBM Spectrum® Discover metadata management software platform. We give a detailed explanation of how the product creates, collects, and analyzes metadata.
Several in-depth use cases are used that show examples of analytics, governance, and optimization. We also provide step-by-step information to install and set up the IBM Spectrum Discover trial environment.
More than 80% of all data that is collected by organizations is not in a standard relational database. Instead, it is trapped in unstructured documents, social media posts, machine logs, and so on. Many organizations face significant challenges to manage this deluge of unstructured data such as:
- Pinpointing and activating relevant data for large-scale analytics
- Lacking the fine-grained visibility that is needed to map data to business priorities
- Removing redundant, obsolete, and trivial (ROT) data
- Identifying and classifying sensitive data
IBM Spectrum Discover is a modern metadata management software that provides data insight for petabyte-scale file and Object Storage, storage on premises, and in the cloud. This software enables organizations to make better business decisions and gain and maintain a competitive advantage.IBM Spectrum Discover provides a rich metadata layer that enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of unstructured data. It improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research.
Table of contents
- Front cover
- Notices
- Preface
- Chapter 1. IBM Spectrum Discover overview
- Chapter 2. Metadata essentials
-
Chapter 3. Sample use cases
- 3.1 Storage optimization
-
3.2 Data governance
- 3.2.1 Use case scenario
- 3.2.2 Data stewardship with IBM Spectrum Discover
- 3.2.3 Documenting the various PII components
- 3.2.4 Identifying regular expressions for the PII components
- 3.2.5 Creating tags to identify files or objects that include PII
- 3.2.6 Creating policies to identify files or objects that include PII
- 3.2.7 Defining and scheduling regular reports for governance
- 3.2.8 Summary
- 3.3 Healthcare and life sciences use cases
- 3.4 Summary
- Chapter 4. Deep inspection and the AI pipeline
- Appendix A. Installing and setting up IBM Spectrum Discover
- Related publications
- Back cover
Product information
- Title: IBM Spectrum Discover: Metadata Management for Deep Insight of Unstructured Storage
- Author(s):
- Release date: October 2019
- Publisher(s): IBM Redbooks
- ISBN: 9780738457864
You might also like
book
Metadata Management with IBM InfoSphere Information Server
What do you know about your data? And how do you know what you know about …
book
How to Lead a Values-Based Professional Services Firm
We live in a values-driven world. As times change, businesses must evolve. The way that leaders …
book
Systems of Insight for Digital Transformation: Using IBM Operational Decision Manager Advanced and Predictive Analytics
Systems of record (SORs) are engines that generates value for your business. Systems of engagement (SOE) …
book
Powerful Conversations: How High Impact Leaders Communicate
"Phil Harkins has it exactly right. To be a leader is to communicate powerfully—as he does …