Book description
Data-driven insights are a key competitive advantage for any industry today, but deriving insights from raw data can still take days or weeks. Most organizations can’t scale data science teams fast enough to keep up with the growing amounts of data to transform. What’s the answer? Self-service data.
With this practical book, data engineers, data scientists, and team managers will learn how to build a self-service data science platform that helps anyone in your organization extract insights from data. Sandeep Uttamchandani provides a scorecard to track and address bottlenecks that slow down time to insight across data discovery, transformation, processing, and production. This book bridges the gap between data scientists bottlenecked by engineering realities and data engineers unclear about ways to make self-service work.
- Build a self-service portal to support data discovery, quality, lineage, and governance
- Select the best approach for each self-service capability using open source cloud technologies
- Tailor self-service for the people, processes, and technology maturity of your data platform
- Implement capabilities to democratize data and reduce time to insight
- Scale your self-service portal to support a large number of users within your organization
Publisher resources
Table of contents
- Preface
- 1. Introduction
- I. Self-Service Data Discovery
- 2. Metadata Catalog Service
- 3. Search Service
- 4. Feature Store Service
- 5. Data Movement Service
- 6. Clickstream Tracking Service
- II. Self-Service Data Prep
- 7. Data Lake Management Service
- 8. Data Wrangling Service
- 9. Data Rights Governance Service
- III. Self-Service Build
- 10. Data Virtualization Service
- 11. Data Transformation Service
- 12. Model Training Service
- 13. Continuous Integration Service
- 14. A/B Testing Service
- IV. Self-Service Operationalize
- 15. Query Optimization Service
- 16. Pipeline Orchestration Service
- 17. Model Deploy Service
- 18. Quality Observability Service
- 19. Cost Management Service
- Index
- About the Author
Product information
- Title: The Self-Service Data Roadmap
- Author(s):
- Release date: September 2020
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492075257
You might also like
book
Robust Python
Does it seem like your Python projects are getting bigger and bigger? Are you feeling the …
book
Practical Time Series Analysis
Time series data analysis is increasingly important due to the massive production of such data through …
book
Communicating with Data
Data is a fantastic raw resource for powering change in an organization, but all too often …
book
Data Management at Scale, 2nd Edition
As data management continues to evolve rapidly, managing all of your data in a central place, …