Book description
Data-driven insights are a key competitive advantage for any industry today, but deriving insights from raw data can still take days or weeks. Most organizations can’t scale data science teams fast enough to keep up with the growing amounts of data to transform. What’s the answer? Self-service data.
With this practical book, data engineers, data scientists, and team managers will learn how to build a self-service data science platform that helps anyone in your organization extract insights from data. Sandeep Uttamchandani provides a scorecard to track and address bottlenecks that slow down time to insight across data discovery, transformation, processing, and production. This book bridges the gap between data scientists bottlenecked by engineering realities and data engineers unclear about ways to make self-service work.
- Build a self-service portal to support data discovery, quality, lineage, and governance
- Select the best approach for each self-service capability using open source cloud technologies
- Tailor self-service for the people, processes, and technology maturity of your data platform
- Implement capabilities to democratize data and reduce time to insight
- Scale your self-service portal to support a large number of users within your organization
Publisher resources
Table of contents
- Preface
- 1. Introduction
- I. Self-Service Data Discovery
- 2. Metadata Catalog Service
- 3. Search Service
- 4. Feature Store Service
- 5. Data Movement Service
- 6. Clickstream Tracking Service
- II. Self-Service Data Prep
- 7. Data Lake Management Service
- 8. Data Wrangling Service
- 9. Data Rights Governance Service
- III. Self-Service Build
- 10. Data Virtualization Service
- 11. Data Transformation Service
- 12. Model Training Service
- 13. Continuous Integration Service
- 14. A/B Testing Service
- IV. Self-Service Operationalize
- 15. Query Optimization Service
- 16. Pipeline Orchestration Service
- 17. Model Deploy Service
- 18. Quality Observability Service
- 19. Cost Management Service
- Index
- About the Author
Product information
- Title: The Self-Service Data Roadmap
- Author(s):
- Release date: September 2020
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492075202
You might also like
book
40 Algorithms Every Programmer Should Know
Learn algorithms for solving classic computer science problems with this concise guide covering everything from fundamental …
book
Software Engineering at Google
Today, software engineers need to know not only how to program effectively but also how to …
book
Head First Design Patterns, 2nd Edition
You know you don’t want to reinvent the wheel, so you look to design patterns—the lessons …
book
Analytical Skills for AI and Data Science
While several market-leading companies have successfully transformed their business models by following data- and AI-driven paths, …