Chapter 3. Data Wrangling and Data Preparation for GenAI and Agentic AI Applications
Imagine a global financial institution that has invested millions in cutting-edge generative AI models, only to discover that their carefully curated data—meticulously prepared for traditional analytics—fails to support even basic reasoning tasks. Their customer service AI struggles to connect information across systems, their risk models can’t incorporate unstructured regulatory guidance, and their market intelligence system generates inaccurate insights due to semantic inconsistencies across data sources. Despite world-class data infrastructure and analytics capabilities, they find themselves at a competitive disadvantage against more nimble competitors who have reimagined their data foundations for the AI era.
The Enterprise Challenge
This scenario is playing out across industries as organizations confront an uncomfortable reality: data preparation approaches that served them well for decades are fundamentally inadequate for the demands of generative AI and agentic systems. The gap isn’t merely technical—it represents a paradigm shift in how organizations must conceptualize, structure, and evolve their data assets to remain competitive in an AI-driven economy.
Purpose and Audience
This chapter serves as both a strategic guide and a practical handbook for organizations navigating the transformation from traditional data management to AI-ready knowledge systems. It is written for a diverse ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access