May 2024
Intermediate to advanced
368 pages
10h 7m
English
Customizing Retrieval-Augmented Generation (RAG) components and optimizing performance are critical to building robust, production-ready applications with LlamaIndex. This chapter explores methods for leveraging open source models, routing requests intelligently across large language models (LLMs), and using community-built modules to increase flexibility and cost-effectiveness. It also covers advanced tracing, evaluation methods, and deployment options that help you gain deep insight into your application, ensure reliable operation, and streamline the development life cycle.
Throughout this chapter, we’re going to cover the following main topics: