Hands-on Retrieval Augmented Generation (RAG)

Intermediate

Implement RAG with LlamaIndex

Course Outcomes

Demonstrate how to create an end-to-end RAG solution
Explain some of the limitations of LLMs
Demonstrate how to implement RAG with LlamaIndex and GPT-4-Turbo Assistants

Retrieval Augmented Generation (RAG) is one of the most common business applications for Large Language Models. RAG enables the querying of internal documents as knowledge bases. In this course, participants will learn how to query their internal documents and understand how RAG works under the hood. Learners will work with embeddings and Vector databases, explore what use cases RAG is good for, and learn how to build RAG-based solutions using LlamaIndex. They can take lessons learned and apply them on the job, making knowledge-bases available to internal teams.

What you’ll learn and how you can apply it

Understanding and work with embeddings and Vector databases
Learn use cases for RAG
Build solutions using LlamaIndex

This live event is for you because...

You’re a developer, data scientist or machine learning engineer
You work with ChatGPT or other LLMs and want to develop solutions for working with bespoke data
You are a developer and want to build solutions with LLMs

Prerequisites

Experienced with Python (intermediate)
Basic understanding of ML and LLMs
OpenAI and Cohere account and API key

Course Preparation

Have an OpenAI API key and login (to complete the exercises)

Recommended follow-up:

Read Hands-on RAG for Production (book)
Read RAG with Python Cookbook (book)

Schedule

The time frames are only estimates and may vary according to how the class is progressing.

Module 1 - Embeddings (60 minutes)

Presentation: Limitations of LLMs
Presentation: Embeddings
Exercise: Introduction to Embeddings
Pulse Check or Poll: Embeddings
Break

Module 2 - Retrieval Augmented Generation (60 minutes)

Presentation: Using external sources of data
Presentation: Practical considerations when working with RAG
Exercise: Working with vector databases
Exercise: Introduction to RAG and LlamaIndex
Exercise: Working with RAG and FAISS
Q&A
Break

Module 3 - More Advanced Considerations for RAG (60 minutes)

Presentation: Improving Query Processing
Presentation: Post-retrieval processing using Retrievers and Query Engines
Exercise: (mini project) Working with a knowledge-base using LlamaIndex
Exercise: Bi-Encoders and Cross-encoders for post-retrieval
Q&A

Module 4 - Using OpenAI Assistant retrieval and agents (60 minutes)

Presentation: Retrieval using the playground
Exercise: Retrieval using the OpenAI API
Presentation: Why RAG-based solutions are still relevant despite OpenAI Assistant
Exercise: Retrieval project using OpenAI Assistants
Presentation: The future of RAG - Agents

Your Instructor

Jonathan Fernandes
Jonathan Fernandes works with large language models every day, and he did so long before ChatGPT came on the scene. He focuses primarily on LLMs in production to solve business problems across a variety of domains including finance, health, manufacturing, legal, and education. Previously, he worked for LLM-provider startups including Cohere. Jonathan has taught over 250,000 people about LLMs and how to use them.

linkedin search

Skills covered

Retrieval Augmented Generation (RAG)

Large Language Models (LLMs)

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills