on-demand course

Production LLM Monitoring: Observability, Tracing & Cost Optimization

with Paulo Dichone

February 2026

Advanced

2h 35m

English

Packt Publishing

Closed Captioning available in English

Watch now

Unlock full access

Includes

Badge

Course outline

Introduction
1m 15s
Observability and Cost Management – Overview
2m 18s
The Hidden Costs of LLM Applications
2m 11s
Traditional vs LLM Observability
2m 0s
The Three Pillars for LLMs
1m 25s
ROI Calculator – Making the Business Case
1m 47s
Understanding LLM Costs
11m 20s
Where Costs Hide – RAG and Agent Pipeline
3m 45s
The Hidden Cost Multiplier
4m 3s
Observability Platform Selection
3m 43s
Setting Up Langfuse
5m 50s
Setting up Langfuse and Creating First Trace
4m 0s
Langfuse Data Model
8m 5s
Hands-on: First LLM Trace – Deep Dive
4m 14s
Langfuse API Levels – Code Demonstrations
7m 45s
Production Instrumented LLM Use Case – Hands-on
14m 33s
Instrumenting a Multi-Step RAG Pipeline – Langfuse Observability – Full Handson
27m 3s
Framework Integration (LangChain)
10m 51s
Cost Optimization Strategies – Overview
7m 25s
Prompt Optimization – Handson
2m 16s
Semantic Caching
7m 38s
Smart Model Routing
5m 14s
Cost Optimization Summary
1m 25s
Setting up Alerts that Matter
7m 30s
Security and Compliance Patterns
4m 39s
Production Patterns Implementations – Real-world
1m 13s
Course Recap and Next Steps
2m 18s

Overview

In this 2-hour course, you will learn how to implement production-grade LLM observability, tracing, and cost optimization using Langfuse, enabling faster debugging, reliable monitoring, and tighter control over LLM API spending in real-world systems.

What I will be able to do after this course

Implement production-grade LLM observability using Langfuse and tracing concepts
Reduce LLM API costs using semantic caching, model routing, and prompt optimization
Debug LLM applications quickly using traces, spans, and instrumentation patterns
Set up cost alerts and monitoring dashboards to prevent budget escalations
Build production-ready patterns for token tracking, cost calculation, and PII redaction

Course Instructor(s)

Paulo Dichone is a software engineer and educator who has taught 280,000+ students across 175 countries. He is the founder of Build Apps with Paulo and delivers practical, career-focused training in software development and cloud solutions. His teaching emphasizes real-world implementation that prepares learners for production challenges.

Who is it for?

This course is ideal for ML engineers, AI engineers, backend developers, and technical leads running LLM applications in production who need visibility into performance and costs. Learners should have basic Python skills, prior experience making LLM API calls, and a working Python setup with a code editor.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Watch now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Executive Briefing: Data catalogs—Concepts, capabilities, and key platforms

Publisher Resources

ISBN: 9781807605858

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Production LLM Monitoring: Observability, Tracing & Cost Optimization

with Paulo Dichone

Chapter 1 : Introduction

Chapter 2 : The Business Case Why Observability = Money

Chapter 3 : Understanding LLM Costs – Where Your Money Goes

Chapter 4 : Observability Platform Selection – Langfuse and Hands-on

Chapter 5 : Instrumenting Your LLM Application

Chapter 6 : Cost Optimization Strategies That Work

Chapter 7 : Monitoring, Alerting & Debugging

Chapter 8 : Production Patterns & Security

Chapter 9 : Wrap up and Next Steps