book

Open Source Observability

by Peter Corless, Neha Pawar

April 2025

Intermediate to advanced

62 pages

1h 12m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Foreword
Introduction
1. The Evolution of Observability
Traditional Observability StacksKey Limitations in Today’s StacksObservability 2.0 and the Shift Toward Disaggregation
2. Key Benefits of a Disaggregated Stack
Flexibility and CustomizationData Autonomy and ReusabilityCost-Effectiveness in Scaling
3. Core Layers of a Disaggregated Stack
Instrumentation, Agents, and Data CollectionStorage and Query MechanismsVisualization, Analysis, and Automation Tools
4. Implementing the Collection Layer
Managing Data Volume and VarietyMetricsLogsTracesStreaming SystemsTelemetry and Stream Processing
5. Optimizing Storage and Query for Performance
Handling High Cardinality, High Dimensionality, and JSON SupportAdvanced Indexing and Compression Techniques
6. Best Practices for Visualization
7. Future Trends in Observability
Enhancing the Platform Engineering Team’s ExperienceAI and ObservabilityAI for ObservabilityObservability for AIConclusion
Reference List

About the Authors

Content preview from Open Source Observability

Chapter 5. Optimizing Storage and Query for Performance

With a disaggregated observability stack, users can decide what storage solution works best for their telemetry and use cases. Traditionally, there have been many different types of storage systems, optimized for each pillar of observability. For example:

Metrics: Prometheus, Timescale, InfluxDB, and other time series databases or key-value stores
Logs: Grafana Loki, plus Elasticsearch, OpenSearch, and other search engines
Traces: Grafana Tempo, Jaeger, Hypertrace, and other column stores

More recently, real-time analytical databases like Apache Pinot and ClickHouse have been increasingly used for observability use cases because they can support more than one “pillar” in a common repository (i.e., the “Observability 1.5” mentioned in Chapter 3).

Let’s look at other ways to mix and match databases within a disaggregated stack. The Jaeger platform is truly versatile, with many different community-supported backend storage options, such as PostgreSQL, Cassandra, and ClickHouse (“Additional Storage Backends” 2025). Each of these storage options provides very different capabilities, including in their supported rates of ingestion, query performance, and query flexibility (Jegadish 2023).

However, users have also bypassed the Jaeger storage backend entirely, streaming telemetry from Jaeger agents directly into Apache Pinot for real-time analysis (StarTree 2024).

Let’s dig a little deeper into what makes for a good match for an observability ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9798341622135

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Open Source Observability

by Peter Corless, Neha Pawar

Chapter 5. Optimizing Storage and Query for Performance

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.