Skip to Content
Cloud Native Geospatial Analytics with Apache Sedona
book

Cloud Native Geospatial Analytics with Apache Sedona

by Pawel Tokaj, Jia Yu, Mo Sarwat
December 2025
Intermediate to advanced
338 pages
8h 38m
English
O'Reilly Media, Inc.
Content preview from Cloud Native Geospatial Analytics with Apache Sedona

Foreword

Geospatial data has become central to our understanding and response to the world around us. From monitoring ecosystems to precise map matching, location is often the key to unlocking insight. However, as the volume and velocity of geospatial data have surged, our analytical tools have struggled to keep pace. Traditional GIS tools excel at analysis but are often limited to single machine environments. Meanwhile, cloud data warehouses offer impressive scalability but often treat geospatial data as an afterthought.

Apache Sedona bridges this divide. Sedona is an open source framework that embeds geospatial analysis directly into distributed computing platforms such as Apache Spark, Flink, and Snowflake. It treats spatial as a first-class concern, enabling complex spatial joins, queries, and raster processing across billions of records. With Sedona, we gain both the depth of geospatial science coupled with the elasticity of the cloud.

I introduced Sedona briefly in my previous book, Introduction to GIS Programming: A Practical Python Guide to Open Source Geospatial Tools, where I included a chapter on distributed computing with Apache Sedona. That chapter sparked strong interest among readers, but it could only scratch the surface. Sedona is far too powerful and comprehensive to be condensed into a single section. It deserves a full-length treatment, and that is precisely what Cloud Native Geospatial Analytics with Apache Sedona provides.

This book is authored by Sedona’s ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Azure OpenAI Service for Cloud Native Applications

Azure OpenAI Service for Cloud Native Applications

Adrián González Sánchez
Serverless Development on AWS

Serverless Development on AWS

Sheen Brisals, Luke Hedger
Generative AI on AWS

Generative AI on AWS

Chris Fregly, Antje Barth, Shelbee Eigenbrode
Data Pipelines with Apache Airflow

Data Pipelines with Apache Airflow

Bas Harenslak, Julian de Ruiter

Publisher Resources

ISBN: 9781098173982Errata Page