Skip to Content
DuckDB in Action, Video Edition
video

DuckDB in Action, Video Edition

by Michael Hunger, Mark Needham, Michael Simons
August 2024
Intermediate
7h 40m
English
Manning Publications

Overview

In Video Editions the narrator reads the book while the content, figures, code listings, diagrams, and text appear on the screen. Like an audiobook that you can also watch as a video.

Dive into DuckDB and start processing gigabytes of data with ease—all with no data warehouse.

DuckDB is a cutting-edge SQL database that makes it incredibly easy to analyze big data sets right from your laptop. In DuckDB in Action you’ll learn everything you need to know to get the most out of this awesome tool, keep your data secure on prem, and save you hundreds on your cloud bill. From data ingestion to advanced data pipelines, you’ll learn everything you need to get the most out of DuckDB—all through hands-on examples.

Open up DuckDB in Action and learn how to:

  • Read and process data from CSV, JSON and Parquet sources both locally and remote
  • Write analytical SQL queries, including aggregations, common table expressions, window functions, special types of joins, and pivot tables
  • Use DuckDB from Python, both with SQL and its "Relational"-API, interacting with databases but also data frames
  • Prepare, ingest and query large datasets
  • Build cloud data pipelines
  • Extend DuckDB with custom functionality

Pragmatic and comprehensive, DuckDB in Action introduces the DuckDB database and shows you how to use it to solve common data workflow problems. You won’t need to read through pages of documentation—you’ll learn as you work. Get to grips with DuckDB's unique SQL dialect, learning to seamlessly load, prepare, and analyze data using SQL queries. Extend DuckDB with both Python and built-in tools such as MotherDuck, and gain practical insights into building robust and automated data pipelines.

About the Technology
DuckDB makes data analytics fast and fun! You don’t need to set up a Spark or run a cloud data warehouse just to process a few hundred gigabytes of data. DuckDB is easily embeddable in any data analytics application, runs on a laptop, and processes data from almost any source, including JSON, CSV, Parquet, SQLite and Postgres.

About the Book
DuckDB in Action guides you example-by-example from setup, through your first SQL query, to advanced topics like building data pipelines and embedding DuckDB as a local data store for a Streamlit web app. You’ll explore DuckDB’s handy SQL extensions, get to grips with aggregation, analysis, and data without persistence, and use Python to customize DuckDB. A hands-on project accompanies each new topic, so you can see DuckDB in action.

What's Inside
  • Prepare, ingest and query large datasets
  • Build cloud data pipelines
  • Extend DuckDB with custom functionality
  • Fast-paced SQL recap: From simple queries to advanced analytics


About the Reader
For data pros comfortable with Python and CLI tools.

About the Authors
Mark Needham is a blogger and video creator at @‌LearnDataWithMark. Michael Hunger leads product innovation for the Neo4j graph database. Michael Simons is a Java Champion, author, and Engineer at Neo4j.

Quotes
I use DuckDB every day, and I still learned a lot about how DuckDB makes things that are hard in most databases easy!
- Jordan Tigani, Founder, MotherDuck

An excellent resource! Unlocks possibilities for storing, processing, analyzing, and summarizing data at the edge using DuckDB.
- Pramod Sadalage, Director, Thoughtworks

Clear and accessible. A comprehensive resource for harnessing the power of DuckDB for both novices and experienced professionals.
- Qiusheng Wu, Associate Professor, University of Tennessee

Excellent! The book all we ducklings have been waiting for!
- Gunnar Morling, Decodable

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Watch now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Grokking Algorithms, Video Edition

Grokking Algorithms, Video Edition

Aditya Y. Bhargava
DuckDB in Action

DuckDB in Action

Mark Needham, Michael Hunger, Michael Simons

Publisher Resources

ISBN: 9781633437258VEPublisher SupportOtherPublisher WebsitePurchase Link