Skip to Content
Python Polars: The Definitive Guide
book

Python Polars: The Definitive Guide

by Jeroen Janssens, Thijs Nieuwdorp
February 2025
Intermediate to advanced
504 pages
11h 31m
English
O'Reilly Media, Inc.
Content preview from Python Polars: The Definitive Guide

Chapter 14. Joining and Concatenating

Data often comes from multiple sources that you will have to connect and combine in a meaningful way. There are multiple ways to combine DataFrames, which we’ll go over in this chapter.

Funnily enough, this is where Polars once started. Faced with combining two CSV files in Rust, Ritchie Vink started his journey which ultimately led to where we are now. This gives a special sentiment to the operations in this chapter.

In this chapter, you’ll learn:

  • That you can use df.join() to combine DataFrames based on the values in the DataFrames and the strategies outlined here

  • That df.join_asof() is a special join that joins DataFrames based on the nearest value in the other DataFrame

  • How to combine DataFrames using pl.concat(), df.vstack(), df.hstack(), and df.extend()

  • How to combine Series with series.append()

  • The differences between all these methods and when to use them

The instructions to get any files you might need are in Chapter 2. We assume that you have the files in the data subdirectory.

Joining

To combine different DataFrames, Polars offers the df.join() method. It takes the arguments listed in Table 14-1.

Table 14-1. Arguments for the method df.join()
Argument Description

other

The DataFrame to join with.

on

The column to join on when the name is the same in the left and right DataFrames.

left_on and right_on

The columns to join if they have different names in the left and right DataFrames.

how

The join strategy ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Hypermodern Python Tooling

Hypermodern Python Tooling

Claudio Jolowicz

Publisher Resources

ISBN: 9781098156077Errata Page