5

Acero: A Streaming Arrow Execution Engine

We’re almost halfway through this book, and only now are we getting to analytical computations performed directly with Arrow. Kinda strange, right? If you’ve been following along, you should by now have a solid understanding of all the concepts you’ll need to benefit from the compute library.

The Arrow community has built Acero, an open source reference implementation of a computation and query engine on top of the Arrow format. The Acero library provides high-performance implementations of functions that operate on Arrow-formatted data, along with the means to build execution plans for streams of data. This might be to perform logical casting from one data ...
