Probabilistic data structures in Python: video excerpt

Use approximations with error bounds to trade off system resources such as memory or compute time, especially for large-scale analytics and streaming data.

By Paco Nathan August 1, 2016 • 1 minute read



Probabilistic data structures are a relatively new area of algorithms. You may also hear the terms approximation algorithms, sketch algorithms, or online algorithms used to describe similar work. These approaches provide approximations with error bounds: the bounds are calculated in advance, probabilistically, so that well-formed approximations are built directly into the data collections. Applications no longer need batch windows, stop-to-fit models, and the like. Instead, they can sample the needed results at any point in a real-time data stream.
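As one illustration of an error bound chosen in advance, here is a minimal sketch of a Bloom filter, a widely used probabilistic structure for set membership. It is not taken from the tutorial's notebook. The class name `BloomFilter`, its parameters `capacity` and `error_rate`, and the SHA-256 double-hashing scheme are all assumptions made for this example. The sizing uses the standard formulas m = -n ln(p) / (ln 2)^2 bits and k = (m / n) ln 2 hash functions.

```python
import hashlib
import math


class BloomFilter:
    """Illustrative set-membership sketch (hypothetical class, not from the notebook).

    Never reports a false negative; reports false positives at roughly
    `error_rate` once `capacity` items have been added.
    """

    def __init__(self, capacity, error_rate):
        # Size the bit array and hash count up front from the target error rate.
        self.num_bits = math.ceil(-capacity * math.log(error_rate) / math.log(2) ** 2)
        self.num_hashes = max(1, round(self.num_bits / capacity * math.log(2)))
        self.bits = bytearray((self.num_bits + 7) // 8)

    def _positions(self, item):
        # Double hashing: derive all bit positions from one SHA-256 digest.
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # keep the stride odd
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(item)
        )


# Simulate a stream: the filter can be queried at any point while items arrive.
seen = BloomFilter(capacity=10_000, error_rate=0.01)
for i in range(10_000):
    seen.add(f"user-{i}")

# Probe with items that were never added to estimate the observed error rate.
false_positives = sum(f"other-{i}" in seen for i in range(10_000))
observed_rate = false_positives / 10_000
print(f"bits={seen.num_bits} hashes={seen.num_hashes} observed={observed_rate:.4f}")
```

The memory cost here is fixed at about 12 KB regardless of how long the keys are, which is the trade the article describes: a small, known error rate in exchange for much lower resource use than storing every key exactly.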

This tutorial is intended for Python programmers with some background in big data who now need to apply probabilistic data structures to analytics on large-scale data, to streaming applications, and especially to use cases that require both. The notebook shows Python code examples for five of the better-known structures and explores their potential use cases.

