book

Time Series Databases: New Ways to Store and Access Data

by Ted Dunning, Ellen Friedman

December 2014

Intermediate to advanced

60 pages

1h 55m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Preface
In This Book
1. Time Series Data: Why Collect It?
Time Series Data Is an Old IdeaTime Series Data Sets Reveal TrendsA New Look at Time Series Databases
2. A New World for Time Series Databases
Stock Trading and Time Series DataMaking Sense of SensorsTalking to Towers: Time Series and TelecomData Center MonitoringEnvironmental Monitoring: Satellites, Robots, and MoreThe Questions to Be Asked
3. Storing and Processing Time Series Data
Simplest Data Store: Flat FilesMoving Up to a Real Database: But Will RDBMS Suffice?NoSQL Database with Wide TablesNoSQL Database with Hybrid DesignGoing One Step Further: The Direct Blob Insertion DesignWhy Relational Databases Aren’t Quite RightHybrid Design: Where Can I Get One?
4. Practical Time Series Tools
Introduction to Open TSDB: Benefits and LimitationsArchitecture of Open TSDBValue Added: Direct Blob Loading for High PerformanceA New Twist: Rapid Loading of Historical DataSummary of Open Source Extensions to Open TSDB for Direct Blob LoadingAccessing Data with Open TSDBWorking on a Higher LevelAccessing Open TSDB Data Using SQL-on-Hadoop ToolsUsing Apache Spark SQLWhy Not Apache Hive?Adding Grafana or Metrilyx for Nicer DashboardsPossible Future Extensions to Open TSDBCache Coherency Through Restart Logs
5. Solving a Problem You Didn’t Know You Had
The Need for Rapid Loading of Test DataUsing Blob Loader for Direct Insertion into the Storage Tier
6. Time Series Data in Practical Machine Learning
Predictive Maintenance Scheduling
7. Advanced Topics for Time Series Databases
Stationary DataWandering SourcesSpace-Filling Curves
8. What’s Next?
A New Frontier: TSDBs, Internet of Things, and MoreNew Options for Very High-Performance TSDBsLooking to the Future
A. Resources
Tools for Working with NoSQL Time Series DatabasesMore Information About Use Cases Mentioned in This BookAdditional O’Reilly Publications by Dunning and Friedman

About the Authors
Colophon
Copyright

Content preview from Time Series Databases: New Ways to Store and Access Data

Chapter 3. Storing and Processing Time Series Data

As we mentioned in previous chapters, a time series is a sequence of values, each with a time value indicating when the value was recorded. Time series data entries are rarely amended, and time series data is often retrieved by reading a contiguous sequence of samples, possibly after summarizing or aggregating the retrieved samples as they are retrieved. A time series database is a way to store multiple time series such that queries to retrieve data from one or a few time series for a particular time range are particularly efficient. As such, applications for which time range queries predominate are often good candidates for implementation using a time series database. As previously explained, the main topic of this book is the storage and processing of large-scale time series data, and for this purpose, the preferred technologies are NoSQL non-relational databases such as Apache HBase or MapR-DB.

Pragmatic advice for practical implementations of large-scale time series databases is the goal of this book, so we need to focus in on some basic steps that simplify and strengthen the process for real-world applications. We will look briefly at approaches that may be useful for small or medium-sized datasets and then delve more deeply into our main concern: how to implement large-scale TSDBs.

To get to a solid implementation, there are a number of design decisions to make. The drivers for these decisions are the parameters that define ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781491920909Errata

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Time Series Databases: New Ways to Store and Access Data

by Ted Dunning, Ellen Friedman

Chapter 3. Storing and Processing Time Series Data

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.