Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Best Data Structure For Time Series Data

I wonder if someone could take a minute out of their day to give their two cents on my problem.

I would like some suggestions on what would be the best data structure for representing, on disk, a large data set of time series data. The main priority is speed of insertion, with other priorities in decreasing order; speed of retrieval, size on disk, size in memory, speed of removal.

I have seen that B+ trees are often used in database because of their fast search times, but how about for fast insertion times? Is a linked list really the way to go?

like image 628
ChrisBellew Avatar asked Mar 31 '10 03:03

ChrisBellew


People also ask

What is time series data structure?

The TimeSeries data type is a constructor data type that groups together a collection of ROW data type in time stamp order. A ROW data type consists of a group of named columns. The rows in a TimeSeries data type, called elements, each represent one or more data values for a specific time stamp.

How best should time series data be presented?

A line graph is the simplest way to represent time series data. It is intuitive, easy to create, and helps the viewer get a quick sense of how something has changed over time. A line graph uses points connected by lines (also called trend lines) to show how a dependent variable and independent variable changed.

Which data structure is most efficient to?

Arrays. An array is a linear data structure that holds an ordered collection of values. It's the most efficient in storing and accessing a sequence of objects.


1 Answers

You might want to look into HDF5 (Hierarchical Data Format). It's well suited for Time Series data. Implementation wise, it uses B Trees.

like image 132
I82Much Avatar answered Oct 06 '22 03:10

I82Much