What are series and bucket in InfluxDb

Tags:

influxdb

influxdb-2

While trying to understand different concepts of InfluxDb I came across this documentation, where there is a comparision of terms with SQL database.

An InfluxDB measurement is similar to an SQL database table.
InfluxDB tags are like indexed columns in an SQL database.
InfluxDB fields are like unindexed columns in an SQL database.
InfluxDB points are similar to SQL rows.

But there are couple of other terminology which I came across, which I could not clearly understand and wondering if there is an SQL equivalent for that.

Series
Bucket

From what I understand from the documentation

series is the collection of data that share a retention policy, measurement, and tag set.

Does this mean a series is a subset of data in a database table? Or is it like database views ?
I could not see any documentation explaining buckets. I guess this is a new concept in 2.0 release

Can someone please clarify these two concepts.

733

asked Oct 01 '19 18:10

pvpkiran

2 Answers

I have summarized my understanding below:

A bucket is named location with retention policy where time-series data is stored.
A series is a logical grouping of data defined by shared measurement, tag and field.
A measurement is similar to an SQL database table.
A tag is similar to indexed columns in an SQL database.
A field is similar to unindexed columns in an SQL database.
A point is similar to SQL row.

For example, a SQL table workdone:

`Email`	`Status`	`time`	`Completed`
[email protected]	start	1636775801000000000	76
[email protected]	finish	1636775868000000000	120
[email protected]	start	1636775801000000000	0
[email protected]	finish	1636775868000000000	20
[email protected]	start	1636775801000000000	54
[email protected]	finish	1636775868000000000	56

The columns Email and Status are indexed.

Hence:

Measurement: workdone
Tags: Email, Status
Field: Completed
Series (Cardinality = 3 x 2 = 6):
1. Measurement: workdone; Tags: Email: [email protected], Status: start; Field: Completed
2. Measurement: workdone; Tags: Email: [email protected], Status: finish; Field: Completed
3. Measurement: workdone; Tags: Email: [email protected], Status: start; Field: Completed
4. Measurement: workdone; Tags: Email: [email protected], Status: finish; Field: Completed
5. Measurement: workdone; Tags: Email: [email protected], Status: start; Field: Completed
6. Measurement: workdone; Tags: Email: [email protected], Status: finish; Field: Completed

Splitting a logical series across multiple buckets may not improve performance but may complicate flux query as need to include multiple buckets.

117

answered Sep 21 '22 02:09

yoonghm

According to the InfluxDB glossary:

Bucket

A bucket is a named location where time-series data is stored in InfluxDB 2.0. In InfluxDB 1.8+, each combination of a database and a retention policy (database/retention-policy) represents a bucket. Use the InfluxDB 2.0 API compatibility endpoints included with InfluxDB 1.8+ to interact with buckets.

Series

A logical grouping of data defined by shared measurement, tag set, and field key.

answered Sep 21 '22 02:09

Benyamin Jafari

Related questions
                            
                                How to retrive more than 10k lines from InfluxDB using Pandas?
                            
                                How to run InfluxDB on Heroku?
                            
                                Query InfluxDB for specific hours every day
                            
                                Create grafana dashboards with api
                            
                                How can I send just one alert with kapacitor if something is down?
                            
                                Big data with very fast access
                            
                                Merging different granularity time series in influxdb
                            
                                Calculating request per second using InfluxDB on Grafana
                            
                                Docker Daemon stop - Timeout for container defaults 10s
                            
                                How do you INSERT into influxDB using the SQL-like interface?
                            
                                Rename MEASUREMENT
                            
                                Can I create different retention policy for different measurements in influxdb?
                            
                                InfluxDB data structure & database model
                            
                                InfluxDB - what's shard group duration
                            
                                Query tags from InfluxDB with respect of timeFilter for Grafana variables templating
                            
                                how to get databases list at influxdb in v0.8

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What are series and bucket in InfluxDb

Tags:

influxdb

influxdb-2

pvpkiran

People also ask

2 Answers

yoonghm

Bucket

Series

Benyamin Jafari

Recent Activity

Donate For Us