Cassandra cql: how to select the LAST n rows from a table

Tags:

cassandra

cql3

I want to verify that rows are getting added to the table. What cql statement would show the last n rows from the table below?

Table description below:

cqlsh:timeseries> describe table option_data;

CREATE TABLE option_data (
  ts bigint,
  id text,
  strike decimal,
  callask decimal,
  callbid decimal,
  maturity timestamp,
  putask decimal,
  putbid decimal,
  PRIMARY KEY ((ts), id, strike)
) WITH
  bloom_filter_fp_chance=0.010000 AND
  caching='KEYS_ONLY' AND
  comment='' AND
  dclocal_read_repair_chance=0.100000 AND
  gc_grace_seconds=864000 AND
  index_interval=128 AND
  read_repair_chance=0.000000 AND
  replicate_on_write='true' AND
  populate_io_cache_on_flush='false' AND
  default_time_to_live=0 AND
  speculative_retry='99.0PERCENTILE' AND
  memtable_flush_period_in_ms=0 AND
  compaction={'class': 'SizeTieredCompactionStrategy'} AND
  compression={'sstable_compression': 'LZ4Compressor'};

cqlsh:timeseries>

978

asked Oct 02 '14 20:10

Ivan

1 Answers

You didn't specify last n "by what".

To get the last N per id:

SELECT * FROM option_data WHERE ts=1 ORDER BY id DESC LIMIT N;

ORDER BY clause can only be applied to the second column in a compound primary key. If you need to query by time you will need to think about your data model a little more.

If your queries are most often "last N", you might consider writing something like this:

CREATE TABLE time_series (
    id text,
    t timeuuid,
    data text,
    PRIMARY KEY (id, t)
) WITH CLUSTERING ORDER BY (t DESC)

... where 'id' is your time series id. The CLUSTERING ORDER reverses the order of timeuuid 't', causing the cells to be stored in a natural order for your query.

With this, you would get the last five events as follows:

SELECT * FROM time_series WHERE id='stream id' LIMIT 5;

There is a lot of information out there for time series in Cassandra. I suggest reading some of the more recent articles on the matter.

147

answered Sep 19 '22 18:09

Adam Holmberg

Related questions
                            
                                Consistency Level of Cassandra Lightweight transactions
                            
                                What is the right way to use Cassandra driver from a web application
                            
                                Cassandra seconday index vs materialized view
                            
                                Cassandra selective copy
                            
                                how to take a keyspace as a dump in cassandra?
                            
                                Cassandra denormalization datamodel
                            
                                What to use for session management?
                            
                                Cassandra NOT EQUAL Operator
                            
                                Datastax Java Driver does not connect if one host is missing
                            
                                Fetch all rows in cassandra
                            
                                how UPDATE rows in cassandra using only Partition Key?
                            
                                Cassandra 3.0 and later require Java 8u40 or later
                            
                                Should I call session.close() and cluster. close() after each web API call
                            
                                Cassandra.yaml configuration error- expected '<document start>', but found Scalar
                            
                                SparkSQL error Table Not Found
                            
                                Passing parameter to Cassandra CQL query using DataStax client
                            
                                What is meant by a node in cassandra?
                            
                                what exactly is a map dimension in a multi-dimensional map?
                            
                                What NoSQL DB to use for sparse Time Series like data?
                            
                                RPC timeout in cqlsh - Cassandra

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With