I use <code>Cassandra java driver</code>. I receive 150k requests per second, which I insert to 8 tables having different partition keys. My question is which is a better way: <ul> <li> batch inserting to these tables </li> <li> inserting one by one. </li> </ul> I am asking this question because , considering my request size (150k), batch sounds like the better option but because all the tables have different partition keys, batch appears expensive.

Please check my answer from below link: Cassandra batch query performance on tables having different partition keys Batches are not for improving performance. They are used for ensuring atomicity and isolation. <blockquote> <blockquote> Batching can be effective for single partition write operations. But batches are often mistakenly used in an attempt to optimize performance. Depending on the batch operation, the performance may actually worsen. </blockquote> </blockquote> https://docs.datastax.com/en/cql/3.3/cql/cql_using/useBatch.html If data consistency is not needed among those tables, then use single insert. Single requests are distributed or propagated properly (depends on load balancing policy) among nodes. If you are concerned about request handling and use batch, batches will burden so many extra works on coordinator nodes which will not be efficient I guess :)

Cassandra batch query vs single insert performance

2 Answers

Please check my answer from below link:

Cassandra batch query performance on tables having different partition keys

Batches are not for improving performance. They are used for ensuring atomicity and isolation.

Batching can be effective for single partition write operations. But batches are often mistakenly used in an attempt to optimize performance. Depending on the batch operation, the performance may actually worsen.

https://docs.datastax.com/en/cql/3.3/cql/cql_using/useBatch.html

If data consistency is not needed among those tables, then use single insert. Single requests are distributed or propagated properly (depends on load balancing policy) among nodes. If you are concerned about request handling and use batch, batches will burden so many extra works on coordinator nodes which will not be efficient I guess :)

answered Sep 17 '22 20:09

Chaity

Batches have a HUGE impact on performance instead. The sollution that best suits you as I understand to split into diffirent lists per partition keys and then use batch statements. You will see a huge impact on performance.

answered Sep 18 '22 20:09

giannisapi

Related questions
                            
                                Exception while trying to acquire a JMH lock
                            
                                Is there a way to create package-info.java for existing packages in one move in eclipse?
                            
                                Gradle exclude java class from lib replaced by own class to avoid duplicate
                            
                                Gradle sync failed: Unable to load class 'org.gradle.internal.logging.LoggingManagerInternal'
                            
                                Jersey Multiple Produces
                            
                                JavaFX: Add children to ScrollPane
                            
                                How to capture the List of removed items from Java 8 Stream filtering?
                            
                                Spark - Divide int with column?
                            
                                Can not find JDK and Maven Configuration in Jenkins
                            
                                GraphQL implementation Java [closed]
                            
                                How to install Eclipse Neon behind a proxy
                            
                                Given `T` and `U` where `T extends U` how to return a `U`
                            
                                Set opacity of a decorated JFrame in Java 8
                            
                                Why is volatile keyword not allowed for local variables?
                            
                                Unable to start Chrome CustomTabsIntent in my Android app
                            
                                Filter a list of objects in Android using gradle-retrolambda and Lightweight-Stream-API
                            
                                NoSuchMethodException: java.time.LocalDateTime.<init>() reading CSV using Super CSV
                            
                                Is Stream.count() guranteed to visit each element?
                            
                                How to use Spring Security to custom login page?
                            
                                How to copy/transform an AutoValue object with builder

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Cassandra batch query vs single insert performance

Tags:

java

cassandra

datastax

Prakash P

People also ask

2 Answers

Chaity

giannisapi

Recent Activity

Donate For Us