How does Spring Batch manage transactions (with possibly multiple datasources)?

Tags:

I would like some information about the data flow in a Spring Batch processing but fail to find what I am looking for on the Internet (despite some useful questions on this site).

I am trying to establish standards to use Spring Batch in our company and we are wondering how Spring Batch behaves when several processors in a step updates data on different data sources.

This question focuses on a chunked process but feel free to provide information on other modes.

From what I have seen (please correct me if I am wrong), when a line is read, it follows the whole flow (reader, processors, writer) before the next is read (as opposed to a silo-processing where reader would process all lines, send them to the processor, and so on).

In my case, several processors read data (in different databases) and updates them in the process, and finally the writer inserts data into yet another DB. For now, the JobRepository is not linked to a database, but that would be an independent one, making the thing still a bit more complex.

This model cannot be changed since the data belongs to several business areas.

How is the transaction managed in this case? Is the data committed only once the full chunk is processed? And then, is there a 2-phase commit management? How is it ensured? What development or configuration should be made in order to ensure the consistency of data?

More generally, what would your recommendations be in a similar case?

633

asked May 26 '15 12:05

Chop

1 Answers

Spring batch uses the Spring core transaction management, with most of the transaction semantics arranged around a chunk of items, as described in section 5.1 of the Spring Batch docs.

The transaction behaviour of the readers and writers depends on exactly what they are (eg file system, database, JMS queue etc), but if the resource is configured to support transactions then they will be enlisted by spring automatically. Same goes for XA - if you make the resource endpoint a XA compliant then it will utilise 2 phase commits for it.

Getting back to the chunk transaction, it will set up a transaction on chunk basis, so if you set the commit interval to 5 on a given tasklet then it will open and close a new transaction (that includes all resources managed by the transaction manager) for the set number of reads (defined as commit-interval).

But all of this is set up around reading from a single data source, does that meet your requirement? I'm not sure spring batch can manage a transaction where it reads data from multiple sources and writes the processor result into another database within a single transaction. (In fact I can't think of anything that could do that...)

answered Oct 21 '22 20:10

stringy05

Related questions
                            
                                Running each JUnit test in a separate JVM in Eclipse?
                            
                                How to connect remote EJB module from application client
                            
                                Static Method Memory Allocation
                            
                                What's the correct algorithm to determine number of user-perceived-characters?
                            
                                How to get jmap histogram programmatically?
                            
                                Trying to use Rhino, getEngineByName("JavaScript") returns null in OpenJDK 7
                            
                                Consuming RESTful service over https with certificate using Java
                            
                                Given an array [a1b2c3d4] convert to [abcd1234]
                            
                                Spring 3.2 and Jackson 2: add custom object mapper
                            
                                Displaying an animated PNG (apng) using Swing?
                            
                                Unexpected complexity of common methods (size) in Java Collections Framework?
                            
                                Java special math functions library
                            
                                Primitive alternative to Guava Table
                            
                                How can I handle IOException when Kafka is down?
                            
                                Mixing two audio streams into a single audio stream in android?
                            
                                Bluetooth Connection failed "java.io.IOException: read failed, socket might closed or timeout, read ret: -1"
                            
                                Alternatives to DDLUtils from apache
                            
                                Scale down the Valo theme’s spacing and widget size to that of the Reindeer theme
                            
                                When executing proguard-maven-plugin, "CreateProcess error=206, The filename or extension is too long" occurs
                            
                                Changing data on GET page request (dealing with preloading requests)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does Spring Batch manage transactions (with possibly multiple datasources)?

Tags:

java

transactions

spring-batch

Chop

People also ask

1 Answers

stringy05

Recent Activity

Donate For Us