What is the difference between spring batch remote chunking and remote partitioning? I can not understand the difference between remote chunking and remote partitioning in spring batch. Could anybody please explain?

Remote Partitioning Partitioning is a master/slave step configuration that allows for partitions of data to be processed in parallel. Each partition is described via some metadata. For example, if you were processing a database table, partition 1 may be ids 0-100, partition 2 being 101-200, etc. For Spring Batch, a master step uses a Partitioner to generate ExecutionContexts that contain the metadata for each partition. These ExecutionContexts are distributed to slave step for processing by a PartitionHandler (for remote partitioning, the MessageChannelPartitionHandler is typically used). The slaves execute their step and return the resulting statuses for aggregation by the master. Things to note about remote partitioning: <ul> <li>Input and output are local to the slaves. For example, if the input is a file, the slaves need access to the file.</li> <li>Slaves need access to the JobRepository. Slaves are fully defined Spring Batch steps and so they need JobRepository access.</li> </ul> Remote Chunking Remote chunking is similar to remote partitioning in that it is a master/slave configuration. However with remote chunking, the data is read at by the master and sent over the wire to the slave for processing. Once the processing is done, the result of the ItemProcessor is returned to the master for writing. Things to note about remote chunking: <ul> <li>All I/O is done by the master.</li> <li>The slaves handle processing only and therefore do not need JobRepository access.</li> <li>Remote chunking is more I/O intensive than remote partitioning since the actual data is sent over the wire instead of metadata describing it.</li> </ul> I did a talk on scaling Spring Batch and do a demonstration of remote partitioning that you can watch here: http://www.youtube.com/watch?v=CYTj5YT7CZU

Difference between spring batch remote chunking and remote partitioning

1 Answers

Remote Partitioning

Partitioning is a master/slave step configuration that allows for partitions of data to be processed in parallel. Each partition is described via some metadata. For example, if you were processing a database table, partition 1 may be ids 0-100, partition 2 being 101-200, etc. For Spring Batch, a master step uses a Partitioner to generate ExecutionContexts that contain the metadata for each partition. These ExecutionContexts are distributed to slave step for processing by a PartitionHandler (for remote partitioning, the MessageChannelPartitionHandler is typically used). The slaves execute their step and return the resulting statuses for aggregation by the master.

Things to note about remote partitioning:

Input and output are local to the slaves. For example, if the input is a file, the slaves need access to the file.
Slaves need access to the JobRepository. Slaves are fully defined Spring Batch steps and so they need JobRepository access.

Remote Chunking

Remote chunking is similar to remote partitioning in that it is a master/slave configuration. However with remote chunking, the data is read at by the master and sent over the wire to the slave for processing. Once the processing is done, the result of the ItemProcessor is returned to the master for writing.

Things to note about remote chunking:

All I/O is done by the master.
The slaves handle processing only and therefore do not need JobRepository access.
Remote chunking is more I/O intensive than remote partitioning since the actual data is sent over the wire instead of metadata describing it.

I did a talk on scaling Spring Batch and do a demonstration of remote partitioning that you can watch here: http://www.youtube.com/watch?v=CYTj5YT7CZU

answered Oct 16 '22 10:10

Michael Minella

Related questions
                            
                                LSB/MSB handling in Java
                            
                                JPA TemporalType.Date giving wrong date
                            
                                Upgrading to springframework.scheduling.concurrent?
                            
                                change a functions argument's values?
                            
                                opening jar file with admin privilege
                            
                                Getting an exception ORA-00942: table or view does not exist - when inserting into an existing table
                            
                                How to add Java EE plugin in plain eclipse
                            
                                Setting socket read timeout with javax.xml.soap.SOAPConnection
                            
                                Created File Has No Parent?
                            
                                Collections.newSetFromMap(»ConcurrentHashMap«) vs. Collections.synchronizedSet(»HashSet«)
                            
                                org.json.simple cannot be resolved
                            
                                Opening JNLP File in Java 6 JRE instead of JRE 7
                            
                                Java Square Root Integer Operations Without Casting?
                            
                                Oracle current_date or sysdate without hours, minutes, seconds [duplicate]
                            
                                Formatting numbers using DecimalFormat
                            
                                How to check whether a known uri file exists in Android storage?
                            
                                Splitting a multipage TIFF image into individual images (Java)
                            
                                Comma separated values within JSP for-each tag
                            
                                traversing a non binary tree in java [closed]
                            
                                Binding a Label's text property (in an FXML file) to an IntegerProperty (in a controller)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Difference between spring batch remote chunking and remote partitioning

Tags:

java

spring

spring-batch

javalearner

People also ask

1 Answers

Michael Minella

Recent Activity

Donate For Us