In my application I need to massively improve insert performance. For example, a file with about 21K records takes over 100 minutes to insert. There are reasons it can take some time, say 20 minutes or so, but over 100 minutes is just too long.
Data is inserted into 3 tables (many-to-many). IDs are generated from a sequence, but I have already googled and set hibernate.id.new_generator_mappings = true
and the allocationSize + sequence increment to 1000.
Also, the amount of data is not extraordinary at all; the file is 90 MB.
I have verified with VisualVM that most of the time is spent in the JDBC driver (PostgreSQL) and Hibernate. I think the issue is related to a unique constraint in the child table. The service layer makes a manual check (= SELECT) before inserting; if the record already exists, it reuses it instead of waiting for a constraint exception.
So to sum it up: for this specific file there will be 1 insert per table per record (it could be different, but not for this file, which is the ideal (fastest) case). That means 60k inserts + 20k selects in total. Still, over 100 minutes seems very long (yes, hardware counts, and it is a simple PC with a 7200 rpm drive, no SSD or RAID). However, this is an improved version of a previous application (plain JDBC) in which the same insert on this hardware took about 15 minutes. Considering that in both cases about 4-5 minutes are spent on "pre-processing", the increase is massive.
Any tips on how this could be improved? Is there any batch loading functionality?
To optimize insert speed, combine many small operations into a single large operation. Ideally, you make a single connection, send the data for many new rows at once, and delay all index updates and consistency checking until the very end.
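For example, with plain JDBC this means keeping one connection and one transaction and sending the rows with addBatch()/executeBatch() instead of one statement per row. A minimal sketch, assuming a hypothetical records table and column name:

    import java.sql.Connection;
    import java.sql.DriverManager;
    import java.sql.PreparedStatement;
    import java.util.List;

    public class BatchInsertSketch {

        /** Inserts all rows in JDBC batches over a single connection. */
        static void insertAll(String url, String user, String pw, List<String> names) throws Exception {
            try (Connection con = DriverManager.getConnection(url, user, pw)) {
                con.setAutoCommit(false); // commit once at the end, not per row
                // "records" and its column are hypothetical placeholders
                String sql = "INSERT INTO records (name) VALUES (?)";
                try (PreparedStatement ps = con.prepareStatement(sql)) {
                    int count = 0;
                    for (String name : names) {
                        ps.setString(1, name);
                        ps.addBatch();
                        if (++count % 1000 == 0) {
                            ps.executeBatch(); // send 1000 rows in one round trip
                        }
                    }
                    ps.executeBatch(); // send any remaining rows
                }
                con.commit();
            }
        }
    }

Committing once at the end and choosing a batch size in the hundreds or thousands keeps per-row round-trip and commit overhead out of the hot path; the value 1000 here is only an example.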
Drop indexes before inserting the data. We should drop the index before inserting a large amount of data; this makes the insert statements run faster, and the index can be recreated once the load is done.
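A minimal sketch of that pattern over JDBC, assuming a hypothetical index idx_records_name on a hypothetical records table:

    import java.sql.Connection;
    import java.sql.Statement;

    public class IndexDropSketch {
        /** Drops a (hypothetical) index, runs the bulk load, then rebuilds the index. */
        static void loadWithoutIndex(Connection con, Runnable bulkLoad) throws Exception {
            try (Statement st = con.createStatement()) {
                st.execute("DROP INDEX IF EXISTS idx_records_name");            // hypothetical index name
                bulkLoad.run();                                                  // the actual bulk inserts
                st.execute("CREATE INDEX idx_records_name ON records (name)");  // rebuild once, at the end
            }
        }
    }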
One of the most common ways to improve the performance of an INSERT operation in Oracle is to use the APPEND optimizer hint. APPEND forces the optimizer to perform a direct-path INSERT and appends new values above the high-water mark (the end of the table) while new blocks are being allocated.
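A minimal sketch of such a statement issued through JPA, assuming an Oracle database (the APPEND hint is Oracle-specific and applies to INSERT ... SELECT) and hypothetical table and column names:

    import javax.persistence.EntityManager;

    public class DirectPathInsertSketch {
        /** Direct-path insert via the Oracle APPEND hint; tables and columns are hypothetical. */
        static void copyIntoArchive(EntityManager em) {
            em.createNativeQuery(
                    "INSERT /*+ APPEND */ INTO records_archive (id, name) "
                  + "SELECT id, name FROM records")
              .executeUpdate();
        }
    }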
See spring-data JPA: manual commit transaction and restart new one.
Add entityManager.flush() and entityManager.clear() after every n-th call to the save() method. If you use Hibernate, also set hibernate.jdbc.batch_size=100, which seems like a reasonable choice.
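A minimal sketch of that pattern, assuming a hypothetical ImportRecord entity and repository and a batch size of 100 (kept in sync with hibernate.jdbc.batch_size; with Spring Boot that property would typically be set as spring.jpa.properties.hibernate.jdbc.batch_size=100):

    import java.util.List;
    import javax.persistence.Entity;
    import javax.persistence.EntityManager;
    import javax.persistence.GeneratedValue;
    import javax.persistence.GenerationType;
    import javax.persistence.Id;
    import javax.persistence.PersistenceContext;
    import org.springframework.data.jpa.repository.JpaRepository;
    import org.springframework.stereotype.Service;
    import org.springframework.transaction.annotation.Transactional;

    // Hypothetical entity and Spring Data repository, only for illustration.
    @Entity
    class ImportRecord {
        @Id @GeneratedValue(strategy = GenerationType.SEQUENCE)
        Long id;
        String name;
    }

    interface ImportRecordRepository extends JpaRepository<ImportRecord, Long> {}

    @Service
    public class RecordImportService {

        private static final int BATCH_SIZE = 100; // keep in sync with hibernate.jdbc.batch_size

        @PersistenceContext
        private EntityManager entityManager;

        private final ImportRecordRepository repository;

        public RecordImportService(ImportRecordRepository repository) {
            this.repository = repository;
        }

        @Transactional
        public void importAll(List<ImportRecord> records) {
            int count = 0;
            for (ImportRecord record : records) {
                repository.save(record);
                if (++count % BATCH_SIZE == 0) {
                    entityManager.flush(); // push the pending inserts to the driver as a JDBC batch
                    entityManager.clear(); // detach them so the persistence context stays small
                }
            }
            entityManager.flush(); // flush whatever is left of the last, partial batch
            entityManager.clear();
        }
    }

Flushing sends the accumulated inserts as one JDBC batch, and clearing prevents the persistence context from growing with every saved entity, which is what slows long-running imports down.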
Performance increase was > 10x, probably close to 100x.