How to handle a large set of data using Spring Data Repositories?

Tags: repository, spring, spring-data

I have a large table that I'd like to access via a Spring Data Repository.

Currently, I'm trying to extend the PagingAndSortingRepository interface, but it seems I can only define methods that return lists, e.g.:

public interface MyRepository extends 
        PagingAndSortingRepository<MyEntity, Integer>
{
  @Query(value="SELECT * ...")
  List<MyEntity> myQuery(Pageable p);
}

On the other hand, the findAll() method that comes with PagingAndSortingRepository returns an Iterable (and I suppose that the data is not loaded into memory).

Is it possible to define custom queries that also return Iterable and/or don't load all the data into memory at once?

Are there any alternatives for handling large tables?

429

asked Mar 05 '13 18:03

José Ricardo

2 Answers

We have the classic consulting answer here: it depends. As the implementation of the method is store-specific, we depend on the underlying store API. In the case of JPA, there's no chance to provide streaming access, as ….getResultList() returns a List. Hence we also expose the List to the client, especially as JPA developers are likely used to working with lists. So for JPA, the only option is to use the pagination API.

For a store like Neo4j, we support streaming access, as the repositories return Iterable on CRUD methods as well as on the execution of finder methods.
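
To illustrate the pagination route for JPA, here is a minimal sketch in plain Java of walking a large result set one page at a time. Nothing in it is Spring API: processInPages, slice, and the in-memory table are hypothetical stand-ins, and the fetchPage function plays the role of a repository call such as repository.myQuery(PageRequest.of(pageNumber, pageSize)). It assumes the query has a stable sort order, since otherwise offset-based pages can overlap or skip rows.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BiFunction;
import java.util.function.Consumer;

public class PagedProcessing {

    // Calls fetchPage(pageNumber, pageSize) repeatedly, starting at page 0,
    // and hands every returned item to the handler. Stops at the first page
    // that comes back shorter than pageSize. Returns the number of items seen.
    // fetchPage is a hypothetical stand-in for a Pageable repository method.
    static <T> long processInPages(BiFunction<Integer, Integer, List<T>> fetchPage,
                                   int pageSize,
                                   Consumer<T> handler) {
        if (pageSize <= 0) {
            throw new IllegalArgumentException("pageSize must be positive");
        }
        long processed = 0;
        int pageNumber = 0;
        while (true) {
            List<T> page = fetchPage.apply(pageNumber, pageSize);
            for (T item : page) {
                handler.accept(item);
                processed++;
            }
            if (page.size() < pageSize) {
                break; // short (or empty) page: nothing left to fetch
            }
            pageNumber++;
        }
        return processed;
    }

    // In-memory stand-in for the database: returns the requested slice of
    // the "table", or an empty list when the page lies past the end.
    static <T> List<T> slice(List<T> table, int pageNumber, int pageSize) {
        int from = (int) Math.min((long) pageNumber * pageSize, table.size());
        int to = Math.min(from + pageSize, table.size());
        return table.subList(from, to);
    }

    public static void main(String[] args) {
        List<Integer> table = new ArrayList<>();
        for (int i = 1; i <= 10; i++) {
            table.add(i);
        }
        long count = PagedProcessing.<Integer>processInPages(
                (page, size) -> slice(table, page, size),
                3,
                item -> System.out.println("processing " + item));
        System.out.println("processed " + count + " rows");
    }
}
```

With this pattern only one page of entities is referenced at a time. In a real JPA setup you would probably also want to clear the persistence context between pages, so that entities already processed can be garbage collected.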

100

answered Jan 24 '23 20:01

Oliver Drotbohm

The implementation of findAll() simply loads the entire list of all entities into memory. Its Iterable return type doesn't imply that it implements some sort of database-level cursor handling.

On the other hand, your custom myQuery(Pageable) method will only load one page's worth of entities, because the generated implementation honours its Pageable parameter. You can declare its return type either as Page or List. In the latter case you still receive the same (restricted) number of entities, but not the metadata that a Page would additionally carry.

So you basically did the right thing to avoid loading all entities into memory in your custom query.
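
As a rough illustration of the Page versus List difference, the plain-Java sketch below returns the same slice of entities both ways. PageSketch, asList, and asPage are hypothetical stand-ins, not Spring types; the real Page interface exposes comparable metadata through methods such as getTotalElements(), getTotalPages(), and hasNext(). Keep in mind that with Spring Data JPA a Page return type typically costs an additional count query to obtain the total.

```java
import java.util.List;

public class PageVersusList {

    // Hypothetical, simplified stand-in for Spring Data's Page<T>: the same
    // content a List return type would give, plus paging metadata.
    static final class PageSketch<T> {
        final List<T> content;
        final int number;          // zero-based page index
        final int size;            // requested page size
        final long totalElements;  // in a real store this needs a count query

        PageSketch(List<T> content, int number, int size, long totalElements) {
            this.content = content;
            this.number = number;
            this.size = size;
            this.totalElements = totalElements;
        }

        int totalPages() {
            return size == 0 ? 1 : (int) Math.ceil((double) totalElements / size);
        }

        boolean hasNext() {
            return number + 1 < totalPages();
        }
    }

    // "List" flavour: only the restricted slice, no metadata.
    static <T> List<T> asList(List<T> table, int number, int size) {
        int from = (int) Math.min((long) number * size, table.size());
        int to = Math.min(from + size, table.size());
        return table.subList(from, to);
    }

    // "Page" flavour: the same slice wrapped together with the metadata.
    static <T> PageSketch<T> asPage(List<T> table, int number, int size) {
        return new PageSketch<>(asList(table, number, size), number, size, table.size());
    }

    public static void main(String[] args) {
        List<String> table = List.of("a", "b", "c", "d", "e");
        PageSketch<String> page = asPage(table, 1, 2);
        System.out.println("as List: " + asList(table, 1, 2));
        System.out.println("as Page: " + page.content + ", page " + (page.number + 1)
                + " of " + page.totalPages() + ", total rows " + page.totalElements
                + ", hasNext=" + page.hasNext());
    }
}
```

If the caller only iterates until a short page comes back, the List flavour is enough; the Page flavour is mainly useful when the total count or page count has to be shown or used for control flow.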

Please review the related documentation here.

answered Jan 24 '23 21:01

zagyi
