I have been prototyping my new application in Spring Data REST backed by Spring Data JPA & Hibernate, which has been a fantastic productivity booster for my team, but as the data model becomes more complex the performance is going down the tubes. Looking at the executed SQL, I see two separate but related problems: <ol> <li>When using a <code>Projection</code> with only a few properties to reduce the size of my payload, SDR is still loading the entire entity graph, with all the overhead that incurs. EDIT: filed DATAREST-1089</li> <li>There seems to be no way to specify eager loading using JPA, as SDR auto-generates the repository methods so I can't add <code>@EntityGraph</code> to them. (and per DATAREST-905 below, even that doesn't work) EDIT: addressed in Cepr0's answer below, though this can only be applied to each finder method once. See DATAJPA-749</li> </ol> I have one key model that I use several different projections of depending on the context (list page, view page, autocomplete, related item page, etc), so implementing one custom <code>ResourceProcessor</code> doesn't seem like a solution.) Has anyone found a way around these problems? Otherwise anyone with a non-trivial object graph will see performance deteriorate drastically as their model grows. My research: <ul> <li> How do I avoid n+1 queries with Spring Data Rest? (from 2013)</li> <li> https://jira.spring.io/browse/DATAJPA-466 ( Add support for lazy loading configuration via JPA 2.1 fetch-/loadgraph.)</li> <li> https://jira.spring.io/browse/DATAREST-905 ( No way to avoid loading all child relations in spring-data-rest?) (2016, unanswered)</li> </ul>

To fight with 1+N issue I use the following two approaches: @EntityGraph I use '@EntityGraph' annotation in Repository for <code>findAll</code> method. Just override it: <pre class="prettyprint lang-java prettyprint-override"><code>@Override @EntityGraph(attributePaths = {"author", "publisher"}) Page<Book> findAll(Pageable pageable); </code></pre> This approach is suitable for all "reading" methods of Repository. Cache I use cache to reduce the impact of 1+N issue for complex Projections. Suppose we have Book entity to store the book data and Reading entity to store the information about the number of readings of a specific Book and its reader rating. To get this data we can make a Projection like this: <pre class="prettyprint lang-java prettyprint-override"><code>@Projection(name = "bookRating", types = Book.class) public interface WithRatings { String getTitle(); String getIsbn(); @Value("#{@readingRepo.getBookRatings(target)}") Ratings getRatings(); } </code></pre> Where <code>readingRepo.getBookRatings</code> is the method of ReadingRepository: <pre class="prettyprint lang-java prettyprint-override"><code>@RestResource(exported = false) @Query("select avg(r.rating) as rating, count(r) as readings from Reading r where r.book = ?1") Ratings getBookRatings(Book book); </code></pre> It also return a projection that store "rating" info: <pre class="prettyprint lang-java prettyprint-override"><code>@JsonSerialize(as = Ratings.class) public interface Ratings { @JsonProperty("rating") Float getRating(); @JsonProperty("readings") Integer getReadings(); } </code></pre> The request of <code>/books?projection=bookRating</code> will cause the invocation of <code>readingRepo.getBookRatings</code> for every Book which will lead to redundant N queries. To reduce the impact of this we can use the cache: Preparing the cache in the SpringBootApplication class: <pre class="prettyprint lang-java prettyprint-override"><code>@SpringBootApplication @EnableCaching public class Application { //... @Bean public CacheManager cacheManager() { Cache bookRatings = new ConcurrentMapCache("bookRatings"); SimpleCacheManager manager = new SimpleCacheManager(); manager.setCaches(Collections.singletonList(bookRatings)); return manager; } } </code></pre> Then adding a corresponding annotation to <code>readingRepo.getBookRatings</code> method: <pre class="prettyprint lang-java prettyprint-override"><code>@Cacheable(value = "bookRatings", key = "#a0.id") @RestResource(exported = false) @Query("select avg(r.rating) as rating, count(r) as readings from Reading r where r.book = ?1") Ratings getBookRatings(Book book); </code></pre> And implementing the cache eviction when Book data is updated: <pre class="prettyprint lang-java prettyprint-override"><code>@RepositoryEventHandler(Reading.class) public class ReadingEventHandler { private final @NonNull CacheManager cacheManager; @HandleAfterCreate @HandleAfterSave @HandleAfterDelete public void evictCaches(Reading reading) { Book book = reading.getBook(); cacheManager.getCache("bookRatings").evict(book.getId()); } } </code></pre> Now all subsequent requests of <code>/books?projection=bookRating</code> will get rating data from our cache and will not cause redundant requests to the database. More info and working example is here.

How to avoid N+1 queries with Spring Data REST Projections?

Tags:

spring-data-jpa

spring-data-rest

I have been prototyping my new application in Spring Data REST backed by Spring Data JPA & Hibernate, which has been a fantastic productivity booster for my team, but as the data model becomes more complex the performance is going down the tubes. Looking at the executed SQL, I see two separate but related problems:

When using a Projection with only a few properties to reduce the size of my payload, SDR is still loading the entire entity graph, with all the overhead that incurs. EDIT: filed DATAREST-1089
There seems to be no way to specify eager loading using JPA, as SDR auto-generates the repository methods so I can't add @EntityGraph to them. (and per DATAREST-905 below, even that doesn't work) EDIT: addressed in Cepr0's answer below, though this can only be applied to each finder method once. See DATAJPA-749

I have one key model that I use several different projections of depending on the context (list page, view page, autocomplete, related item page, etc), so implementing one custom ResourceProcessor doesn't seem like a solution.)

Has anyone found a way around these problems? Otherwise anyone with a non-trivial object graph will see performance deteriorate drastically as their model grows.

My research:

How do I avoid n+1 queries with Spring Data Rest? (from 2013)
https://jira.spring.io/browse/DATAJPA-466 ( Add support for lazy loading configuration via JPA 2.1 fetch-/loadgraph.)
https://jira.spring.io/browse/DATAREST-905 ( No way to avoid loading all child relations in spring-data-rest?) (2016, unanswered)

983

asked Jun 05 '17 23:06

joshwa

1 Answers

To fight with 1+N issue I use the following two approaches:

@EntityGraph

I use '@EntityGraph' annotation in Repository for findAll method. Just override it:

@Override
@EntityGraph(attributePaths = {"author", "publisher"})
Page<Book> findAll(Pageable pageable);

This approach is suitable for all "reading" methods of Repository.

Cache

I use cache to reduce the impact of 1+N issue for complex Projections.

Suppose we have Book entity to store the book data and Reading entity to store the information about the number of readings of a specific Book and its reader rating. To get this data we can make a Projection like this:

@Projection(name = "bookRating", types = Book.class)
public interface WithRatings {

    String getTitle();
    String getIsbn();

    @Value("#{@readingRepo.getBookRatings(target)}")
    Ratings getRatings();
}

Where readingRepo.getBookRatings is the method of ReadingRepository:

@RestResource(exported = false)
@Query("select avg(r.rating) as rating, count(r) as readings from Reading r where r.book = ?1")
Ratings getBookRatings(Book book);

It also return a projection that store "rating" info:

@JsonSerialize(as = Ratings.class)
public interface Ratings {

    @JsonProperty("rating")
    Float getRating();

    @JsonProperty("readings")
    Integer getReadings();
}

The request of /books?projection=bookRating will cause the invocation of readingRepo.getBookRatings for every Book which will lead to redundant N queries.

To reduce the impact of this we can use the cache:

Preparing the cache in the SpringBootApplication class:

@SpringBootApplication
@EnableCaching
public class Application {

    //...

    @Bean
    public CacheManager cacheManager() {

        Cache bookRatings = new ConcurrentMapCache("bookRatings");

        SimpleCacheManager manager = new SimpleCacheManager();
        manager.setCaches(Collections.singletonList(bookRatings));

        return manager;
    }
}

Then adding a corresponding annotation to readingRepo.getBookRatings method:

@Cacheable(value = "bookRatings", key = "#a0.id")
@RestResource(exported = false)
@Query("select avg(r.rating) as rating, count(r) as readings from Reading r where r.book = ?1")
Ratings getBookRatings(Book book);

And implementing the cache eviction when Book data is updated:

@RepositoryEventHandler(Reading.class)
public class ReadingEventHandler {

    private final @NonNull CacheManager cacheManager;

    @HandleAfterCreate
    @HandleAfterSave
    @HandleAfterDelete
    public void evictCaches(Reading reading) {
        Book book = reading.getBook();
        cacheManager.getCache("bookRatings").evict(book.getId());
    }
}

Now all subsequent requests of /books?projection=bookRating will get rating data from our cache and will not cause redundant requests to the database.

More info and working example is here.

113

answered Nov 05 '22 14:11

Cepr0

Related questions
                            
                                How to configure ElastiCache Redis for Spring Data Access
                            
                                How to handle Enums as string in spring data jpa Specification
                            
                                JPA named query match a list of tuples in IN clause
                            
                                Spring Boot @RepositoryEventHandler not invoked
                            
                                Spring Batch - How to prevent database commits with jobLauncherTestUtils
                            
                                How to unit test pessimistic locking with Spring Boot + Spring Data JPA
                            
                                object references an unsaved transient instance - save the transient instance before flushing : Spring Data JPA
                            
                                TransactionRequiredException: no transaction is in progress while using JPAItemWriter
                            
                                Sorting is not working with Pageable in Springboot
                            
                                Spring data alternatives
                            
                                How to group by result with JPA and JpaSpecificationExecutor?
                            
                                spring-data-jpa call Stored Procedure
                            
                                In spring boot JPA, how to properly POST an object whose entity representation has a foreign key association to a different entity?
                            
                                Spring Data JPA OutOfMemoryError: GC overhead limit exceeded
                            
                                Spring Data JPA Invalid page.sort Parameters
                            
                                Spring Data - Overriding default methods for some repositories
                            
                                Spring data rest one to many delete child objects
                            
                                Spring Data Serialization of Embeddable with ManyToOne References
                            
                                JPA Spring repository filtering multiple parameters
                            
                                Generate automatically Spring Data Repositories from JPA entities

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With