I don't understand the GraphQL N+1 problem

Tags: express, graphql, apollo-server

So just yesterday I started learning GraphQL. It's really interesting, and actually quite easy to learn and understand. I started reading some articles and came across the N+1 problem. I found this example:

Query

# getting the top 100 reviews
{
  top100Reviews {
    body
    author {
      name
    }
  }
}

Schema


const typeDefs = gql`
  type User {
    id: ID!
    name: String
  }
  type Review {
    id: ID!
    body: String
    author: User
    product: Product
  }
  type Query {
    top100Reviews: [Review]
  }
`;

and finally the resolvers

const resolver = {
  Query: {
    top100Reviews: () => get100Reviews(),
  },
  Review: {
    author: (review) => getUser(review.authorId),
  },
};

The article says:

When we execute the following query to get the top 100 reviews and the corresponding author names, we first make a single call to retrieve 100 records of review from database and then for each review, we make another call to the database to fetch the user details given the author ID.

Can't we just remove Review from the resolvers and do a simple JOIN (if I'm using SQL) in the get100Reviews method?

I don't get why we wrote the Review resolver if it's going to cause the N+1 problem, when we could just do a simple JOIN in the Query resolver.

Am I understanding GraphQL right?

Could someone please shed some light on this?

Thanks!

asked Mar 24 '20 13:03

wassimbj

2 Answers

You are correct -- using a join would let you make a single database query instead of 101.
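To make the difference in query counts concrete, here is a rough sketch that uses an in-memory array in place of a real database. Everything in it (the `db` object, the sample users, and the `get100ReviewsWithAuthors` helper) is made up for illustration. Only `get100Reviews` and `getUser` are names from the question, and their bodies here are assumptions. The SQL in the comments is what each function would presumably run against a real database.

```javascript
// In-memory stand-in for a database; `queryCount` tracks simulated round trips.
// All data and helper names here are illustrative assumptions.
const db = {
  users: [{ id: 1, name: 'Ann' }, { id: 2, name: 'Bob' }],
  reviews: Array.from({ length: 100 }, (_, i) => ({
    id: i + 1,
    body: `Review ${i + 1}`,
    authorId: (i % 2) + 1,
  })),
};
let queryCount = 0;

// One query: SELECT * FROM reviews LIMIT 100
function get100Reviews() {
  queryCount += 1;
  return db.reviews.slice(0, 100);
}

// One query per call: SELECT * FROM users WHERE id = ?
function getUser(id) {
  queryCount += 1;
  return db.users.find((u) => u.id === id);
}

// One query: SELECT ... FROM reviews JOIN users ON users.id = reviews.author_id
function get100ReviewsWithAuthors() {
  queryCount += 1;
  return db.reviews.slice(0, 100).map((r) => ({
    ...r,
    author: db.users.find((u) => u.id === r.authorId),
  }));
}

// Resolver-per-field approach: 1 query for the reviews + 1 per author.
function runNaive() {
  queryCount = 0;
  const result = get100Reviews().map((r) => ({
    body: r.body,
    author: getUser(r.authorId),
  }));
  return { result, queries: queryCount };
}

// Join approach: reviews and authors come back from a single query.
function runJoined() {
  queryCount = 0;
  const result = get100ReviewsWithAuthors().map((r) => ({
    body: r.body,
    author: r.author,
  }));
  return { result, queries: queryCount };
}

console.log(runNaive().queries);  // 101
console.log(runJoined().queries); // 1
```

Both functions return the same data; only the number of round trips differs.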

The problem is that in practice, you wouldn't have just one join. Your review data model might include associations with any number of other models, each one requiring its own join clause. Not only that, but those models might have relationships to other models themselves. Crafting a single SQL query that accounts for all possible GraphQL queries becomes not only difficult, but also prohibitively expensive. A client might request only the reviews with none of their associated models, but the query that fetches those reviews now includes 30 additional, unnecessary joins. A query that would have taken less than a second now takes 10.

Consider also that relationships between types can be circular:

{
  reviews {
    author {
      reviews {
        author {
          name
        }
      }
    }
  }
}

In this case, the depth of a query is indeterminate and it is impossible to create a single SQL query that would accommodate any possible GraphQL query.

Using a library like dataloader lets us alleviate the N+1 problem through batching while keeping each individual SQL query as lean as possible. That said, you'll still end up with multiple queries.

An alternative approach is to use the GraphQLResolveInfo object passed to the resolver to determine which fields were requested in the first place. Then, if you like, you can make only the necessary joins in your query. However, parsing the info object and constructing that sort of query can be a daunting task, especially once you start dealing with deeply nested associations. By comparison, dataloader is a simpler and more intuitive solution.
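To give a feel for what batching means here, below is a minimal hand-rolled sketch of the idea. It is not the dataloader library or its API, just a simplified stand-in. `createBatchLoader`, `makeFakeDb`, and `getUsersByIds` are hypothetical names, and the "database" is an in-memory Map. The loader collects every key requested in the same tick and resolves them all with one batched lookup, which is roughly what a real loader does with more careful scheduling and error handling.

```javascript
// Fake database with its own round-trip counter (illustrative assumption).
function makeFakeDb() {
  const users = new Map([
    [1, { id: 1, name: 'Ann' }],
    [2, { id: 2, name: 'Bob' }],
  ]);
  const stats = { queries: 0 };
  // One query for many ids: SELECT * FROM users WHERE id IN (...)
  async function getUsersByIds(ids) {
    stats.queries += 1;
    return ids.map((id) => users.get(id) ?? null);
  }
  return { getUsersByIds, stats };
}

// Collects keys requested before the next microtask, then resolves all of
// them with a single call to batchFn. Repeated keys share one cached promise.
function createBatchLoader(batchFn) {
  let queue = [];
  const cache = new Map();

  async function flush() {
    const batch = queue;
    queue = [];
    try {
      const values = await batchFn(batch.map((item) => item.key));
      batch.forEach((item, i) => item.resolve(values[i]));
    } catch (err) {
      batch.forEach((item) => item.reject(err));
    }
  }

  return {
    load(key) {
      if (cache.has(key)) return cache.get(key);
      const promise = new Promise((resolve, reject) => {
        // The first key of a new batch schedules the flush.
        if (queue.length === 0) queueMicrotask(flush);
        queue.push({ key, resolve, reject });
      });
      cache.set(key, promise);
      return promise;
    },
  };
}

async function demo() {
  const fakeDb = makeFakeDb();
  const userLoader = createBatchLoader(fakeDb.getUsersByIds);
  const reviews = Array.from({ length: 100 }, (_, i) => ({
    id: i + 1,
    authorId: (i % 2) + 1,
  }));
  // Stands in for the Review.author resolver running once per review.
  const authors = await Promise.all(
    reviews.map((review) => userLoader.load(review.authorId))
  );
  return { authors, queries: fakeDb.stats.queries };
}

demo().then(({ authors, queries }) => {
  console.log(authors.length, queries); // 100 1
});
```

With a loader like this, the Review.author resolver from the question would call `userLoader.load(review.authorId)` instead of `getUser(review.authorId)`. The reviews query still runs separately, so the total is 2 queries rather than 101. A loader that caches like this is normally created per request so cached users don't leak between requests.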

answered Sep 21 '22 21:09

Daniel Rearden

I just wrote a package that I believe can solve the N+1 problem in most cases for GraphQL on Node.js. Check it out! https://github.com/oney/sequelize-proxy

It basically uses data loaders to batch multiple queries into a single one, but it also leverages Sequelize's features and association definitions to make the batching more accurate and efficient.

answered Sep 18 '22 21:09

user2790103
