Cosmos Db Graph - Performance and throughput of Gremlin.Net vs Microsoft.Graph

Tags:

As I'm learning how to use graph with Cosmos DB, I found two Microsoft tutorials:

One using Gremlin.Net
The other using Microsoft.Azure.Graph pre-release

While I use the same query, its execution differs.

Using Gremlin.Net, it executes at once. I very often (I'd say 70% of the time) get a RequestRateTooLargeException. If I understand correctly, it means that I keep reaching the 400RU/s limit that I chose to start with. However, when the query goes trough, it is twice as fast a the solution with Microsoft.Azure.Graph.

Indeed, with Micorosft.Azure.Graph, I have to call ExecuteNextAsync in a loop which returns one result at a time.

So the questions are:
1°) Which method should I use for better performance?
2°) How can I know the RU of my query so I can fine tune it?
3°) Is it possible to increase the throughput of an existing collection?

Update

Re question 3, I found that in the "Data Explorer" blade of my database, there is a "Scale & Settings" for my graph where I can update the throughput.

Update2

Re question 2, we can't get the RU charged when using the first method (Gremlin.Net) but the Microsoft.Graph the method ExecuteNextAsync returns a FeedResponse with a field RequestCharge.

492

asked Feb 26 '18 17:02

François

1 Answers

The reason you are hitting RequestRateTooLarge exceptions (429 status code) via Gremlin.NET vs Microsoft.Azure.Graphs is likely due to the difference between the retry policy on CosmosDB Gremlin server vs the default retry policy for DocumentClient.

The default retry behavior for DocumentClient with regards to these errors is described here:

By default, the DocumentClientException with status code 429 is returned after a cumulative wait time of 30 seconds if the request continues to operate above the request rate.

Therefore, Microsoft.Azure.Graphs may be internally handling and retrying these errors from the server and eventually succeeding. This has the benefit of improving request reliability but obfuscates the request rate errors, and will impact execution duration.

On CosmosDB Gremlin server, this retry policy allowance is reduced significantly, so RequestRateTooLargeException errors will be surfaced sooner.

To answer your questions:

1. Which method should I use for better performance?

Using CosmosDB Gremlin server via Gremlin.NET is expected to see better performance. Microsoft.Azure.Graphs uses a different request processing approach which involves more round-trips to the server so it has overhead, in addition to being a number of releases behind what is deployed to the server.

2. How can I know the RU of my query so I can fine tune it?

RU charges will be included in the Gremlin server responses as attributes. Currently Gremlin.NET doesn't have a way of exposing attributes on the response, however changes to the client driver are being discussed here.

In the interim, you an observe how frequently your requests hit 429 errors through the Metrics blade on your Azure CosmosDB Account portal. This presents aggregated views of number of requests, requests that exceeded capacity, storage quota etc. for a given collection.

3. Is it possible to increase the throughput of an existing collection?

As you found, you can increase throughput for an existing collection via the portal. Alternatively, this can be programmatically via Microsoft.Azure.Documents SDK.

In closing, my recommendation would be to add a retry policy around Gremlin.NET requests to handle these exceptions and match on RequestRateTooLargeException message.

When response status attributes are exposed on Gremlin.NET, they will include:

Request charge,
CosmosDB specific status code (eg. 429), and
Retry-after value, which indicates the time to wait in order to avoid hitting 429 errors.

118

answered Oct 03 '22 04:10

Oliver Towers

Related questions
                            
                                Creating and comparing dates inside CosmosDB stored procedures
                            
                                mongodb spring connection lost overnight
                            
                                Stored procedure azure Cosmos DB returns empty collection
                            
                                Cosmos DB 408 response in Azure Function
                            
                                Understanding the x-ms-resource-usage in DocumentDB response header
                            
                                DocumentDb Emulator not working - Service Unavailable
                            
                                Azure Cosmos DB Mongodb $t and $v
                            
                                Cosmos DB (DocumentDB API): Efficient way to query most recent document by partition ID?
                            
                                Why am I seeing different index behaviour between 2 seemingly identical CosmosDb Collections
                            
                                Azure Cosmos DB - Update existing documents with an additional field
                            
                                Many tiny documents in CosmosDB
                            
                                Creating a query dynamically in documentDb
                            
                                C# MongoDB driver only returning 100 results
                            
                                Unable to cast object of type 'System.Linq.EnumerableQuery to type 'Microsoft.Azure.Documents.Linq.IDocumentQuery
                            
                                Mocking IDocumentQuery in Unit Test that uses Linq queries
                            
                                Repository that support query by partition key without change interface
                            
                                Use custom JsonSerializerSettings with DocumentDb in Azure Function
                            
                                Issue while getting the latest record by time from cosmos database
                            
                                SqlQuerySpec QueryBuilder for CosmosDb

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Cosmos Db Graph - Performance and throughput of Gremlin.Net vs Microsoft.Graph

Tags:

graph-databases

gremlin

azure-cosmosdb

François

People also ask

1 Answers

Oliver Towers

Recent Activity

Donate For Us