I have created a basic implementation of high level client over Neo4J (https://github.com/impetus-opensource/Kundera/tree/trunk/kundera-neo4j) and want to compare its performance with Native neo4j driver (and maybe SpringData too). This way I would be able to determine overhead my library is putting over native driver. I plan to create an extension of YCSB for Neo4J. My question is: what should be considered as a basic unit of object to be written into neo4j (should it be a single node or a couple of nodes joined by an edge). What's current practice in Neo4J world. How people benchmarking neo4j performance are doing it.

See graphdb-benchmarks The project graphdb-benchmarks is a benchmark between popular graph dataases. Currently the framework supports Titan, OrientDB, Neo4j and Sparksee. The purpose of this benchmark is to examine the performance of each graph database in terms of execution time. The benchmark is composed of four workloads, Clustering, Massive Insertion, Single Insertion and Query Workload. Every workload has been designed to simulate common operations in graph database systems. Clustering Workload (CW): CW consists of a well-known community detection algorithm for modularity optimization, the Louvain Method. We adapt the algorithm on top of the benchmarked graph databases and employ cache techniques to take advantage of both graph database capabilities and in-memory execution speed. We measure the time the algorithm needs to converge. Massive Insertion Workload (MIW): Create the graph database and configure it for massive loading, then we populate it with a particular dataset. We measure the time for the creation of the whole graph. Single Insertion Workload (SIW): Create the graph database and load it with a particular dataset. Every object insertion (node or edge) is committed directly and the graph is constructed incrementally. We measure the insertion time per block, which consists of one thousand edges and the nodes that appear during the insertion of these edges. Query Workload (QW): Execute three common queries: FindNeighbours (FN): finds the neighbours of all nodes. FindAdjacentNodes (FA): finds the adjacent nodes of all edges. FindShortestPath (FS): finds the shortest path between the first node and 100 randomly picked nodes.

Neo4J Performance Benchmarking

Tags:

neo4j

kundera

I have created a basic implementation of high level client over Neo4J (https://github.com/impetus-opensource/Kundera/tree/trunk/kundera-neo4j) and want to compare its performance with Native neo4j driver (and maybe SpringData too). This way I would be able to determine overhead my library is putting over native driver.

I plan to create an extension of YCSB for Neo4J.

My question is: what should be considered as a basic unit of object to be written into neo4j (should it be a single node or a couple of nodes joined by an edge). What's current practice in Neo4J world. How people benchmarking neo4j performance are doing it.

367

asked Mar 01 '13 06:03

Amresh

2 Answers

There's already been some work for benchmarking Neo4J with Gatling: http://maxdemarzi.com/2013/02/14/neo4j-and-gatling-sitting-in-a-tree-performance-t-e-s-t-ing/

You could maybe adapt it.

answered Sep 19 '22 22:09

Stephane Landelle

See graphdb-benchmarks

The project graphdb-benchmarks is a benchmark between popular graph dataases. Currently the framework supports Titan, OrientDB, Neo4j and Sparksee. The purpose of this benchmark is to examine the performance of each graph database in terms of execution time. The benchmark is composed of four workloads, Clustering, Massive Insertion, Single Insertion and Query Workload. Every workload has been designed to simulate common operations in graph database systems.

Clustering Workload (CW): CW consists of a well-known community detection algorithm for modularity optimization, the Louvain Method. We adapt the algorithm on top of the benchmarked graph databases and employ cache techniques to take advantage of both graph database capabilities and in-memory execution speed. We measure the time the algorithm needs to converge.

Massive Insertion Workload (MIW): Create the graph database and configure it for massive loading, then we populate it with a particular dataset. We measure the time for the creation of the whole graph.

Single Insertion Workload (SIW): Create the graph database and load it with a particular dataset. Every object insertion (node or edge) is committed directly and the graph is constructed incrementally. We measure the insertion time per block, which consists of one thousand edges and the nodes that appear during the insertion of these edges.

Query Workload (QW): Execute three common queries: FindNeighbours (FN): finds the neighbours of all nodes. FindAdjacentNodes (FA): finds the adjacent nodes of all edges. FindShortestPath (FS): finds the shortest path between the first node and 100 randomly picked nodes.

answered Sep 23 '22 22:09

Somnath Muluk

Related questions
                            
                                Counting primitives in Neo4j
                            
                                Directional Relationships with different name for each direction
                            
                                Return All Nodes in Shortest Path as Object List
                            
                                Combining depth- and breadth-first traversals in a single cypher query
                            
                                SET in combination with CASE statement in cypher
                            
                                neo4j: how to query subgraph
                            
                                How can I run multiple Neo4j databases on a single server?
                            
                                Neo4j, Cypher: Conditional Create
                            
                                neo4j v2.2.0 default password not working?
                            
                                Spring Data Neo4j 4 - No Identity Field Found For Class
                            
                                How to return all the properties of a node with values
                            
                                Multiple properties of node to identify node Uniquely
                            
                                Is it possible to have SQL Like Clause in Neo4J CQL?
                            
                                Neo4j: Create index for nodes with same property
                            
                                Return the concatenated string of properties via single Cypher call
                            
                                'neo4j-admin' is not recognized as an internal or external command, operable program or batch file
                            
                                How to configure Neo4j embedded to run apoc procedures?
                            
                                How is data stored in a graph database? [duplicate]
                            
                                Neo4j Cypher: Find exact match to array Node property in WHERE clause
                            
                                Deleting indexed nodes in Neo4j

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With