Neo4j: Query to find the nodes with most relationships, and their connected nodes

Tags: graph-databases, neo4j, cypher

I am using Neo4j CE 3.1.1 and I have a relationship WRITES between authors and books. I want to find the N (say N=10 for example) books with the largest number of authors. Following some examples I found, I came up with the query:

MATCH (a)-[r:WRITES]->(b)
RETURN r, COUNT(r)
ORDER BY COUNT(r) DESC LIMIT 10

When I execute this query in the Neo4j browser I get 10 books, but these do not look like the ones written by the most authors, as each shows only a few WRITES relationships to authors. If I change the query to

MATCH (a)-[r:WRITES]->(b)
RETURN b, COUNT(r)
ORDER BY COUNT(r) DESC LIMIT 10

Then I get the 10 books with the most authors, but I don't see their relationship to authors. To do so, I have to write additional queries explicitly stating the name of a book I found in the previous query:

MATCH ()-[r:WRITES]->(b)
WHERE b.title="Title of a book with many authors"
RETURN r

What am I doing wrong? Why isn't the first query working as expected?


asked Feb 14 '17 23:02

st1led

2 Answers

Aggregation functions are grouped by the non-aggregated columns in the same RETURN (or WITH) clause, and with your MATCH, each relationship occurs in exactly one result row.

So your first query is asking for each relationship on a row, and the count of that particular relationship, which is 1.
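To see the grouping rule without a database, here is a minimal Python sketch that imitates what the two RETURN clauses do. The sample rows and the helper name `count_by` are invented for illustration; this only mimics the grouping behaviour and is not how Neo4j evaluates the query internally.

```python
from collections import Counter

# Hypothetical result of MATCH (a)-[r:WRITES]->(b): one row per relationship.
# Each row is (author, relationship id, book); the data is made up.
rows = [
    ("Ann", "r1", "Book A"),
    ("Bob", "r2", "Book A"),
    ("Cid", "r3", "Book A"),
    ("Ann", "r4", "Book B"),
]


def count_by(rows, key_index):
    """Count rows per grouping key, like RETURN key, COUNT(r)."""
    return Counter(row[key_index] for row in rows)


# RETURN r, COUNT(r): the grouping key is the relationship itself,
# so every group contains exactly one row.
per_relationship = count_by(rows, 1)

# RETURN b, COUNT(r): the grouping key is the book,
# so each count is the number of WRITES relationships into that book.
per_book = count_by(rows, 2)

print(per_relationship)
print(per_book.most_common(1))
```

With the relationship as the key, every count is 1, so ORDER BY ... LIMIT 10 just picks ten arbitrary relationships. With the book as the key, the counts become meaningful.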

You might rewrite this in a couple of different ways.

One is to collect the authors and order on the size of the author list:

MATCH (a)-[:WRITES]->(b)
RETURN b, COLLECT(a) as authors
ORDER BY SIZE(authors) DESC LIMIT 10

You can always collect the author and its relationship, if the relationship itself is interesting to you.

EDIT

If you happen to have labels on your nodes (you absolutely SHOULD have labels on your nodes), you can try a different approach by matching to all books, getting the size of the incoming :WRITES relationships to each book, ordering and limiting on that, and then performing the match to the authors:

MATCH (b:Book)
WITH b, SIZE(()-[:WRITES]->(b)) as authorCnt
ORDER BY authorCnt DESC LIMIT 10
MATCH (a)-[:WRITES]->(b)
RETURN b, a

You can collect on the authors and/or return the relationship as well, depending on what you need from the output.
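The two-stage shape of that query (count, order and limit first, then match the authors only for the surviving books) can be sketched in plain Python. The data and the function name `top_books_with_authors` are hypothetical, and the sketch only illustrates the order of the steps, not Neo4j's execution plan.

```python
from collections import Counter

# Invented (author, book) pairs standing in for WRITES relationships.
writes = [
    ("Ann", "Book A"), ("Bob", "Book A"), ("Cid", "Book A"),
    ("Ann", "Book B"), ("Dee", "Book B"),
    ("Eve", "Book C"),
]


def top_books_with_authors(writes, limit):
    """Return [(book, [authors])] for the `limit` books with the most authors."""
    # Stage 1: count incoming WRITES per book, then order and limit
    # (the WITH b, ... AS authorCnt ORDER BY authorCnt DESC LIMIT n part).
    counts = Counter(book for _, book in writes)
    top = [book for book, _ in counts.most_common(limit)]
    # Stage 2: look up the authors only for the books that survived the limit
    # (the second MATCH in the query).
    return [(book, [a for a, b in writes if b == book]) for book in top]


result = top_books_with_authors(writes, 2)
print(result)
```

Limiting before the second lookup means the author list is only built for the N books you actually want.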


answered Sep 28 '22 00:09

InverseFalcon

You are very close: after sorting and limiting the books, you need to match their authors again. For example:

MATCH (a:Author)-[r:WRITES]->(b:Book)
WITH b, COUNT(r) AS authorsCount
ORDER BY authorsCount DESC LIMIT 10
MATCH (b)<-[:WRITES]-(a:Author)
RETURN b, COLLECT(a) AS authors
ORDER BY size(authors) DESC

answered Sep 28 '22 02:09

stdob--
