How to model Graph data in Postgresql? [closed]

Tags:

How would one go about storing and querying sparse directed or undirected graphs in Postgresql. There is something like pggraph, but that is still in planning stage.

I realize dedicated graph databases like Neo4J are best suited for this. However is there way to implement same within Postgresql, by using extension or a data type, which would avoid adding another database engine.dtata

553

asked Dec 25 '13 20:12

jethar

2 Answers

In essence, there are some techniques to efficiently query graph data within an SQL database, that apply to highly specialized scenarios.

You could opt to maintain a GRIPP index, for instance, if your interests lie in finding shortest paths. (It basically works a bit like pre-ordered tree index, applied to graphs.) To the best of my knowledge, none of these techniques are standardized yet.

With that being said, and seeing your comment that mentions social networks, the odds are that each of them will be overkill.

If your interest primarily lies in fetching data related to a user's friends, or something equivalent in the sense that it amounts to querying a node's neighborhood, the number of nodes you'll need to traverse in joins is so tiny that there is no need for specialized tools, data structures, etc.: simply use recursive CTEs.

http://www.postgresql.org/docs/current/static/queries-with.html

WITH provides a way to write auxiliary statements for use in a larger query. These statements, which are often referred to as Common Table Expressions or CTEs, can be thought of as defining temporary tables that exist just for one query.

For optimal performance when using the latter, shift as many where conditions within the with (...) part of the query, so as to eliminate nodes early.

135

answered Oct 13 '22 00:10

Denis de Bernardy

Use PostgreSQL for the underlying storage and use networkX or iGraph via PL/Python for the processing engine.

In their book Graph Databases, Ian Robinson, Jim Webber, and Emil Eifrem make a distinction between the underlying storage and the processing engine. If you look at the answer I followed in a recent problem (see here), you will see that I'm using PostgreSQL for the underlying storage and networkX as the processing engine. The performance gain relative to my original solution was huge (and similar to the ones described in the "Graph Databases" book) and implementing it was very easy.

answered Oct 12 '22 23:10

Ian Gow

Related questions
                            
                                Dialect needs to be explicitly supplied as of v4.0.0
                            
                                debugging postgresql trigger
                            
                                Very slow Spring Boot application startup
                            
                                SQLalchemy not find table for creating foreign key
                            
                                PostgreSQL: Warning: Console code page (437) differs from Windows code page (1252)
                            
                                Failed to load sql modules into the database cluster during PostgreSQL Installation
                            
                                Force max length for string in PostgreSQL
                            
                                Can I make a plpgsql function return an integer without using a variable?
                            
                                Store common query as column?
                            
                                How do I do a schema only backup and restore in PostgreSQL?
                            
                                String concatenation with a null seems to nullify the entire string - is that desired behavior in Postgres?
                            
                                PostgreSQL rename attribute in jsonb field
                            
                                DBeaver Can't access non-default database
                            
                                Dropping column in Postgres on a large dataset
                            
                                Postgresql DROP TABLE doesn't work
                            
                                PostgreSQL: Select data with a like on timestamp field
                            
                                Export Postgres Database into CSV file
                            
                                Remove array values in pgSQL
                            
                                Inserting array values
                            
                                Psycopg2, Postgresql, Python: Fastest way to bulk-insert

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to model Graph data in Postgresql? [closed]

Tags:

graph

postgresql

graph-databases

jethar

People also ask

2 Answers

Denis de Bernardy

Ian Gow

Recent Activity

Donate For Us