I have a large graph of a few 100 GBs so I can't store it in RAM. The graph has multi-edges and there are labels for each edge. I would like to perform queries such as the following:
What's a good way to store a graph on disk for these sorts of queries?
This is for a single high performance PC not a distributed setting. My preferred programming language is Python.
Graph data is kept in store files, each of which contain data for a specific part of the graph, such as nodes, relationships, labels and properties. Dividing the storage in this way facilitates highly performant graph traversals (as detailed above).
You can try to use Neo4J graph database. It has support for Python language and rich query syntax which will give you an ability to perform all the queries you mentioned.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With