breadth-first-search on huge graph with little ram

Tags:

I currently have a graph that has about 10 million nodes and 35 million edges. For now the complete graph is loaded into memory at program start. This takes a couple of minutes (it is Java after all) and needs about half a gigabyte of RAM. For now it runs on a machine with a dual core processor and 4 gigabytes of RAM.

When the graph is searched using a breadth-first-search the memory usage rises to a peak of one gigabyte and it takes ten seconds on average.

I would like to deploy the program on a couple of computers. The functionality apart from the graph search does take very little resources. My target system is very miniature and has only 512 megabytes of RAM.

Any suggestions on how to implement a method (probably using a database) to search that graph without consuming too much memory? The program is idle most of the time as it is accessing a hardware device, so the path-finding could take about 5 minutes max for the mentioned graph...

Thanks for any thoughts thrown in my direction.

UPDATE:

Just found neo4j. Anybody knows if it would be suitable for this kind of humongous graph?

945

asked Feb 13 '10 18:02

allesblinkt

1 Answers

Your question is a little vague, but in general, a good strategy that mostly follows breadth first semantics while using the same amount of memory as depth-first search is Iterative Deepening. The idea is that you do a depth-first search limited to 1 level at first; if that fails to find a solution, start from scratch and limit it to 2 levels; if that fails, try 3 levels, and so on.

This may seem a bit redundant at first, but since you're doing a depth-first search, you keep much fewer nodes in memory, and always search one less level than a straightforward breadth-first search. Since the amount of nodes in a level grows exponentially, on larger graphs, it's very likely that saving that one last extra level pays off for trying all preceding layers redundantly.

127

answered Nov 15 '22 10:11

Max Shawabkeh

Related questions
                            
                                Objective-C memory management, xml parser and other non-trivial examples
                            
                                A fail-safe way to prevent GD image library from running out of memory? (PHP)
                            
                                Behaviour of static variables in dynamically linked libraries (C/C++)
                            
                                BSS, Stack, Heap, Data, Code/Text - Where each of these start in memory?
                            
                                largest memory map allocation size?
                            
                                Pointers and memory scope
                            
                                Does every server in a MongoDB replica set need to have exactly the same RAM?
                            
                                Java using more memory than the allocated memory
                            
                                How to get total size of free memory in C in linux?
                            
                                Does a c++ program automatically free memory when it crashes?
                            
                                Why is a 1 character .NET string 32 bytes in x64?
                            
                                Javascript concerns of using 'var' on an already-existing variable
                            
                                Compiler specific memory initialization
                            
                                Python MemoryError in Scipy Radial Basis Function (scipy.interpolate.rbf)
                            
                                Why does Process.PrivateMemorySize64 /1024 not match what Windows Task Manager Memory (Private Working Set)?
                            
                                What the GPU memory means in chrome task manager
                            
                                App memory usage difference between simulator and device
                            
                                How do I make LeakSanitizer ignore end of program leaks
                            
                                Convert pytorch tensor to opencv mat and vice versa in C++

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

breadth-first-search on huge graph with little ram

Tags:

path

memory

graph

breadth-first-search

allesblinkt

People also ask

1 Answers

Max Shawabkeh

Recent Activity

Donate For Us