Sparse Graph Implementation & Performance in C++

Tags:

I'm currently working on a directed graph data structure in C++ (no Boost GL for this project). The primary application will be identifying connected components and sinks. The graphs are expected to be sparse (E ~ 4V upper limit on num edges) and will all be uniform weight. I'm trying to decide between adjacency list, incidence list or possibly some other representation that I haven't heard of yet (adj. matrix not an option bc of sparsity). The bottleneck is likely going to be space overall and speed of graph initialization: Graphs will be initialized from potentially huge arrays such that each element in the array will end up being a vertex with a directed edge to one of its neighboring elements. To get the edges for each vertex, all its neighboring elements must be compared first.

My questions are: (1) Which representation is typically faster to initialize and also fast for BFS traversal, (2) What algorithms (other than vanilla BFS) are there for finding connected components? I know it's O(V+E) using BFS (which is optimal, I think) but I'm worried about the size of the intermediate queue as the graph width grows exponentially with height.

Don't have too much experience with graph implementations, so I'd be grateful for any suggestions.

237

asked Mar 08 '13 00:03

42point2

1 Answers

Consider a layout as follows:

enter image description here

An adjacency list can be implemented as an array of [Nx4] (n being 3 in this case, and 4 because you are saying that 4 is the maximum number of edges in your case) in the following form:

2  3  0  0
3  0  0  0
0  0  0  0

the above representation assumes that the number of vertices are in sorted order where first index into the array is given by (v-1).

Incidence list on the other hand, requires you to define a vertex list, an edge list and connection elements in between (incidence list - graph).

Both are good in terms of space usage compared to an adjacency matrix since your graph is very sparse, as you stated.

My suggestion would be to go with the adjacency list, which you can initialize as an [Nx4] contiguous array in the memory (since you are saying that you will have at most 4 edges for one vertex). This representation will be faster to initialize. (Also, this representation will perform better in terms of cache efficiency.)

However, if you expect the size of your graph changing dynamically and frequently, incidence lists might be better since they are generally implemented as lists which are non contiguous spaces (see the link above). De-allocation and allocation of the adjacency array might be undesirable in that case.

117

answered Sep 20 '22 13:09

meyumer

Related questions
                            
                                Pass subclasses to a function that takes their superclass
                            
                                Converting GCC IR to LLVM IR
                            
                                Simple C++ Encryption - Decryption Library? [duplicate]
                            
                                CUDA syntax error '<'
                            
                                Performance of different math functions in x86?
                            
                                Texture not displaying properly - probably coordinates are wrong OpenGL, C++
                            
                                Using utf-8 characters in log4cxx
                            
                                unordered_map of boost::noncopyable can't return references from operator[]
                            
                                convert a string to a variant in c++
                            
                                What is the correct implementation of move constructor (and others)?
                            
                                Creating cross platform OpenGL off-screen context
                            
                                Avoiding an "inheritance by dominance" warning for a mocked std::fstream class
                            
                                How to trap stack overflow with pthread?
                            
                                How to insert a function in LLVM module
                            
                                Boost threads: in IOS, thread_info object is being destructed before the thread finishes executing
                            
                                How to store universal references
                            
                                C++ Allocating memory on the heap and stack?
                            
                                Type trait to identify primary base class
                            
                                rand() and srand() in C++
                            
                                How do I prevent boost::optional<T> from being constructed erroneously with 0?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Sparse Graph Implementation & Performance in C++

Tags:

c++

algorithm

graph

breadth-first-search

directed-acyclic-graphs

42point2

People also ask

1 Answers

meyumer

Recent Activity

Donate For Us