'Stable' multi-dimensional scaling algorithm

Tags:

I have a wireless mesh network of nodes, each of which is capable of reporting its 'distance' to its neighbors, measured in (simplified) signal strength to them. The nodes are geographically in 3d space but because of radio interference, the distance between nodes need not be trigonometrically (trigonomically?) consistent. I.e., given nodes A, B and C, the distance between A and B might be 10, between A and C also 10, yet between B and C 100.

What I want to do is visualize the logical network layout in terms of connectness of nodes, i.e. include the logical distance between nodes in the visual.

So far my research has shown the multidimensional scaling (MDS) is designed for exactly this sort of thing. Given that my data can be directly expressed as a 2d distance matrix, it's even a simpler form of the more general MDS.

Now, there seem to be many MDS algorithms, see e.g. http://homepage.tudelft.nl/19j49/Matlab_Toolbox_for_Dimensionality_Reduction.html and http://tapkee.lisitsyn.me/ . I need to do this in C++ and I'm hoping I can use a ready-made component, i.e. not have to re-implement an algo from a paper. So, I thought this: https://sites.google.com/site/simpmatrix/ would be the ticket. And it works, but:

The layout is not stable, i.e. every time the algorithm is re-run, the position of the nodes changes (see differences between image 1 and 2 below - this is from having been run twice, without any further changes). This is due to the initialization matrix (which contains the initial location of each node, which the algorithm then iteratively corrects) that is passed to this algorithm - I pass an empty one and then the implementation derives a random one. In general, the layout does approach the layout I expected from the given input data. Furthermore, between different runs, the direction of nodes (clockwise or counterclockwise) can change. See image 3 below.
The 'solution' I thought was obvious, was to pass a stable default initialization matrix. But when I put all nodes initially in the same place, they're not moved at all; when I put them on one axis (node 0 at 0,0 ; node 1 at 1,0 ; node 2 at 2,0 etc.), they are moved along that axis only. (see image 4 below). The relative distances between them are OK, though.

So it seems like this algorithm only changes distance between nodes, but doesn't change their location.

Thanks for reading this far - my questions are (I'd be happy to get just one or a few of them answered as each of them might give me a clue as to what direction to continue in):

Where can I find more information on the properties of each of the many MDS algorithms?
Is there an algorithm that derives the complete location of each node in a network, without having to pass an initial position for each node?
Is there a solid way to estimate the location of each point so that the algorithm can then correctly scale the distance between them? I have no geographic location of each of these nodes, that is the whole point of this exercise.
Are there any algorithms to keep the 'angle' at which the network is derived constant between runs?

If all else fails, my next option is going to be to use the algorithm I mentioned above, increase the number of iterations to keep the variability between runs at around a few pixels (I'd have to experiment with how many iterations that would take), then 'rotate' each node around node 0 to, for example, align nodes 0 and 1 on a horizontal line from left to right; that way, I would 'correct' the location of the points after their relative distances have been determined by the MDS algorithm. I would have to correct for the order of connected nodes (clockwise or counterclockwise) around each node as well. This might become hairy quite quickly.

Obviously I'd prefer a stable algorithmic solution - increasing iterations to smooth out the randomness is not very reliable.

Thanks.

EDIT: I was referred to cs.stackexchange.com and some comments have been made there; for algorithmic suggestions, please see https://cs.stackexchange.com/questions/18439/stable-multi-dimensional-scaling-algorithm .

Image 1 - with random initialization matrix:

Image 1 - with random initialization matrix

Image 2 - after running with same input data, rotated when compared to 1:

Image 2 - after running with same input data, rotated when compared to 1

Image 3 - same as previous 2, but nodes 1-3 are in another direction:

Image 3 - same as previous 2, but nodes 1-3 are in another direction

Image 4 - with the initial layout of the nodes on one line, their position on the y axis isn't changed:

Image 4 - with the initial layout of the nodes on one line, their position on the y axis isn't changed

309

asked Nov 28 '13 10:11

Roel

1 Answers

Most scaling algorithms effectively set "springs" between nodes, where the resting length of the spring is the desired length of the edge. They then attempt to minimize the energy of the system of springs. When you initialize all the nodes on top of each other though, the amount of energy released when any one node is moved is the same in every direction. So the gradient of energy with respect to each node's position is zero, so the algorithm leaves the node where it is. Similarly if you start them all in a straight line, the gradient is always along that line, so the nodes are only ever moved along it.

(That's a flawed explanation in many respects, but it works for an intuition)

Try initializing the nodes to lie on the unit circle, on a grid or in any other fashion such that they aren't all co-linear. Assuming the library algorithm's update scheme is deterministic, that should give you reproducible visualizations and avoid degeneracy conditions.

If the library is non-deterministic, either find another library which is deterministic, or open up the source code and replace the randomness generator with a PRNG initialized with a fixed seed. I'd recommend the former option though, as other, more advanced libraries should allow you to set edges you want to "ignore" too.

answered Oct 28 '22 20:10

Andy Jones

Related questions
                            
                                How can I set the RTS with ioctl() in a Mac plugin?
                            
                                Using declaration for type-dependent template name
                            
                                Boost.python vs Cython for C++/python interface
                            
                                Why and how do extra parentheses change the type of an expression in C++ (C++11)?
                            
                                Programmatically selecting integrated graphics in nVidia Optimus
                            
                                Setting the application icon with CMake
                            
                                C++11: Are there reasons why some Regular Types should not have `std::hash` specialised?
                            
                                Using Swift with Qt
                            
                                "Polynomial too big for FFT" error on NTL
                            
                                Segmentation fault itself is hanging
                            
                                Should allocator construct() default initialize instead of value initializing?
                            
                                Pool allocators with virtual destructors
                            
                                Unaligned load versus unaligned store
                            
                                Non-dependent name lookup and lambda
                            
                                getting iaxclient to send audio to/get audio from buffer instead of audio-device
                            
                                What are the differences between the various boost ublas sparse vectors?
                            
                                Is it possible to stop a single thread during debug in Linux?
                            
                                H.264 muxed to MP4 using libavformat not playing back
                            
                                GCC refuses to emit long calls for operator new/delete on PowerPC
                            
                                How to tell advance() to use += operator on input iterators if they aren't random access

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

'Stable' multi-dimensional scaling algorithm

Tags:

c++

algorithm

graph-algorithm

Roel

People also ask

1 Answers

Andy Jones

Recent Activity

Donate For Us