A* Algorithm for very large graphs, any thoughts on caching shortcuts?

I'm writing a courier/logistics simulation on OpenStreetMap maps and have realised that the basic A* algorithm as pictured below is not going to be fast enough for large maps (like Greater London).

http://i.imgur.com/u2tVpML.jpg

The green nodes correspond to ones that were put in the open set/priority queue, and due to their huge number (the whole map has something like 1-2 million nodes), it takes 5 seconds or so to find the route pictured. Unfortunately 100ms per route is about my absolute limit.

Currently, the nodes are stored in both an adjacency list and also a spatial 100x100 2D array.

I'm looking for methods where I can trade off preprocessing time, space and if needed optimality of the route, for faster queries. The straight-line Haversine formula for the heuristic cost is the most expensive function according to the profiler - I have optimised my basic A* as much as I can.
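To make the setup concrete, here is a minimal sketch of the kind of A* loop and Haversine heuristic I mean (Python; the dict-based adjacency list and coordinate lookup are simplifications for illustration, not my actual data structures):

```python
import heapq
import math

EARTH_RADIUS_M = 6371000.0

def haversine(a, b):
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(h))

def astar(adj, coords, start, goal):
    """adj: {node: [(neighbour, edge_length_m), ...]}, coords: {node: (lat, lon)}."""
    g = {start: 0.0}
    parent = {start: None}
    open_heap = [(haversine(coords[start], coords[goal]), start)]
    closed = set()
    while open_heap:
        _, node = heapq.heappop(open_heap)
        if node == goal:
            path = []
            while node is not None:        # walk parent pointers back to the start
                path.append(node)
                node = parent[node]
            return path[::-1]
        if node in closed:
            continue                       # stale entry left behind by lazy deletion
        closed.add(node)
        for nbr, length in adj[node]:
            tentative = g[node] + length
            if tentative < g.get(nbr, math.inf):
                g[nbr] = tentative
                parent[nbr] = node
                f = tentative + haversine(coords[nbr], coords[goal])
                heapq.heappush(open_heap, (f, nbr))
    return None                            # goal not reachable
```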

As an example of such a trade-off, I was thinking that if I chose an arbitrary node X from each quadrant of the 2D array and ran A* between every pair of them, I could store those routes to disk for subsequent simulations. When querying, I would then only need to run A* within the start and end quadrants, to connect each endpoint to its quadrant's X and hence to the precomputed route.
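In code terms that caching scheme would look something like the sketch below, where `search(src, dst)` stands in for the existing A* routine (returning a list of node ids) and `quadrant_of(node)` maps a node to its cell in the 100x100 array; both names are hypothetical, and the stitching glosses over joining the legs cleanly:

```python
import pickle

def precompute_quadrant_routes(search, representatives, path="quadrant_routes.pkl"):
    """representatives: {quadrant_id: chosen node X}. Cache all X-to-X routes on disk."""
    routes = {(qa, qb): search(a, b)
              for qa, a in representatives.items()
              for qb, b in representatives.items() if qa != qb}
    with open(path, "wb") as f:
        pickle.dump(routes, f)
    return routes

def cached_query(search, routes, representatives, quadrant_of, src, dst):
    """Stitch: src -> X(src's quadrant), cached X-to-X leg, X(dst's quadrant) -> dst."""
    qs, qd = quadrant_of(src), quadrant_of(dst)
    if qs == qd:
        return search(src, dst)                     # short, purely local query
    head = search(src, representatives[qs])         # confined to the start quadrant
    tail = search(representatives[qd], dst)         # confined to the end quadrant
    return head + routes[(qs, qd)][1:] + tail[1:]   # drop duplicated join nodes
```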

Is there a more refined version of what I've described above, or perhaps a different method I should pursue? Many thanks!

For the record, here are some benchmark results for arbitrarily weighting the heuristic cost and computing the path between 10 pairs of randomly picked nodes:

Weight   AvgDist%      Time (ms)
1        1             1461.2
1.05     1             1327.2
1.1      1             900.7
1.2      1.019658848   196.4
1.3      1.027619169   53.6
1.4      1.044714394   33.6
1.5      1.063963413   25.5
1.6      1.071694171   24.1
1.7      1.084093229   24.3
1.8      1.092208509   22
1.9      1.109188175   22.5
2        1.122856792   18.2
2.2      1.131574742   16.9
2.4      1.139104895   15.4
2.6      1.140021962   16
2.8      1.14088128    15.5
3        1.156303676   16
4        1.20256964    13
5        1.19610861    12.9

Surprisingly, increasing the coefficient to 1.1 almost halved the execution time whilst returning the same route.

asked by drspa44 on Apr 15 '15 at 17:04


2 Answers

You should be able to make it much faster by trading off optimality. See Admissibility and optimality on Wikipedia.

The idea is to use an epsilon value that guarantees a solution no worse than (1 + epsilon) times the optimal path, while causing far fewer nodes to be considered by the algorithm. Note that this does not mean the returned solution will always be (1 + epsilon) times the optimal path; that is just the worst case. I don't know exactly how it would behave in practice for your problem, but I think it is worth exploring.

Wikipedia lists a number of algorithms that rely on this idea. I believe this is your best bet to improve the algorithm, and it has the potential to run within your time limit while still returning good paths.
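As a rough sketch of what this means in code (assuming a heuristic function along the lines of the Haversine one sketched in the question), the only change is to inflate the heuristic term by the chosen factor:

```python
def make_weighted_heuristic(heuristic, epsilon):
    """Inflate an admissible heuristic by (1 + epsilon).

    A* using the returned function still finds a path, but it is only
    guaranteed to be within (1 + epsilon) times the optimal length; in
    return the search typically expands far fewer nodes.
    """
    return lambda a, b: (1.0 + epsilon) * heuristic(a, b)

# epsilon = 0.2 corresponds to the "Weight 1.2" row in the benchmark table in
# the question: at most 20% longer than optimal in the worst case.
```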

Since your algorithm already processes millions of nodes in about 5 seconds, I assume you are using a binary heap for the priority queue, correct? If you implemented it yourself, make sure it is a proper binary heap backed by a simple array.
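For reference, an array-backed binary min-heap is just a flat list with sift-up and sift-down, roughly like the sketch below (in Python the standard heapq module already provides exactly this on top of a plain list, so rolling your own only makes sense in a lower-level language):

```python
class ArrayBinaryHeap:
    """Minimal array-backed binary min-heap of (priority, item) pairs.

    The children of index i live at 2*i + 1 and 2*i + 2, so the whole heap
    sits in one contiguous array with no pointer-based tree nodes.
    """

    def __init__(self):
        self._a = []

    def push(self, priority, item):
        a = self._a
        a.append((priority, item))
        i = len(a) - 1
        while i > 0:                                  # sift up
            parent = (i - 1) // 2
            if a[parent][0] <= a[i][0]:
                break
            a[parent], a[i] = a[i], a[parent]
            i = parent

    def pop(self):
        a = self._a
        top = a[0]                                    # smallest priority
        last = a.pop()
        if a:
            a[0] = last
            i, n = 0, len(a)
            while True:                               # sift down
                left, right, smallest = 2 * i + 1, 2 * i + 2, i
                if left < n and a[left][0] < a[smallest][0]:
                    smallest = left
                if right < n and a[right][0] < a[smallest][0]:
                    smallest = right
                if smallest == i:
                    break
                a[i], a[smallest] = a[smallest], a[i]
                i = smallest
        return top
```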

answered by IVlad on Sep 21 '22 at 14:09

There are specialist algorithms for this problem that do a lot of pre-computation. From memory, the pre-computation adds information to the graph that A* uses to produce a much more accurate heuristic than straight line distance. Wikipedia gives the names of a number of methods at http://en.wikipedia.org/wiki/Shortest_path_problem#Road_networks and says that Hub Labelling is the leader. A quick search on this turns up http://research.microsoft.com/pubs/142356/HL-TR.pdf. An older one, using A*, is at http://research.microsoft.com/pubs/64505/goldberg-sp-wea07.pdf.
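One widely used member of this family is ALT (A*, Landmarks, Triangle inequality): precompute exact shortest-path distances from a handful of well-spread landmark nodes (one Dijkstra per landmark), then at query time use the triangle inequality as a much tighter admissible heuristic than straight-line distance. A sketch of the query-time part only, assuming dist_from[L][v] holds the precomputed distance from landmark L to node v on an undirected road graph (this is not the Hub Labelling scheme from the first paper):

```python
def alt_heuristic(dist_from, landmarks, node, goal):
    """Landmark lower bound on dist(node, goal).

    On an undirected graph the triangle inequality gives, for every landmark L,
    dist(node, goal) >= |dist_from[L][goal] - dist_from[L][node]|; taking the
    maximum over landmarks keeps the bound admissible while tightening it.
    """
    best = 0.0
    for L in landmarks:
        bound = abs(dist_from[L][goal] - dist_from[L][node])
        if bound > best:
            best = bound
    return best

# A directed graph needs distances both from and to each landmark, and the
# landmarks are usually chosen near the periphery of the map.
```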

Do you really need to use Haversine? To cover London, I would have thought you could have assumed a flat earth and used Pythagoras, or stored the length of each link in the graph.
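Concretely, the flat-earth version is a short sketch like the one below (coordinates assumed to be in degrees); over an area the size of Greater London its error relative to Haversine is negligible, and it drops most of the trigonometry:

```python
import math

EARTH_RADIUS_M = 6371000.0

def flat_earth_distance(a, b):
    """Equirectangular approximation: scale longitude by cos(mean latitude),
    then apply Pythagoras on the resulting planar offsets."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    x = (lon2 - lon1) * math.cos((lat1 + lat2) / 2.0)
    y = lat2 - lat1
    return EARTH_RADIUS_M * math.hypot(x, y)

# Over a city-sized area the cos() factor is nearly constant, so it could even
# be computed once per query (from the goal's latitude) instead of per node.
```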

answered by mcdowella on Sep 21 '22 at 14:09