
Alternatives to the Dynamic Time Warping (DTW) method

Tags:

c#

time-series

I am doing some research into methods of comparing time series data. One of the algorithms that I have found being used for matching this type of data is the DTW (Dynamic Time Warping) algorithm.

The data I have, resemble the following structure (this can be one path):

Path    Event      Time            Location (x,y)
  1       1       2:30:02             1,5
  1       2       2:30:04             2,7
  1       3       2:30:06             4,4
...
...

Now, I was wondering whether there are other algorithms that would be suitable to find the closest match for the given path.

asked Jan 13 '12 by user496607

People also ask

Why is DTW not a metric?

Abstract. In general, the inter-word dissimilarity measure supplied by Dynamic Time Warping algorithms can not be assumed to be a metric because it does not fully satisfy all the required properties (the triangle inequality in particular).

Why is Dynamic Time Warping useful?

Dynamic Time Warping (DTW) is a way to compare two (usually temporal) sequences that do not sync up perfectly. It is a method to calculate the optimal matching between two sequences. DTW is useful in many domains such as speech recognition, data mining, financial markets, etc.

Is Dynamic Time Warping AI?

Dynamic Time Warping (DTW) is an A.I. technique that has proven very useful for normalizing and comparing data of unequal length, and it also copes with inputs that progress at varying speeds.

What is FastDTW?

FastDTW is an approximate Dynamic Time Warping (DTW) algorithm that provides optimal or near-optimal alignments with an O(N) time and memory complexity, in contrast to the O(N^2) requirement for the standard DTW algorithm.
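
For reference, here is a minimal sketch (my own illustration, not taken from any particular library) of the standard O(N^2) dynamic-programming DTW that FastDTW approximates. It works directly on paths of (x, y) points like the ones in the question and uses the Euclidean distance between individual points as the local cost:

using System;
using System.Collections.Generic;

static class Dtw
{
    // Standard dynamic-programming DTW between two paths of (x, y) points.
    // It fills an n-by-m cost table, which is where the O(N^2) time and
    // memory of plain DTW come from.
    public static double Distance(IList<Tuple<int, int>> a, IList<Tuple<int, int>> b)
    {
        int n = a.Count, m = b.Count;
        var d = new double[n + 1, m + 1];

        for (int i = 0; i <= n; i++)
            for (int j = 0; j <= m; j++)
                d[i, j] = double.PositiveInfinity;
        d[0, 0] = 0;

        for (int i = 1; i <= n; i++)
            for (int j = 1; j <= m; j++)
            {
                // Local cost: Euclidean distance between the two points.
                double dx = a[i - 1].Item1 - b[j - 1].Item1;
                double dy = a[i - 1].Item2 - b[j - 1].Item2;
                double cost = Math.Sqrt(dx * dx + dy * dy);

                // Extend the cheapest of the three possible predecessors
                // (match, repeat a point of a, repeat a point of b); this
                // repetition of points is the "warping".
                d[i, j] = cost + Math.Min(d[i - 1, j - 1],
                                 Math.Min(d[i - 1, j], d[i, j - 1]));
            }
        return d[n, m];
    }
}

A smaller return value means a closer match. FastDTW approximates the same quantity, but only evaluates a narrow band of this table at each of several resolution levels instead of the full N^2 cells.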


2 Answers

The keyword you are looking for is "(dis-)similarity measures".

Euclidean Distance (ED), as described by Adam Mihalcin in the other answer, is easy to compute and roughly matches the everyday meaning of the word "distance". Yet when comparing two time series, DTW is to be preferred, especially when applied to real-world data, for two reasons:

1) ED can only be applied to series of equal length. When points are missing, ED simply cannot be computed (unless the other sequence is cut down as well, losing even more information).

2) ED does not allow time-shifting or time-warping, as opposed to algorithms based on DTW (see the small sketch below).
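
As a small illustration of point 2 (my own sketch, not part of the original answer): take two series that contain the same shape, just shifted by one sample. ED compares them index by index and reports a non-zero distance, whereas the shifted alignment that a DTW warping path is allowed to choose costs nothing:

using System;
using System.Linq;

class EdVersusWarping
{
    static void Main()
    {
        // Series b is series a shifted left by one sample.
        double[] a = { 0, 0, 1, 2, 1, 0, 0 };
        double[] b = { 0, 1, 2, 1, 0, 0, 0 };

        // Euclidean distance: samples are compared at the same index.
        double ed = Math.Sqrt(a.Zip(b, (x, y) => (x - y) * (x - y)).Sum());

        // Cost of aligning a[i] with b[i - 1], the kind of shifted matching
        // a warping path can perform (a full DTW alignment of these two
        // series also has total cost 0).
        double warped = 0;
        for (int i = 1; i < a.Length; i++)
            warped += Math.Abs(a[i] - b[i - 1]);

        Console.WriteLine("ED:     " + ed);      // 2, penalizes the shift
        Console.WriteLine("Warped: " + warped);  // 0, the shapes match
    }
}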

Thus ED is not a real alternative to DTW, because its requirements and restrictions are much stricter. But to answer your question, I recommend this paper:

Time-series clustering – A decade review. Saeed Aghabozorgi, Ali Seyed Shirkhorshidi, Teh Ying Wah. http://www.sciencedirect.com/science/article/pii/S0306437915000733

This paper gives an overview of the (dis-)similarity measures used in time series clustering and is well worth reading in full.


answered Nov 15 '22 by Nikolas Rieble


If two paths are the same length, say n, then they are really points in a 2n-dimensional space. The first location determines the first two dimensions, the second location determines the next two dimensions, and so on. For example, if we just take the three points in your example, the path can be represented as the single 6-dimensional point (1, 5, 2, 7, 4, 4). If we want to compare this to another three-point path, we can compute either the Euclidean distance (square root of the sum of squares of per-dimension differences between the two points) or the Manhattan distance (sum of the absolute per-dimension differences).

For example, the boring path that stays at (0, 0) for all three times becomes the 6-dimensional point (0, 0, 0, 0, 0, 0). Then the Euclidean distance between this point and your example path is sqrt((1-0)^2 + (5-0)^2 + (2-0)^2 + (7-0)^2 + (4-0)^2 + (4-0)^2) = sqrt(111) = 10.54. The Manhattan distance is abs(1-0) + abs(5-0) + abs(2-0) + abs(7-0) + abs(4-0) + abs(4-0) = 23. This kind of difference between the metrics is not unusual, since the Manhattan distance is provably at least as great as the Euclidean distance.

Of course one problem with this approach is that not all paths will be of the same length. However, you can easily cut the longer path down to the same length as the shorter path, or treat the shorter of the two paths as staying at the same location, or as continuing in the same direction, after its measurements end, until both paths are the same length. Either approach will introduce some inaccuracies, but no matter what you do you have to deal with the fact that you are missing data on the shorter path and have to make up for it somehow.

EDIT:

Assuming that path1 and path2 are both List<Tuple<int, int>> objects containing the points, we can cut off the longer list to match the shorter list as:

// Enumerable.Zip stops when it reaches the end of the shorter sequence,
// so the longer path is effectively cut down to the shorter one's length
// (requires using System.Linq)
List<Tuple<int, int, int, int>> matchingPoints = Enumerable.Zip(path1, path2,
    (tupl1, tupl2) =>
        Tuple.Create(tupl1.Item1, tupl1.Item2, tupl2.Item1, tupl2.Item2))
    .ToList();

Then, you can use the following code to find the Manhattan distance:

int manhattanDistance = matchingPoints
    .Sum(tupl => Math.Abs(tupl.Item1 - tupl.Item3)
               + Math.Abs(tupl.Item2 - tupl.Item4));

With the same assumptions as for the Manhattan distance, we can compute the Euclidean distance as:

// Math.Pow returns double, so the sum of squares is a double as well
double euclideanDistanceSquared = matchingPoints
    .Sum(tupl => Math.Pow(tupl.Item1 - tupl.Item3, 2)
               + Math.Pow(tupl.Item2 - tupl.Item4, 2));
double euclideanDistance = Math.Sqrt(euclideanDistanceSquared);
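
For the other option mentioned earlier, treating the shorter path as if it simply stays at its last recorded location, one possible sketch (my own addition, not part of the original answer) pads the shorter list with copies of its last point and then reuses the same Zip/Sum code:

// Assumes the same context as above (System, System.Collections.Generic,
// System.Linq, and the non-empty path1/path2 lists). Repeats the last point
// of a path until it reaches the requested length.
static List<Tuple<int, int>> PadToLength(List<Tuple<int, int>> path, int length)
{
    var padded = new List<Tuple<int, int>>(path);
    while (padded.Count < length)
        padded.Add(padded[padded.Count - 1]); // stay at the last location
    return padded;
}

// Usage: pad both paths to the longer length, then zip p1 and p2 exactly
// as path1 and path2 were zipped above.
int target = Math.Max(path1.Count, path2.Count);
List<Tuple<int, int>> p1 = PadToLength(path1, target);
List<Tuple<int, int>> p2 = PadToLength(path2, target);

Extrapolating "in the same direction" instead would be a similar loop that keeps adding the difference between the last two recorded points.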

answered Nov 15 '22 by Adam Mihalcin