How do you measure latency in low-latency environments?

Tags: measurement, latency

Here's the setup: your system receives a stream of data containing discrete messages (usually 32-128 bytes each). As part of your processing pipeline, each message passes through two physically separate applications that exchange the data using a low-latency mechanism (such as messaging over UDP, or RDMA), and is finally delivered to a client via the same mechanism.

Assuming you can inject yourself at any level, including wire protocol analysis, what tools and/or techniques would you use to measure the latency of your system? As part of this, I'm assuming that every message delivered to the system results in a corresponding (though not equivalent) message being pushed through the system and delivered to the client.

The only tool I've seen on the market for this is TS-Associates TipOff. I'm sure that with the right access you could measure the same information using a wire analysis tool (à la Wireshark) and the right dissectors, but is this the right approach, or are there commodity solutions that I can use?


asked Aug 05 '09 21:08

Ajaxx

2 Answers

Your last paragraph describes how it is typically done. The usual suspects in this field (at least as far as I know, for Wall Street market-data latency) are:

TSA (TS Associates)
Correlix
Corvil
Napatech (hardware capture devices)
Endace (hardware capture devices)

There was another badly run company that recently burned through their VC money (4 million?).

When data is converted into a different format along the way (say, at a direct exchange feed, RMDS, or another server that changes the protocol), you need to be able to parse the payloads to correlate the messages. This can be challenging, since data vendors sometimes do not expose their message definitions.

I think there are hardware devices that inject timestamps into the payload so the client can see them. Of course, as another poster pointed out, the question of time is very important: all the devices and clients must share the same time reference, and it has to be accurate.

The last time I spoke with TSA, an installation with 4 observation points was on the order of $150k. I suspect that the others listed above are similar in price.

The hardware cards listed above start around $2k (for a bare bones card) and go up (significantly) from there.

To do it in software you'd need to have clients using pcap (or something similar) and look at the payloads and try to match them up. In some cases it is difficult to get this to be deterministic - especially at the start of a "session" or if messages are missing from one pipe. Usually after some threshold if you don't match something, you just drop it.
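As a rough illustration of the matching step only (not of the capture itself), here is a minimal Python sketch. It assumes you have already reduced each capture to a time-sorted list of (timestamp in nanoseconds, key) records, that the key is something you can derive from the payload on both sides (a sequence number, an order ID, a hash), that keys are unique, and that both capture points share a clock. The function name `match_latencies` and the record layout are made up for this example.

```python
from collections import OrderedDict


def match_latencies(ingress, egress, timeout_ns):
    """Pair ingress and egress records by key and compute per-message latency.

    ingress, egress: lists of (timestamp_ns, key), each sorted by timestamp.
    Assumes keys are unique and both capture points share the same clock.
    Returns (latencies, dropped): a dict of key -> latency in ns, and a list
    of ingress keys that were never matched within timeout_ns.
    """
    pending = OrderedDict()  # key -> ingress timestamp, oldest first
    latencies = {}
    dropped = []
    i = 0
    for out_ts, key in egress:
        # Admit every ingress record captured up to this egress time.
        while i < len(ingress) and ingress[i][0] <= out_ts:
            in_ts, in_key = ingress[i]
            pending[in_key] = in_ts
            i += 1
        # Give up on anything that has waited longer than the threshold.
        while pending:
            old_key, in_ts = next(iter(pending.items()))
            if out_ts - in_ts <= timeout_ns:
                break
            pending.popitem(last=False)
            dropped.append(old_key)
        # Match this egress record, if its ingress side is still pending.
        in_ts = pending.pop(key, None)
        if in_ts is not None:
            latencies[key] = out_ts - in_ts
    # Whatever is left never showed up on the egress side.
    dropped.extend(pending)
    dropped.extend(k for _, k in ingress[i:])
    return latencies, dropped


ingress = [(0, "a"), (1000, "b"), (2000, "c")]
egress = [(500, "a"), (2500, "c"), (10000, "b")]
latencies, dropped = match_latencies(ingress, egress, timeout_ns=5000)
```

In this toy run "a" and "c" are matched, while "b" turns up after the threshold and is dropped. In a real setup most of the work is in deriving a reliable key from payloads whose format changes between hops.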

EDIT: DISCLAIMER: I am also part of the venture now and should disclose that.


answered Nov 10 '22 00:11

Tim

A recent paper might be of some use (and would also be much cheaper than hardware-based solutions).

There are also ways of accounting for clock skew fairly accurately. The last time I seriously looked into one-way latency measurement research (a couple of years ago), the most accurate technique was a linear programming algorithm by Sue Moon (with reference code conveniently available here). Without some rather modern linear programming techniques, though, it is fairly impractical to run as an online algorithm. It is best to record timestamps periodically throughout the day without doing any calculations, and then run the LP algorithm on the accumulated data afterwards. A few other techniques were quick enough to run online (including the one from the seminal paper by Vern Paxson), but they were all much less accurate.
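To give a feel for the offline approach, here is a small Python sketch of the linear-programming idea as I understand it. It is not the reference code mentioned above. The assumption is that measured one-way delay drifts linearly with time because of skew, so you fit a line that stays below every (send time, measured delay) sample while minimizing the total vertical distance to the samples. With only two unknowns, that LP can be solved without a solver: the optimum is the edge of the lower convex hull that spans the mean send time. The function name `estimate_skew` and the sample layout are made up for this example.

```python
def _cross(o, a, b):
    """Z-component of (a - o) x (b - o); positive means a left turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def estimate_skew(samples):
    """Fit delay ~= skew * t + offset with the line below every sample.

    samples: list of (send_time_s, measured_delay_s) pairs.
    Solves: minimize sum(d_i - skew * t_i - offset)
            subject to skew * t_i + offset <= d_i for all i.
    Returns (skew, offset); skew is in seconds of drift per second.
    """
    points = sorted(set(samples))
    if len(points) < 2 or points[0][0] == points[-1][0]:
        raise ValueError("need samples at two or more distinct send times")
    # Lower convex hull (monotone chain): the candidate supporting lines.
    hull = []
    for p in points:
        while len(hull) >= 2 and _cross(hull[-2], hull[-1], p) <= 0:
            hull.pop()
        hull.append(p)
    # The objective equals a constant minus N times the line's height at
    # the mean send time, so pick the hull edge that spans that time.
    t_mean = sum(t for t, _ in samples) / len(samples)
    for (t0, d0), (t1, d1) in zip(hull, hull[1:]):
        if t1 > t0 and t0 <= t_mean <= t1:
            skew = (d1 - d0) / (t1 - t0)
            return skew, d0 - skew * t0
    raise ValueError("no hull edge spans the mean send time")


# Synthetic data: 0.5 ms base delay, 100 ppm skew, non-negative queueing noise.
samples = [(float(i), 0.0005 + 1e-4 * i + ((i * 7) % 5) * 1e-4)
           for i in range(100)]
skew, offset = estimate_skew(samples)
# Delays with the estimated linear drift removed.
corrected = [d - skew * t for t, d in samples]
```

This only works after the fact on accumulated timestamps, which fits the record-now, compute-later workflow described above.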

answered Nov 10 '22 00:11

strangelydim
