Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What would cause UDP packets to be dropped when being sent to localhost?

I'm sending very large (64000 bytes) datagrams. I realize that the MTU is much smaller than 64000 bytes (a typical value is around 1500 bytes, from my reading), but I would suspect that one of two things would happen - either no datagrams would make it through (everything greater than 1500 bytes would get silently dropped or cause an error/exception to be thrown) or the 64000 byte datagrams would get chunked into about 43 1500 byte messages and transmitted transparently.

Over a long run (2000+ 64000 byte datagrams), about 1% (which seems abnormally high for even a LAN) of the datagrams get dropped. I might expect this over a network, where datagrams can arrive out of order, get dropped, filtered, and so on. However, I did not expect this when running on localhost.

What is causing the inability to send/receive data locally? I realize UDP is unreliable, but I didn't expect it to be so unreliable on localhost. I'm wondering if it's just a timing issue since both the sending and receiving components are on the same machine.

For completeness, I've included the code to send/receive datagrams.

Sending:

DatagramSocket socket = new DatagramSocket(senderPort);  int valueToSend = 0;  while (valueToSend < valuesToSend || valuesToSend == -1) {     byte[] intBytes = intToBytes(valueToSend);      byte[] buffer = new byte[bufferSize - 4];       //this makes sure that the data is put into an array of the size we want to send     byte[] bytesToSend = concatAll(intBytes, buffer);      System.out.println("Sending " + valueToSend + " as " + bytesToSend.length + " bytes");      DatagramPacket packet = new DatagramPacket(bytesToSend,                         bufferSize, receiverAddress, receiverPort);      socket.send(packet);      Thread.sleep(delay);      valueToSend++; } 

Receiving:

DatagramSocket socket = new DatagramSocket(receiverPort);  while (true) {     DatagramPacket packet = new DatagramPacket(             new byte[bufferSize], bufferSize);      System.out.println("Waiting for datagram...");     socket.receive(packet);      int receivedValue = bytesToInt(packet.getData(), 0);      System.out.println("Received: " + receivedValue             + ". Expected: " + expectedValue);      if (receivedValue == expectedValue) {         receivedDatagrams++;         totalDatagrams++;     }     else {         droppedDatagrams++;         totalDatagrams++;     }      expectedValue = receivedValue + 1;     System.out.println("Expected Datagrams: " + totalDatagrams);     System.out.println("Received Datagrams: " + receivedDatagrams);     System.out.println("Dropped Datagrams: " + droppedDatagrams);     System.out.println("Received: "             + ((double) receivedDatagrams / totalDatagrams));     System.out.println("Dropped: "             + ((double) droppedDatagrams / totalDatagrams));     System.out.println(); } 
like image 444
Thomas Owens Avatar asked Nov 01 '11 15:11

Thomas Owens


People also ask

Why do UDP packets get dropped?

The UDP packet loss is especially affected by TCP traffic and its flow control mechanism. This is because TCP flow control continues to increase its window size until packet loss occurs if the advertised window size is large enough.

What causes UDP errors?

Packet receive errors doesn't include the problems occured on network card level, it shows only received packets on udp protocol stack. Main reasons for packet receive errors: udp packet header corruption or checksum problems. packet receive buffer problems in application or kernel side.

How often are UDP packets dropped?

Packet loss happens for multiple reasons. Primarily it is caused by errors on individual links and network congestion. Packet loss due to errors on the link is very low, when links are working properly. Less than 0.01% is not unusual.

Will UDP resend a dropped packet?

This is where the User Datagram Protocol (UDP) comes in. The UDP does not require a connection, and it will not resend data packets if there are errors. UDP's data transmission time is much faster because UDP is connectionless, making it better suited for industries like VoIP and video games.


2 Answers

Overview

What is causing the inability to send/receive data locally?

Mostly buffer space. Imagine sending a constant 10MB/second while only able to consume 5MB/second. The operating system and network stack can't keep up, so packets are dropped. (This differs from TCP, which provides flow control and re-transmission to handle such a situation.)

Even when data is consumed without overflowing buffers, there might be small time slices where data cannot be consumed, so the system will drop packets. (Such as during garbage collection, or when the OS task switches to a higher-priority process momentarily, and so forth.)

This applies to all devices in the network stack. A non-local network, an Ethernet switch, router, hub, and other hardware will also drop packets when queues are full. Sending a 10MB/s stream through a 100MB/s Ethernet switch while someone else tries to cram 100MB/s through the same physical line will cause dropped packets.

Increase both the socket buffers size and operating system's socket buffer size.

Linux

The default socket buffer size is typically 128k or less, which leaves very little room for pausing the data processing.

sysctl

Use sysctl to increase the transmit (write memory [wmem]) and receive (read memory [rmem]) buffers:

  • net.core.wmem_max
  • net.core.wmem_default
  • net.core.rmem_max
  • net.core.rmem_default

For example, to bump the value to 8 megabytes:

sysctl -w net.core.rmem_max=8388608 

To make the setting persist, update /etc/sysctl.conf as well, such as:

net.core.rmem_max=8388608 

An in-depth article on tuning the network stack dives into far more details, touching on multiple levels of how packets are received and processed in Linux from the kernel's network driver through ring buffers all the way to C's recv call. The article describes additional settings and files to monitor when diagnosing network issues. (See below.)

Before making any of the following tweaks, be sure to understand how they affect the network stack. There is a real possibility of rendering your network unusable. Choose numbers appropriate for your system, network configuration, and expected traffic load:

  • net.core.rmem_max=8388608
  • net.core.rmem_default=8388608
  • net.core.wmem_max=8388608
  • net.core.wmem_default=8388608
  • net.ipv4.udp_mem='262144 327680 434274'
  • net.ipv4.udp_rmem_min=16384
  • net.ipv4.udp_wmem_min=16384
  • net.core.netdev_budget=600
  • net.ipv4.ip_early_demux=0
  • net.core.netdev_max_backlog=3000

ethtool

Additionally, ethtool is useful to query or change network settings. For example, if ${DEVICE} is eth0 (use ip address or ipconfig to determine your network device name), then it may be possible to increase the RX and TX buffers using:

  • ethtool -G ${DEVICE} rx 4096
  • ethtool -G ${DEVICE} tx 4096

iptables

By default, iptables will log information about packets, which consumes CPU time, albeit minimal. For example, you can disable logging of UDP packets on port 6004 using:

iptables -t raw -I PREROUTING 1 -p udp --dport 6004 -j NOTRACK iptables -I INPUT 1 -p udp --dport 6004 -j ACCEPT 

Your particular port and protocol will vary.

Monitoring

Several files contain information about what is happening to network packets at various stages of sending and receiving. In the following list ${IRQ} is the interrupt request number and ${DEVICE} is the network device:

  • /proc/cpuinfo - shows number of CPUs available (helpful for IRQ-balancing)
  • /proc/irq/${IRQ}/smp-affinity - shows IRQ affinity
  • /proc/net/dev - contains general packet statistics
  • /sys/class/net/${DEVICE}/queues/QUEUE/rps_cpus - relates to Receive Packet Steering (RPS)
  • /proc/softirqs - used for ntuple filtering
  • /proc/net/softnet_stat - for packet statistics, such as drops, time squeezes, CPU collisions, etc.
  • /proc/sys/net/core/flow_limit_cpu_bitmap - shows packet flow (can help diagnose drops between large and small flows)
  • /proc/net/snmp
  • /proc/net/udp

Summary

Buffer space is the most likely culprit for dropped packets. There are numerous buffers strewn throughout the network stack, each having its own impact on sending and receiving packets. Network drivers, operating systems, kernel settings, and other factors can affect packet drops. There is no silver bullet.

Further Reading

  • https://github.com/leandromoreira/linux-network-performance-parameters
  • http://man7.org/linux/man-pages/man7/udp.7.html
  • http://www.ethernetresearch.com/geekzone/linux-networking-commands-to-debug-ipudptcp-packet-loss/
like image 133
nos Avatar answered Oct 09 '22 03:10

nos


UDP pkts scheduling may be handled by multiple threads on OS level. That would explain why you receive them out of order even on 127.0.0.1.

like image 45
Niko Avatar answered Oct 09 '22 03:10

Niko