Prev: [RFC PATCH v2 3/5] netdev: add tracepoints to netdev layer
Next: [PATCH] memory hotplug disable boot option
From: Koki Sanagi on 24 Jun 2010 21:00 CHANGE-LOG since v1: 1) fix to enable "perf trace record netdev-times" run with a script. 2) add some options to "perf trace report netdev-times". A detail is in PATCH 5/5. These patch-set adds tracepoints to show us a process of packets. Using these tracepoints and existing points, we can get the time when packet passes through some points in transmit or receive sequence. For example, this is an output of perf script which is attached by patch 5/5. 79074.756672832sec cpu=1 irq_entry(+0.000000msec,irq=77:eth3) |------------softirq_raise(+0.001277msec) irq_exit (+0.002278msec) | | softirq_entry(+0.003562msec) | |---netif_receive_skb(+0.006279msec,len=100) | | | skb_copy_datagram_iovec(+0.038778msec, 2285:sshd) | napi_poll_exit(+0.017160msec, eth3) | softirq_exit(+0.018248msec) The above is a receive side. Like this, it can show receive sequence from interrupt(irq_entry) to application(skb_copy_datagram_iovec). There are 8 tracepoints in this side. All events except for skb_copy_datagram_iovec can be associated with each other by CPU number. skb_copy_datagram_iovec can be associated with netif_receive_skb by skbaddr. This script shows one NET_RX softirq and events related to it. All relative time bases on first irq_entry which raise NET_RX softirq. dev len dev_queue_xmit|----------|dev_hard_start_xmit|-----|free_skb | | | eth3 114 79044.417123332sec 0.005242msec 0.103843msec eth3 114 79044.580090422sec 0.002306msec 0.103632msec eth3 114 79044.719078251sec 0.002288msec 0.104093msec The above is a transmit side. There are three tracepoints in this side. Point1 is before putting a packet to Qdisc. point2 is after ndo_start_xmit in dev_hard_start_xmit. It indicates finishing putting a packet to driver. point3 is in consume_skb and dev_kfree_skb_irq. It indicates freeing a transmitted packet. Values of this script are, from left, device name, length of a packet, a time of point1, an interval time between point1 and point2 and an interval time between point2 and point3. These times are useful to analyze a performance or to detect a point where packet delays. For example, - NET_RX softirq calling is late. - Application is late to take a packet. - It takes much time to put a transmitting packet to driver (It may be caused by packed queue) And also, these tracepoint help us to investigate a network driver's trouble from memory dump because ftrace records it to memory. And ftrace is so light even if always trace on. So, in a case investigating a problem which doesn't reproduce, it is useful. Thanks, Koki Sanagi. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ |