
block I/O tracing #196

Closed
cvonelm opened this issue Nov 4, 2021 · 4 comments · Fixed by #197
cvonelm (Member) commented Nov 4, 2021

I'm separating this from #194 so we can have a nice high-level discussion there and get into the nitty-gritty details of the implementation here.

event reading

By design there is one perf event stream per CPU, and we read each stream separately. This is problematic here, because one thread on one CPU can issue a block I/O request while a completely different thread (usually a kernel thread), possibly on a completely different CPU, receives the completion event.

A small BPF Python hack shows that differing issue/complete CPUs aren't an edge case: for the majority of events, the issuing and completing CPUs differ, so simply discarding those events isn't an option.

So instead we probably need to cache the issue and complete events at measurement time and construct a coherent view from the local per-CPU observations afterwards.
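One way to build such a coherent view: merge the cached per-CPU streams by timestamp, then pair issue and complete events by some request key. A minimal Python sketch (the event tuple format, kind strings, and the key are illustrative assumptions, not the lo2s implementation):

```python
def merge_streams(per_cpu_events):
    """Merge per-CPU event streams into one timeline and pair events.

    per_cpu_events: {cpu: [(timestamp, kind, key), ...]}, each list sorted
    by timestamp. kind is "issue" or "complete"; key identifies the
    request (e.g. device + sector).
    """
    # Flatten all per-CPU streams and sort globally by timestamp.
    merged = sorted(
        (ev for events in per_cpu_events.values() for ev in events),
        key=lambda ev: ev[0],
    )
    pending = {}   # key -> issue timestamp
    matched = []   # (key, issue_ts, complete_ts)
    for ts, kind, key in merged:
        if kind == "issue":
            pending[key] = ts
        elif key in pending:
            # Completion may arrive from a different CPU's stream;
            # after merging, the key alone pairs it with its issue.
            matched.append((key, pending.pop(key), ts))
    return matched
```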

cvonelm self-assigned this Nov 4, 2021
cvonelm (Member, Author) commented Nov 5, 2021

I/O Handle in Otf2xx

In OTF2 there is the concept of an I/O handle, which you need to create before you can write I/O operations and destroy afterwards. It corresponds to the classical notion of opening and closing a file or a network connection. Block I/O, however, is "stateless", so there is no direct equivalent. Do we assign one handle to every block device, or one handle to every block?

bmario (Member) commented Nov 5, 2021

For the record:

  • Block I/O will only be available in system monitoring mode
  • We try to implement block I/O using OTF2 I/O records
    • one IoHandle per block device
    • the request issued and request completed tracepoints will be mapped to IoOperationBegin/IoOperationIssued and IoOperationComplete, respectively
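The mapping above can be written down as a small table. A sketch (the kernel tracepoint names `block:block_rq_issue`/`block:block_rq_complete` are my assumption for "request issued"/"request completed"; the record names follow OTF2's I/O record naming, but this dict is illustrative, not the otf2xx API):

```python
# Hypothetical tracepoint -> OTF2 I/O record mapping, per the plan above.
TRACEPOINT_TO_RECORDS = {
    "block:block_rq_issue":    ["IoOperationBegin", "IoOperationIssued"],
    "block:block_rq_complete": ["IoOperationComplete"],
}

def records_for(tracepoint):
    """Return the OTF2 record sequence to emit for a kernel tracepoint."""
    return TRACEPOINT_TO_RECORDS.get(tracepoint, [])
```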

cvonelm (Member, Author) commented Dec 2, 2021

To measure latency you have to match up each queue-insert event with the corresponding queue-remove (completion) event. However, the events do not carry a simple ID that allows us to match them up. I've now tested three different matching strategies:

Replay the kernel FIFO based on the events we have

This assumes that the queue for every block device is a FIFO. To match inserts to completes, simply replay the behaviour of the kernel FIFO in lo2s, based on the events and timestamps we have.

Pro

  • If the underlying queue really is a single FIFO, we can replicate the kernel's behaviour exactly and get perfectly correct latency values.

Con

  • Completely useless if the underlying data structure is not a FIFO
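The FIFO replay could be sketched like this (a minimal sketch; the event format and the per-device queue are assumptions, not the lo2s code):

```python
from collections import deque, defaultdict

def fifo_match(events):
    """Match inserts to completes assuming each device queue is a FIFO.

    events: time-sorted (timestamp, kind, device) tuples, with kind
    "insert" or "complete". Returns (matches, unmatched_completes),
    where matches are (device, insert_ts, complete_ts) tuples.
    """
    queues = defaultdict(deque)   # device -> pending insert timestamps
    matches, unmatched = [], 0
    for ts, kind, dev in events:
        if kind == "insert":
            queues[dev].append(ts)
        elif queues[dev]:
            # FIFO assumption: the oldest pending insert completes first.
            matches.append((dev, queues[dev].popleft(), ts))
        else:
            unmatched += 1
    return matches, unmatched
```

If the device queue is not actually a FIFO, `popleft()` pairs the wrong events, which is exactly the failure mode measured below.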

Match based on sector number.

This basically assumes that the sector being written or read is unique at any point in time and can therefore be used as a key to match inserts with completes.

Pro

  • probably the closest thing to a real unique ID that we can get with tracepoints

Con

  • due to caching, the same sector shouldn't often be read or written in overlapping requests, but it is not a true unique ID.
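Sector-based matching could look roughly like this, with the latency falling out of the timestamp difference (a sketch; the `(device, sector)` key and event format are assumptions for illustration, not the lo2s code):

```python
def sector_match(events):
    """Match inserts to completes using (device, sector) as the key.

    events: time-sorted (timestamp, kind, device, sector) tuples.
    Returns (matches, lost), where matches are (key, latency) pairs and
    lost counts completes without a pending insert for their sector.
    """
    pending, matches, lost = {}, [], 0
    for ts, kind, dev, sector in events:
        key = (dev, sector)
        if kind == "insert":
            # Overlapping I/O on one sector would silently overwrite
            # the pending entry here -- the "not a true unique ID" con.
            pending[key] = ts
        elif key in pending:
            matches.append((key, ts - pending.pop(key)))
        else:
            lost += 1
    return matches, lost
```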

Match based on the address of the struct request

This is what biosnoop does: use the memory address of the struct request, which represents the block I/O request in the kernel, as a key. This address doesn't change, as the struct stays alive throughout the whole request.

Pro

  • Pretty much a unique id

Con

  • requires BPF

Event losses

Percentage of complete events for which no matching insert could be found:

  • replay FIFO: 43% (so sadly, no FIFO behaviour here)
  • match based on sector: 0.3%
  • match based on address of struct request: 0.2%

Latency Histograms

address of struct request as a key:
[latency histogram: bpf]

sector as a key:
[latency histogram: best_effort]

One could now test further whether the two strategies match up the same insert and complete events, but the latency histograms are similar enough that we can simply use the sector as the unique ID.

cvonelm (Member, Author) commented Dec 17, 2021

This is how it looks in Vampir for a simple test trace:

[Screenshot: Shared_Resource_Timeline_lo2s_trace_2021-12-14T15-24-21]
