Re: [Xen-devel] RING_HAS_UNCONSUMED_REQUESTS oddness

To: Ian Campbell <Ian.Campbell@xxxxxxxxxx>
From: Zoltan Kiss <zoltan.kiss@xxxxxxxxxx>
Date: Wed, 12 Mar 2014 17:14:13 +0000
Cc: "xen-devel@xxxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxxx>, Wei Liu <wei.liu2@xxxxxxxxxx>, Tim Deegan <tim@xxxxxxx>
Delivery-date: Wed, 12 Mar 2014 17:14:23 +0000
List-id: Xen developer discussion <xen-devel.lists.xen.org>

On 12/03/14 15:37, Ian Campbell wrote:

On Wed, 2014-03-12 at 15:14 +0000, Zoltan Kiss wrote:

On 12/03/14 14:30, Ian Campbell wrote:

On Wed, 2014-03-12 at 14:27 +0000, Zoltan Kiss wrote:

On 12/03/14 10:28, Ian Campbell wrote:

On Tue, 2014-03-11 at 23:24 +0000, Zoltan Kiss wrote:

On 11/03/14 15:44, Ian Campbell wrote:

Is it the case that this macro considers a request to be unconsumed if
the *response* to a request is outstanding as well as if the request
itself is still on the ring?

I don't think that would make sense. I think that everywhere this macro
is called, the caller is not interested in pending requests (pending
means consumed but not responded to).


It might be interested in such pending requests in some of the
pathological cases I allude to in the next paragraph though?

For example if the ring has unconsumed requests but there are no slots
free for a response, it would be better to treat it as no unconsumed
requests until space opens up for a response, otherwise something else
just has to abort the processing of the request when it notices the lack
of space.

(I'm totally speculating here BTW, I don't have any concrete idea why
things are done this way...)

I wonder if this apparently weird construction is due to pathological
cases when one or the other end is not picking up requests/responses?
i.e. trying to avoid deadlocking the ring or generating an interrupt
storm when the ring is full of one or the other or something along
those lines?


Also, let me quote again my example about when rsp makes sense:

"To clarify what this does, let me show an example:
req_prod = 253
req_cons = 256
rsp_prod_pvt = 0


I think to make sense of this I need to see the sequence of reads/writes
from both parties in a sensible ordering which would result in reads
showing the above. i.e. a demonstration of the race not just an
assertion that if the values are read as is things make sense.


Let me extend it:

- callback reads req_prod = 253


callback == backend? Which context is this code running in? Which part
of the system is the callback logically part of?

Yes, it is part of the backend: the function which handles the point at
which we can release a slot back. With grant copy we don't have such a
thing, but with mapping xenvif_zerocopy_callback does this (in the
classic kernel it had a different name, but we called it the page
destructor). It can run from any context; it depends on who calls
kfree_skb.

- frontend writes req_prod, now it's 256
- backend picks it up, and consumes those slots, req_cons becomes 256


"it"? Do you mean req_prod? Please be precise.

Yes, I meant req_prod. And backend means NAPI instance here.

- callback reads req_cons = 256


But the backend has also seen req_prod at 256 at this point, hasn't it?
You said so above but said "it" so I'm not sure. If the callback is part
of the backend then why hasn't it also seen this?

Yes, the NAPI instance has seen it, but the callback has not. It was
called from another context.

- req is therefore UINT_MAX-2 (253 - 256 wraps around as unsigned), but
actually there aren't any requests to consume, so it should be 0


Only if something is ignoring the fact that it has seen req_prod == 256.

If callback is some separate entity to backend within dom0 then what you
have here is an internal inconsistency in dom0 AFAICT. IOW it seems like
you are missing some synchronisation and/or have two different entities
acting as backend.

The callback only needs to know whether it should poke the NAPI instance
or not. There is this special case: if there are still a few unconsumed
requests, but the ring is nearly full of pending requests and
xenvif_tx_pending_slots_available says NAPI should bail out, we have to
schedule it back once we have enough free pending slots again.

As I said in another mail of this thread, this poking happens in the
callback, but actually it should be moved to the dealloc thread. However,
thinking further, this whole xenvif_tx_pending_slots_available stuff
seems to be unnecessary to me:

It is supposed to check if we have enough slots in the pending ring for
the maximum number of possible slots, otherwise the backend bails out. It
does so because if the backend starts to consume the requests from the
shared ring but runs out of free slots in the pending ring, we are in
trouble. But the pending ring is supposed to have the same number of
slots as the shared one. And a consumed but not responded slot from the
shared ring means a used slot in the pending ring. Therefore the frontend
won't be able to push more than (MAX_PENDING_REQS - nr_pending_reqs(vif))
requests to the ring anyway. At least in practice, as MAX_PENDING_REQS =
RING_SIZE(...). If we could bind the two to each other directly, we could
get rid of this unnecessary checking, and whoever releases the used
pending slots should not poke the NAPI instance, because the frontend
will raise an interrupt if it sends a new packet anyway.
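To make the slot accounting argument concrete, here is a rough sketch in
C. It is a simplified model and not the netback code: struct ring_model
and all the helper names are made up for illustration, the ring size of
256 is assumed, MAX_PENDING_REQS is assumed to be equal to the ring size,
and the private/public response indexes are collapsed into a single
rsp_prod. Under those assumptions the number of unconsumed requests can
never exceed the number of free pending slots.

```c
#include <assert.h>

#define RING_SIZE        256u       /* assumed ring size */
#define MAX_PENDING_REQS RING_SIZE  /* assumption: the two are bound together */

/* Hypothetical, simplified view of the ring indexes (free-running counters). */
struct ring_model {
    unsigned int req_prod;  /* requests published by the frontend */
    unsigned int req_cons;  /* requests consumed by the backend */
    unsigned int rsp_prod;  /* responses published by the backend */
};

/* Consumed but not yet responded: each one occupies a pending slot. */
static unsigned int pending_in_use(const struct ring_model *r)
{
    return r->req_cons - r->rsp_prod;
}

/* Requests on the ring which the backend has not consumed yet. */
static unsigned int unconsumed(const struct ring_model *r)
{
    return r->req_prod - r->req_cons;
}

/* The frontend can only publish while a ring slot is free. */
static int frontend_can_push(const struct ring_model *r)
{
    return r->req_prod - r->rsp_prod < RING_SIZE;
}

/* The bound argued for above: unconsumed <= free pending slots. */
static int bound_holds(const struct ring_model *r)
{
    return unconsumed(r) <= MAX_PENDING_REQS - pending_in_use(r);
}

static void slot_accounting_example(void)
{
    struct ring_model r = { 0u, 0u, 0u };

    /* Frontend fills the ring completely. */
    while (frontend_can_push(&r))
        r.req_prod++;
    assert(unconsumed(&r) == RING_SIZE);
    assert(bound_holds(&r));

    /* Backend consumes 200 requests; each one takes a pending slot. */
    r.req_cons += 200u;
    assert(pending_in_use(&r) == 200u);
    assert(unconsumed(&r) == 56u);
    assert(!frontend_can_push(&r));
    assert(bound_holds(&r));

    /* Backend responds to 50; the frontend can push 50 more, no further. */
    r.rsp_prod += 50u;
    while (frontend_can_push(&r))
        r.req_prod++;
    assert(r.req_prod == 306u);
    assert(bound_holds(&r));
}
```

In this model the bound is tight in every step: the frontend is limited by
outstanding responses, and those are exactly the used pending slots.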

- callback reads rsp_prod_pvt = 0, because backend hasn't responded to
any requests
- rsp is therefore 256 - (256 - 0) = 0
- the macro returns rsp, as it is smaller. And that's good, because
although the macro failed to determine the number of unconsumed requests,
at least it detected that the ring is full of consumed but not replied
requests, so there shouldn't be any unconsumed req

And I call this best effort because if rsp_prod_pvt is e.g. 10, rsp will
then be 10 as well, we return it, and the caller thinks there are
unconsumed requests, although there aren't any.
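The arithmetic of the example can be reproduced with a small standalone
sketch. This is not the ring header itself: struct ring_view and
unconsumed_requests() are hypothetical stand-ins which apply the same
min(req, rsp) calculation to plain integers, and the ring size of 256 is
assumed from the example.

```c
#include <assert.h>
#include <limits.h>

#define RING_SIZE 256u  /* assumed ring size, matching the example */

/* Hypothetical snapshot of the indexes as one reader happened to see them. */
struct ring_view {
    unsigned int req_prod;      /* possibly stale read of the shared index */
    unsigned int req_cons;      /* backend's request consumer index */
    unsigned int rsp_prod_pvt;  /* backend's private response producer index */
};

/* Same calculation as the macro: the smaller of req and rsp. */
static unsigned int unconsumed_requests(const struct ring_view *r)
{
    unsigned int req = r->req_prod - r->req_cons;
    unsigned int rsp = RING_SIZE - (r->req_cons - r->rsp_prod_pvt);

    return req < rsp ? req : rsp;
}

static void worked_example(void)
{
    /* Stale req_prod, no responses yet: req wraps, rsp = 0 rescues us. */
    struct ring_view stale = { 253u, 256u, 0u };
    /* Same stale read, but 10 responses produced: rsp = 10 is returned. */
    struct ring_view stale_rsp10 = { 253u, 256u, 10u };
    /* Consistent read of req_prod: req = 0, so the result is 0. */
    struct ring_view fresh = { 256u, 256u, 10u };

    assert(stale.req_prod - stale.req_cons == UINT_MAX - 2u);
    assert(unconsumed_requests(&stale) == 0u);
    assert(unconsumed_requests(&stale_rsp10) == 10u);  /* false positive */
    assert(unconsumed_requests(&fresh) == 0u);
}
```

The second case is the best effort part: with a stale req_prod the result
is only correct when the rsp term happens to be 0.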

Zoli



_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel
