Re: [PATCH V3 15/23] xen/arm: Stick around in leave_hypervisor_to_guest until I/O has completed
Hi Volodymyr,

On 30/11/2020 20:51, Volodymyr Babchuk wrote:
> Oleksandr Tyshchenko writes:
>> From: Oleksandr Tyshchenko <oleksandr_tyshchenko@xxxxxxxx>
>>
>> This patch adds proper handling of the return value of
>> vcpu_ioreq_handle_completion(), which involves using a loop in
>> leave_hypervisor_to_guest().
>>
>> The reason to use an unbounded loop here is that a vCPU shouldn't
>> continue until its I/O has completed. In Xen's case, if an I/O never
>> completes then it most likely means that something went horribly
>> wrong with the Device Emulator, and it is most likely not safe to
>> continue. So letting the vCPU spin forever if the I/O never
>> completes is a safer action than letting it continue and leaving the
>> guest in an unclear state, and is the best we can do for now. This
>> wouldn't be an issue for Xen, as do_softirq() would be called on
>> every iteration of the loop. In case of failure, the guest will
>> crash and the vCPU will be unscheduled.
>
> Why don't you block the vCPU there and unblock it when the response
> is ready?

The vCPU will already block while waiting for the event channel. See
the call to wait_for_event_channel() in the ioreq code.

However, you can still receive a spurious unblock (e.g. the event
channel notification is received before the I/O has been handled). So
you have to loop around and check whether there is more work to do.

> If I got it right, the "client" vCPU will spin in the loop, eating
> its own scheduling budget with no useful work done.

You can't really do much about that if you have a rogue/buggy device
model.

> In the worst case, it will prevent the "server" vCPU from running.

How so? Xen will raise the schedule softirq if the I/O is handled. You
would have a pretty buggy system if your "client" vCPU is considered
to be a much higher priority than your "server" vCPU.

Cheers,

--
Julien Grall