Xen project Mailing List

Re: Design session notes: GPU acceleration in Xen

To: Demi Marie Obenour <demi@xxxxxxxxxxxxxxxxxxxxxx>

Date: Mon, 17 Jun 2024 11:13:38 +0200

Autocrypt: addr=jbeulich@xxxxxxxx; keydata= xsDiBFk3nEQRBADAEaSw6zC/EJkiwGPXbWtPxl2xCdSoeepS07jW8UgcHNurfHvUzogEq5xk hu507c3BarVjyWCJOylMNR98Yd8VqD9UfmX0Hb8/BrA+Hl6/DB/eqGptrf4BSRwcZQM32aZK 7Pj2XbGWIUrZrd70x1eAP9QE3P79Y2oLrsCgbZJfEwCgvz9JjGmQqQkRiTVzlZVCJYcyGGsD /0tbFCzD2h20ahe8rC1gbb3K3qk+LpBtvjBu1RY9drYk0NymiGbJWZgab6t1jM7sk2vuf0Py O9Hf9XBmK0uE9IgMaiCpc32XV9oASz6UJebwkX+zF2jG5I1BfnO9g7KlotcA/v5ClMjgo6Gl MDY4HxoSRu3i1cqqSDtVlt+AOVBJBACrZcnHAUSuCXBPy0jOlBhxPqRWv6ND4c9PH1xjQ3NP nxJuMBS8rnNg22uyfAgmBKNLpLgAGVRMZGaGoJObGf72s6TeIqKJo/LtggAS9qAUiuKVnygo 3wjfkS9A3DRO+SpU7JqWdsveeIQyeyEJ/8PTowmSQLakF+3fote9ybzd880fSmFuIEJldWxp Y2ggPGpiZXVsaWNoQHN1c2UuY29tPsJgBBMRAgAgBQJZN5xEAhsDBgsJCAcDAgQVAggDBBYC AwECHgECF4AACgkQoDSui/t3IH4J+wCfQ5jHdEjCRHj23O/5ttg9r9OIruwAn3103WUITZee e7Sbg12UgcQ5lv7SzsFNBFk3nEQQCACCuTjCjFOUdi5Nm244F+78kLghRcin/awv+IrTcIWF hUpSs1Y91iQQ7KItirz5uwCPlwejSJDQJLIS+QtJHaXDXeV6NI0Uef1hP20+y8qydDiVkv6l IreXjTb7DvksRgJNvCkWtYnlS3mYvQ9NzS9PhyALWbXnH6sIJd2O9lKS1Mrfq+y0IXCP10eS FFGg+Av3IQeFatkJAyju0PPthyTqxSI4lZYuJVPknzgaeuJv/2NccrPvmeDg6Coe7ZIeQ8Yj t0ARxu2xytAkkLCel1Lz1WLmwLstV30g80nkgZf/wr+/BXJW/oIvRlonUkxv+IbBM3dX2OV8 AmRv1ySWPTP7AAMFB/9PQK/VtlNUJvg8GXj9ootzrteGfVZVVT4XBJkfwBcpC/XcPzldjv+3 HYudvpdNK3lLujXeA5fLOH+Z/G9WBc5pFVSMocI71I8bT8lIAzreg0WvkWg5V2WZsUMlnDL9 mpwIGFhlbM3gfDMs7MPMu8YQRFVdUvtSpaAs8OFfGQ0ia3LGZcjA6Ik2+xcqscEJzNH+qh8V m5jjp28yZgaqTaRbg3M/+MTbMpicpZuqF4rnB0AQD12/3BNWDR6bmh+EkYSMcEIpQmBM51qM EKYTQGybRCjpnKHGOxG0rfFY1085mBDZCH5Kx0cl0HVJuQKC+dV2ZY5AqjcKwAxpE75MLFkr wkkEGBECAAkFAlk3nEQCGwwACgkQoDSui/t3IH7nnwCfcJWUDUFKdCsBH/E5d+0ZnMQi+G0A nAuWpQkjM1ASeQwSHEeAWPgskBQL

Cc: Xenia Ragiadakou <burzalodowa@xxxxxxxxx>, Marek Marczykowski-Górecki <marmarek@xxxxxxxxxxxxxxxxxxxxxx>, Ray Huang <ray.huang@xxxxxxx>, Xen developer discussion <xen-devel@xxxxxxxxxxxxxxxxxxxx>, Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Roger Pau Monné <roger.pau@xxxxxxxxxx>

Delivery-date: Mon, 17 Jun 2024 09:13:47 +0000

List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 17.06.2024 02:38, Demi Marie Obenour wrote: > On Fri, Jun 14, 2024 at 10:39:37AM +0200, Roger Pau Monné wrote: >> On Fri, Jun 14, 2024 at 10:12:40AM +0200, Jan Beulich wrote: >>> On 14.06.2024 09:21, Roger Pau Monné wrote: >>>> On Fri, Jun 14, 2024 at 08:38:51AM +0200, Jan Beulich wrote: >>>>> On 13.06.2024 20:43, Demi Marie Obenour wrote: >>>>>> GPU acceleration requires that pageable host memory be able to be mapped >>>>>> into a guest. >>>>> >>>>> I'm sure it was explained in the session, which sadly I couldn't attend. >>>>> I've been asking Ray and Xenia the same before, but I'm afraid it still >>>>> hasn't become clear to me why this is a _requirement_. After all that's >>>>> against what we're doing elsewhere (i.e. so far it has always been >>>>> guest memory that's mapped in the host). I can appreciate that it might >>>>> be more difficult to implement, but avoiding to violate this fundamental >>>>> (kind of) rule might be worth the price (and would avoid other >>>>> complexities, of which there may be lurking more than what you enumerate >>>>> below). >>>> >>>> My limited understanding (please someone correct me if wrong) is that >>>> the GPU buffer (or context I think it's also called?) is always >>>> allocated from dom0 (the owner of the GPU). The underling memory >>>> addresses of such buffer needs to be mapped into the guest. The >>>> buffer backing memory might be GPU MMIO from the device BAR(s) or >>>> system RAM, and such buffer can be paged by the dom0 kernel at any >>>> time (iow: changing the backing memory from MMIO to RAM or vice >>>> versa). Also, the buffer must be contiguous in physical address >>>> space. >>> >>> This last one in particular would of course be a severe restriction. >>> Yet: There's an IOMMU involved, isn't there? >> >> Yup, IIRC that's why Ray said it was much more easier for them to >> support VirtIO GPUs from a PVH dom0 rather than classic PV one. >> >> It might be easier to implement from a classic PV dom0 if there's >> pv-iommu support, so that dom0 can create it's own contiguous memory >> buffers from the device PoV. > > What makes PVH an improvement here? I thought PV dom0 uses an identity > mapping for the IOMMU, True, but see how Roger mentioned PV IOMMU (which would allow a domain to move away from this identity mapping). Jan > while a PVH dom0 uses an IOMMU that mirrors the > dom0 second-stage page tables. In both cases, the device physical > addresses are identical to dom0’s physical addresses. > > PV is terrible for many reasons, so I’m okay with focusing on PVH dom0, > but I’d like to know why there is a difference.

©2013 Xen Project, A Linux Foundation Collaborative Project. All Rights Reserved.
Linux Foundation is a registered trademark of The Linux Foundation.
Xen Project is a trademark of The Linux Foundation.