
Re: [PATCH] iommu/amd-vi: do not error if device referenced in IVMD is not behind any IOMMU


  • To: Roger Pau Monné <roger.pau@xxxxxxxxxx>
  • From: Jan Beulich <jbeulich@xxxxxxxx>
  • Date: Wed, 9 Oct 2024 13:28:19 +0200
  • Cc: Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Willi Junga <xenproject@xxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx
  • Delivery-date: Wed, 09 Oct 2024 11:28:23 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 09.10.2024 13:13, Roger Pau Monné wrote:
> On Wed, Oct 09, 2024 at 12:52:29PM +0200, Jan Beulich wrote:
>> On 09.10.2024 10:03, Roger Pau Monné wrote:
>>> On Tue, Oct 08, 2024 at 04:01:28PM +0200, Jan Beulich wrote:
>>>> On 08.10.2024 12:47, Roger Pau Monne wrote:
>>>>> IVMD table contains restrictions about memory which must be mandatory
>>>>> assigned to devices (and which permissions it should use), or memory that
>>>>> should be never accessible to devices.
>>>>>
>>>>> Some hardware however contains ranges in IVMD that reference devices
>>>>> outside of the IVHD tables (in other words, devices not behind any IOMMU).
>>>>> Such mismatch will cause Xen to fail in register_range_for_device(),
>>>>> ultimately leading to the IOMMU being disabled, and Xen crashing as x2APIC
>>>>> support might be already enabled and relying on the IOMMU functionality.
>>>>
>>>> I find it hard to believe that on x86 systems with IOMMUs some devices
>>>> would be left uncovered by any IOMMU. Is it possible that IVHD is flawed
>>>> there? In which case we might rightfully refuse to boot? (Can you share
>>>> e.g. that "iommu=debug" output that results from parsing the tables on
>>>> that system?)
>>>
>>> I'm afraid I don't have any such systems to test myself, however I
>>> have the contents of IVRS:
>>>
>>>   ACPI Table Header
>>> ------------------------------------------------------------------
>>> Signature          : IVRS
>>> Length             : 0x000001F8
>>> Revision           : 0x02
>>> Checksum           : 0x06
>>> OEM ID             : AMD  
>>> OEM Table ID       : AmdTable
>>> OEM Revision       : 0x00000001
>>> Creator ID         : AMD 
>>> Creator Revision   : 0x00000001
>>> IVinfo             : 0x00203043
>>>       IVHD
>>>     ----------------------------------------------------------------
>>>     Type                  : 0x10
>>>     Flags                 : 0xB0
>>>     Length                : 0x0044
>>>     IOMMU Device ID       : 0x0002
>>>     Capability Offset     : 0x0040
>>>     IOMMU Base Address    : 0x00000000FD200000
>>>     Segment Group         : 0x0000
>>>     IOMMU Info            : 0x0000
>>>     IOMMU Feature Info    : 0x80048F6E
>>>               Range
>>>             --------------------------------------------------
>>>             Type                  : 0x03
>>>             Start of Range        : 0x0003
>>>             End of Range          : 0xFFFE
>>>             DTE Setting           : 0x00
>>>               Alias Range
>>>             --------------------------------------------------
>>>             Type                  : 0x43
>>>             Start of Range        : 0xFF00
>>>             End of Range          : 0xFFFF
>>>             DTE Setting           : 0x00
>>>             Source Device ID      : 0x00A5
>>>               Special Device
>>>             --------------------------------------------------
>>>             Type                  : 0x48
>>>             Device ID             : 0x0000
>>>             DTE Setting           : 0x00
>>>             Source Device ID      : 0x00A0
>>>             Handle                : 0x00
>>>             Variety               : HPET
>>>               Special Device
>>>             --------------------------------------------------
>>>             Type                  : 0x48
>>>             Device ID             : 0x0000
>>>             DTE Setting           : 0xD7
>>>             Source Device ID      : 0x00A0
>>>             Handle                : 0x21
>>>             Variety               : IOAPIC
>>>               Special Device
>>>             --------------------------------------------------
>>>             Type                  : 0x48
>>>             Device ID             : 0x0000
>>>             DTE Setting           : 0x00
>>>             Source Device ID      : 0x0001
>>>             Handle                : 0x22
>>>             Variety               : IOAPIC
>>>       IVHD
>>>     ----------------------------------------------------------------
>>>     Type                  : 0x11
>>>     Flags                 : 0x30
>>>     Length                : 0x0054
>>>     IOMMU Device ID       : 0x0002
>>>     Capability Offset     : 0x0040
>>>     IOMMU Base Address    : 0x00000000FD200000
>>>     Segment Group         : 0x0000
>>>     IOMMU Info            : 0x0000
>>>     IOMMU Feature Info    : 0x00048000
>>>               Range
>>>             --------------------------------------------------
>>>             Type                  : 0x03
>>>             Start of Range        : 0x0003
>>>             End of Range          : 0xFFFE
>>>             DTE Setting           : 0x00
>>>               Alias Range
>>>             --------------------------------------------------
>>>             Type                  : 0x43
>>>             Start of Range        : 0xFF00
>>>             End of Range          : 0xFFFF
>>>             DTE Setting           : 0x00
>>>             Source Device ID      : 0x00A5
>>>               Special Device
>>>             --------------------------------------------------
>>>             Type                  : 0x48
>>>             Device ID             : 0x0000
>>>             DTE Setting           : 0x00
>>>             Source Device ID      : 0x00A0
>>>             Handle                : 0x00
>>>             Variety               : HPET
>>>               Special Device
>>>             --------------------------------------------------
>>>             Type                  : 0x48
>>>             Device ID             : 0x0000
>>>             DTE Setting           : 0xD7
>>>             Source Device ID      : 0x00A0
>>>             Handle                : 0x21
>>>             Variety               : IOAPIC
>>>               Special Device
>>>             --------------------------------------------------
>>>             Type                  : 0x48
>>>             Device ID             : 0x0000
>>>             DTE Setting           : 0x00
>>>             Source Device ID      : 0x0001
>>>             Handle                : 0x22
>>>             Variety               : IOAPIC
>>>       IVMD
>>>     ----------------------------------------------------------------
>>>     Type                                 : 0x22
>>>     Flags                                : 0x08
>>>     Length                               : 0x0020
>>>     DeviceID                             : 0x0000
>>>     AuxiliaryData                        : 0x0FFF
>>>     Reserved                             : 0x0000000000000000
>>>     IVMD Start Address                   : 0x0000000096191000
>>>     IVMD Memory Block Length             : 0x0000000000000022
>>>       IVMD
>>>     ----------------------------------------------------------------
>>>     Type                                 : 0x22
>>>     Flags                                : 0x08
>>>     Length                               : 0x0020
>>>     DeviceID                             : 0x0000
>>>     AuxiliaryData                        : 0x0FFF
>>>     Reserved                             : 0x0000000000000000
>>>     IVMD Start Address                   : 0x0000000097D9E000
>>>     IVMD Memory Block Length             : 0x0000000000000022
>>>       IVMD
>>>     ----------------------------------------------------------------
>>>     Type                                 : 0x22
>>>     Flags                                : 0x08
>>>     Length                               : 0x0020
>>>     DeviceID                             : 0x0000
>>>     AuxiliaryData                        : 0x0FFF
>>>     Reserved                             : 0x0000000000000000
>>>     IVMD Start Address                   : 0x0000000097D9D000
>>>     IVMD Memory Block Length             : 0x0000000000000022
>>>       IVHD
>>>     ----------------------------------------------------------------
>>>     Type                  : 0x40
>>>     Flags                 : 0x30
>>>     Length                : 0x00D0
>>>     IOMMU Device ID       : 0x0002
>>>     Capability Offset     : 0x0040
>>>     IOMMU Base Address    : 0x00000000FD200000
>>>     Segment Group         : 0x0000
>>>     IOMMU Info            : 0x0000
>>>     IOMMU Feature Info    : 0x00048000
>>>               Range
>>>             --------------------------------------------------
>>>             Type                  : 0x03
>>>             Start of Range        : 0x0003
>>>             End of Range          : 0xFFFE
>>>             DTE Setting           : 0x00
>>>               Alias Range
>>>             --------------------------------------------------
>>>             Type                  : 0x43
>>>             Start of Range        : 0xFF00
>>>             End of Range          : 0xFFFF
>>>             DTE Setting           : 0x00
>>>             Source Device ID      : 0x00A5
>>>               Special Device
>>>             --------------------------------------------------
>>>             Type                  : 0x48
>>>             Device ID             : 0x0000
>>>             DTE Setting           : 0x00
>>>             Source Device ID      : 0x00A0
>>>             Handle                : 0x00
>>>             Variety               : HPET
>>>               Special Device
>>>             --------------------------------------------------
>>>             Type                  : 0x48
>>>             Device ID             : 0x0000
>>>             DTE Setting           : 0xD7
>>>             Source Device ID      : 0x00A0
>>>             Handle                : 0x21
>>>             Variety               : IOAPIC
>>>               Special Device
>>>             --------------------------------------------------
>>>             Type                  : 0x48
>>>             Device ID             : 0x0000
>>>             DTE Setting           : 0x00
>>>             Source Device ID      : 0x0001
>>>             Handle                : 0x22
>>>             Variety               : IOAPIC
>>>               Variable Length ACPI HID Device
>>>             --------------------------------------------------
>>>             Type                  : 0xF0
>>>             Device ID             : 0x00A5
>>>             DTE Setting           : 0x40
>>>             Hardware ID           : AMDI0020
>>>             Extended DTE Setting  : 
>>>             Unique ID Format      : 2
>>>             Unique ID Length      : 9
>>>             Unique ID             : \_SB.FUR0
>>>               Variable Length ACPI HID Device
>>>             --------------------------------------------------
>>>             Type                  : 0xF0
>>>             Device ID             : 0x00A5
>>>             DTE Setting           : 0x40
>>>             Hardware ID           : AMDI0020
>>>             Extended DTE Setting  : 
>>>             Unique ID Format      : 2
>>>             Unique ID Length      : 9
>>>             Unique ID             : \_SB.FUR1
>>>               Variable Length ACPI HID Device
>>>             --------------------------------------------------
>>>             Type                  : 0xF0
>>>             Device ID             : 0x00A5
>>>             DTE Setting           : 0x40
>>>             Hardware ID           : AMDI0020
>>>             Extended DTE Setting  : 
>>>             Unique ID Format      : 2
>>>             Unique ID Length      : 9
>>>             Unique ID             : \_SB.FUR2
>>>               Variable Length ACPI HID Device
>>>             --------------------------------------------------
>>>             Type                  : 0xF0
>>>             Device ID             : 0x00A5
>>>             DTE Setting           : 0x40
>>>             Hardware ID           : AMDI0020
>>>             Extended DTE Setting  : 
>>>             Unique ID Format      : 2
>>>             Unique ID Length      : 9
>>>             Unique ID             : \_SB.FUR3
>>>
>>> FWIW, I've checked on one of the AMD server systems we have in the
>>> lab, and the IVHD entries are fairly similar to the ones here, as
>>> neither the PCI Host Bridge, nor the IOMMU are covered by any IVHD
>>> block.  That system however doesn't have any IVMD blocks.
>>
>> Mine are a little different. The Dinar (Fam15) has an IVHD entry just
>> for the range 0-2 (host bridge, <nothing>, IOMMU). The Rome (Fam17)
>> has an IVHD entry just for 0 (host bridge), but not for the IOMMU. I
>> think it is entirely reasonable for host bridge(s) and IOMMU(s) to not
>> be covered by any IVHD. They aren't devices that would require
>> servicing by an IOMMU.
>>
>> Looking at the code I think we want to do things a little differently
>> though: Pull find_iommu_for_device() out of register_range_for_device()
>> and have parse_ivmd_device_range() do the skipping when there's no
>> IOMMU for a device.
> 
> What about parse_ivmd_device_select()?  The IOMMU check would also need
> to be duplicated there, which is not ideal IMO.

That's not ideal, but a reasonably small price to pay.
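
A minimal sketch of the restructuring discussed above, not actual Xen code:
the types, the `iommus` table and the function signatures are simplified,
hypothetical stand-ins that only borrow the names used in this thread. The
lookup is done by the range parser, which skips devices that are not behind
any IOMMU instead of failing. The IVHD range 0x0003-0xFFFE is taken from the
table dump quoted earlier.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in; not Xen's real IOMMU structure. */
struct iommu {
    uint16_t first_bdf;  /* first device ID covered by this IOMMU */
    uint16_t last_bdf;   /* last device ID covered by this IOMMU */
};

/* One IOMMU covering 0x0003-0xFFFE, as in the quoted IVHD range entries. */
static struct iommu iommus[] = { { 0x0003, 0xFFFE } };

/* Return the IOMMU covering bdf, or NULL if the device is behind none. */
static struct iommu *find_iommu_for_device(uint16_t bdf)
{
    for (size_t i = 0; i < sizeof(iommus) / sizeof(iommus[0]); i++)
        if (bdf >= iommus[i].first_bdf && bdf <= iommus[i].last_bdf)
            return &iommus[i];
    return NULL;
}

/*
 * Walk an IVMD device range, skipping devices without an IOMMU.
 * Returns the number of devices for which the range would be registered,
 * so the caller can decide what to do when that number is zero.
 */
static unsigned int parse_ivmd_device_range(uint16_t first, uint16_t last)
{
    unsigned int registered = 0;

    /* 32-bit counter so that last == 0xFFFF cannot wrap the loop. */
    for (uint32_t bdf = first; bdf <= last; bdf++) {
        if (!find_iommu_for_device((uint16_t)bdf))
            continue;  /* not behind any IOMMU: skip instead of failing */
        /* Real code would call register_range_for_device() here. */
        registered++;
    }
    return registered;
}
```

Assuming the AuxiliaryData field of the quoted IVMD blocks (0x0FFF) is the
last device ID of the range, parse_ivmd_device_range(0x0000, 0x0FFF) skips
devices 0x0000-0x0002 and counts the remaining ones. A select-type parser
would need the same NULL check, which is the duplication mentioned above.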

>> Plus error when no device in the range is covered
>> by an IOMMU, or if any two devices are covered by different IOMMUs.
> 
> I'm not sure I understand your last comment: do you mean to return an
> error if an IVMD block range covers devices assigned to different
> IOMMUs?  If that's the case, I'm afraid I don't agree, I don't see
> anywhere in the spec that notes an IVMD block range can apply to
> devices assigned to different IOMMUs.

Hmm, right, I take back that part.

> I also think returning an error when no device in the IVMD range is
> covered by an IOMMU is dubious.  Xen will already print warning
> messages about such firmware inconsistencies, but refusing to boot is
> too strict.

I disagree. We shouldn't enable DMA remapping in such an event. Whereas
the "refusing to boot" is interrupt remapping related iirc, if x2APIC
is already enabled. We need to properly separate the two (and the
discussion there was started quite a long time ago, but it got stuck at
some point); until such time it is simply an undesirable side effect of
the inappropriate implementation that in certain cases we fail boot when
we shouldn't.

Jan



 

