Xen project Mailing List

Re: [PATCH] x86/i8259: do not assume interrupts always target CPU0

To: Roger Pau Monne <roger.pau@xxxxxxxxxx>

Date: Mon, 25 Sep 2023 11:55:25 +0200

Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none

Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=PkhX9fBHFnm8D4sR7SXofUd78KR7Tart4aEfQ+w+b4Q=; b=gQFI+0TX6cu9D2TtvT4wiDlsnqdcwln3mrJ2b6GQpx9+ySqWo6ZDygeol4f1GnwEgAKFMVBl4nHyZQZ+g5Y+LqA44NhMCg7zQB+xnoe8uyqs/bqC/Jw8Nnz0fVf+IpwOOAps9Kj0y5cJ+FHvZTmrUkggmTYzv6u2uj4SbUePgTAQBOomioLvEsm9aT4ZVjJHvdXJ++hW40x1P/kYed01HCR9aY7SWG4fRf9P9TkFXdmc8kVXTVHzvERad24wmNq2ZCNO3JhYzZANzDmppQSPcifkbu8g4Y9tiASZlfqaoKL9Q6fNp5RAGniDITUP+GMVuDtQvRtDJ5+Qv1LTHjmvpg==

Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=TK4j+5klyH6SzsUpDOSfe3u5fMHUzxqET4Q1e6nq4MVr3J/jv+ObvTEXHUSqRtJJC8+FYzB4/B9B3ogb1FItA9vaK/yMiscFqVKa4R0pXDG7vy9EtzpKqbw6MIUXGQdG+uhLNapnO+4RaGE/UisKhTJfByaWEJBm03lDMvf/FkZrR43dq9q6qh7BFSf7hIfjqPlc591XdTvZ0ka21mnvb0fW3ASaJXxj9gWfwRB8PvpBag3TcmDtBdhPLxBTx7vyUDtsY+Q+pw8IVGVxKB4dAE+DnPReLWqlcQxFW2E3GfjewLZS5RXSqBuwC67gAUy/vgyxwl+Wstp4Gvh1A0DI6g==

Authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=suse.com;

Cc: Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Wei Liu <wl@xxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx

Delivery-date: Mon, 25 Sep 2023 09:55:39 +0000

List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 19.09.2023 13:06, Roger Pau Monne wrote: > Sporadically we have seen the following during AP bringup on AMD platforms > only: > > microcode: CPU59 updated from revision 0x830107a to 0x830107a, date = > 2023-05-17 > microcode: CPU60 updated from revision 0x830104d to 0x830107a, date = > 2023-05-17 > CPU60: No irq handler for vector 27 (IRQ -2147483648) > microcode: CPU61 updated from revision 0x830107a to 0x830107a, date = > 2023-05-17 > > This is similar to the issue raised on Linux commit 36e9e1eab777e, where they > also observed i8259 (active) vectors getting delivered to CPUs different than > 0. > > Adjust the target CPU mask of i8259 interrupt descriptors to contain all > possible CPUs, so that APs will reserve the vector at startup if any legacy > IRQ > is still delivered through the i8259. Note that if the IO-APIC takes over > those interrupt descriptors the CPU mask will be reset. > > Spurious i8259 interrupt vectors however (IRQ7 and IRQ15) can be injected even > when all i8259 pins are masked, and hence need to be handled on all CPUs. > Reserve such vectors in order to prevent dynamic interrupt sources from using > them. > > Finally, handle spurious i8259 interrupts on all CPUs and adjust the printed > message to display the CPU where the spurious interrupt has been received, so > it looks like: > > microcode: CPU1 updated from revision 0x830107a to 0x830107a, date = > 2023-05-17 > cpu1: spurious 8259A interrupt: IRQ7 > microcode: CPU2 updated from revision 0x830104d to 0x830107a, date = > 2023-05-17 > > Fixes: 3fba06ba9f8b ('x86/IRQ: re-use legacy vector ranges on APs') > Signed-off-by: Roger Pau Monné <roger.pau@xxxxxxxxxx> > --- > One theory I have is that the APs at some point (before jumping into Xen code) > have the local APIC hardware-disabled, and hence are considered valid targets > by the i8259, but by the time the vector is fetched from the i8259 the > interrupt has either been masked, or already consumed by a different CPU. Aiui with LAPIC disabled, IRQs should only be possible to go to CPU0, for there simply not being any SMP without LAPICs. Did you check that there are unmasked ExtINT LVT0 on APs? And unmasked PIC IRQs? (Although, for the latter, it could of course be that by the time we gain control, they're all masked again, but an IRQ was in the meantime classified as spurious.) > --- a/xen/arch/x86/i8259.c > +++ b/xen/arch/x86/i8259.c > @@ -222,7 +222,8 @@ static bool _mask_and_ack_8259A_irq(unsigned int irq) > is_real_irq = false; > /* Report spurious IRQ, once per IRQ line. */ > if (!(spurious_irq_mask & irqmask)) { > - printk("spurious 8259A interrupt: IRQ%d.\n", irq); > + printk("cpu%u: spurious 8259A interrupt: IRQ%u\n", > + smp_processor_id(), irq); > spurious_irq_mask |= irqmask; > } Nit: Perhaps, to be in line with the other message in context below, "CPU%u: ..."? > @@ -349,7 +350,20 @@ void __init init_IRQ(void) > continue; > desc->handler = &i8259A_irq_type; > per_cpu(vector_irq, cpu)[LEGACY_VECTOR(irq)] = irq; > - cpumask_copy(desc->arch.cpu_mask, cpumask_of(cpu)); > + > + /* > + * The interrupt affinity logic never targets interrupts to offline > + * CPUs, hence it's safe to use cpumask_all here. > + * > + * Legacy PIC interrupts are only targeted to CPU0, but depending on > + * the platform they can be distributed to any online CPU in > hardware. I'm unaware of such a distribution mechanism. Do you have a reference? (If I recall correctly, there needs to be a unique entity in the system that runs the INTA protocol with the [master] PIC.) > + * The kernel has no influence on that. So all active legacy vectors > + * must be installed on all CPUs. > + * > + * IO-APIC will change the destination mask if/when taking ownership > of > + * the interrupt. > + */ > + cpumask_copy(desc->arch.cpu_mask, &cpumask_all); > desc->arch.vector = LEGACY_VECTOR(irq); > } > > --- a/xen/arch/x86/irq.c > +++ b/xen/arch/x86/irq.c > @@ -466,6 +466,14 @@ int __init init_irq_data(void) > vector++ ) > __set_bit(vector, used_vectors); > > + /* > + * Mark i8259 spurious vectors as used to avoid (re)using them. > Otherwise > + * it won't be possible to distinguish between device triggered > interrupts > + * or spurious i8259 ones. > + */ You certainly mean {L,IO}APIC device triggered interrupts here? If so, why would they not be distinguishable? ExtINT IRQs don't set ISR bits, iirc. > + __set_bit(LEGACY_VECTOR(7), used_vectors); > + __set_bit(LEGACY_VECTOR(15), used_vectors); > + > return 0; > } Assuming no IRQs are handled through the PIC (which ought to be the case on virtually all systems), we'd have masked all of them before any APs are brought up. With them masked, aiui even spurious interrupts cannot occur. Still you're permanently removing close to 1% of an already known scarce resource. Can we, if at all, do this just temporarily, such that later on the vectors can become usable again? (See also setup_local_APIC() for ExtINT setup - only the BSP would ever have LVT0 unmasked once we've finished with our setup.) Jan > @@ -1920,7 +1928,6 @@ void do_IRQ(struct cpu_user_regs *regs) > kind = ""; > if ( !(vector >= FIRST_LEGACY_VECTOR && > vector <= LAST_LEGACY_VECTOR && > - !smp_processor_id() && > bogus_8259A_irq(vector - FIRST_LEGACY_VECTOR)) ) > { > printk("CPU%u: No irq handler for vector %02x (IRQ %d%s)\n",

©2013 Xen Project, A Linux Foundation Collaborative Project. All Rights Reserved.
Linux Foundation is a registered trademark of The Linux Foundation.
Xen Project is a trademark of The Linux Foundation.