|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] linux-3.9-rc0 regression from 3.8 SATA controller not detected under xen
On Thu, Feb 28, 2013 at 12:57:24AM +0100, Sander Eikelenboom wrote:
>
> Wednesday, February 27, 2013, 11:22:18 PM, you wrote:
>
>
> > On 2/27/2013 3:41 PM, Sander Eikelenboom wrote:
> >> Wednesday, February 27, 2013, 8:28:10 PM, you wrote:
> >>
> >>> On Wed, Feb 27, 2013 at 06:50:59PM +0100, Sander Eikelenboom wrote:
> >>>> Wednesday, February 27, 2013, 1:54:31 PM, you wrote:
> >>>>
> >>>>>>>> On 27.02.13 at 12:46, Sander Eikelenboom <linux@xxxxxxxxxxxxxx>
> >>>>>>>> wrote:
> >>>>>> [ 89.338827] ahci: probe of 0000:00:11.0 failed with error -22
> >>>>> Which is -EINVAL. With nothing else printed, I'm afraid you need to
> >>>>> find the origin of this return value by instrumenting the involved
> >>>>> call tree.
> >>>> Just wondering, is multiple msi's per device actually supported by xen ?
> >>> That is very good question. I know we support MSI-X b/c 1GB or 10GB NICs
> >>> use them and they work great with Xen.
> >>> BTW, this is merge:
> >>> ommit 5800700f66678ea5c85e7d62b138416070bf7f60
> >>> Merge: 266d7ad af8d102
> >>> Author: Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx>
> >>> Date: Tue Feb 19 19:07:27 2013 -0800
> >>> Merge branch 'x86-apic-for-linus' of
> >>> git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip
> >>>
> >>> Pull x86/apic changes from Ingo Molnar:
> >>> "Main changes:
> >>>
> >>> - Multiple MSI support added to the APIC, PCI and AHCI code -
> >>> acked
> >>> by all relevant maintainers, by Alexander Gordeev.
> >>>
> >>> The advantage is that multiple AHCI ports can have multiple MSI
> >>> irqs assigned, and can thus spread to multiple CPUs.
> >>>
> >>> [ Drivers can make use of this new facility via the
> >>> pci_enable_msi_block_auto() method ]
> >>
> >>
> >>> With MSI per device, the hypercall that ends up happening is:
> >>> PHYSDEVOP_map_pirq with:
> >>> map_irq.domid = domid;
> >>> map_irq.type = MAP_PIRQ_TYPE_MSI_SEG;
> >>> map_irq.index = -1;
> >>> map_irq.pirq = -1;
> >>> map_irq.bus = dev->bus->number |
> >>> (pci_domain_nr(dev->bus) << 16);
> >>> map_irq.devfn = dev->devfn;
> >>> Which would imply that we are doing this call multiple times?
> >>> (This is xen_initdom_setup_msi_irqs).
> >>> It looks like pci_enable_msi_block_auto is the multiple MSI one
> >>> and it should perculate down to xen_initdom_setup_msi_irqs.
> >>> Granted the xen_init.. does not do anything with the 'nvec' call.
> >>> So could I ask you try out your hunch by doing three things:
> >>> 1). Instrument xen_initdom_setup_msi_irqs to see if the
> >>> nvec has anything but 1 and in its loop instrument to
> >>> see if it has more than on MSI attribute?
> >>> 2). The ahci driver has ahci_init_interrupts which only does
> >>> the multiple MSI thing if AHCI_HFLAG_NO_MSI is not set.
> >>> If you edit drivers/ata/ahci ahci_port_info for the SB600 (or 700?)
> >>> to have AHCI_HFLAG_NO_MSI flag (you probably want to do this
> >>> seperatly from 1).
> >>> 3). Checkout before merge 5800700f66678ea5c85e7d62b138416070bf7f60
> >>> and try 266d7ad7f4fe2f44b91561f5b812115c1b3018ab?
> >>
> >> So of interest are commits:
> >> - 5ca72c4f7c412c2002363218901eba5516c476b1
> >> - 08261d87f7d1b6253ab3223756625a5c74532293
> >> - 51906e779f2b13b38f8153774c4c7163d412ffd9
> >>
> >> Hmmm reading the commit message of
> >> 51906e779f2b13b38f8153774c4c7163d412ffd9:
> >>
> >> x86/MSI: Support multiple MSIs in presense of IRQ remapping
> >>
> >> The MSI specification has several constraints in comparison with
> >> MSI-X, most notable of them is the inability to configure MSIs
> >> independently. As a result, it is impossible to dispatch
> >> interrupts from different queues to different CPUs. This is
> >> largely devalues the support of multiple MSIs in SMP systems.
> >>
> >> Also, a necessity to allocate a contiguous block of vector
> >> numbers for devices capable of multiple MSIs might cause a
> >> considerable pressure on x86 interrupt vector allocator and
> >> could lead to fragmentation of the interrupt vectors space.
> >>
> >> This patch overcomes both drawbacks in presense of IRQ remapping
> >> and lets devices take advantage of multiple queues and per-IRQ
> >> affinity assignments.
> >>
> >> At least makes clear why baremetal does boot and xen doesn't:
> >>
> >> Baremetal behaves differently and thus boots because interrupt remapping
> >> gets disabled on boot by the kernel iommu code due to the buggy bios iommu
> >> errata, so according to the commit message above it doesn't even try the
> >> multiple MSI per device scenario.
> >>
> >> So the question is if it can be enabled in Xen (and if it actually could
> >> be beneficial because the commit messages seems to indicate that could be
> >> questionable).
> >> If not, the check in arch/x86/kernel/apic/io_apic.c:setup_msi_irqs should
> >> fail
> > Except that function in Xen is not run. that is b/c
> > x86_msi_ops.setup_msi_irqs end up pointing to xen_initdom_setup_irqs.
> > While if IOMMU is enabled it gets set to irq_remapping_setup_msi_irqs.
>
> > So a fix like this:
> > diff --git a/arch/x86/pci/xen.c b/arch/x86/pci/xen.c
> > index 56ab749..47f8cca 100644
> > --- a/arch/x86/pci/xen.c
> > +++ b/arch/x86/pci/xen.c
> > @@ -263,6 +263,9 @@ static int xen_initdom_setup_msi_irqs(struct pci_dev
> > *dev, int nvec, int type)
> > int ret = 0;
> > struct msi_desc *msidesc;
>
> > + if (type == PCI_CAP_ID_MSI && nvec > 1)
> > + return 1;
> > +
> > list_for_each_entry(msidesc, &dev->msi_list, list) {
> > struct physdev_map_pirq map_irq;
> > domid_t domid;
>
>
> > (sorry about the paste getting messed up here) - ought to do it? As for
> > example on one of my AMD machines there is no IOMMU, and this is where
> > AHCI does work under baremetal but not under Xen.
>
> Yes it boots again :-)
Great! Are you OK if I put 'Reported-and-Tested-by:" tag on the patch with your
name for the above quick fix?
Thanks!
>
> [ 37.742109] SE | bus: 'pci': really_probe: probing driver ahci with device
> 0000:00:11.0
> [ 37.773491] really_probe: pinctrl_bind_pins(0000:00:11.0) ret: 0
> [ 37.798862] ahci 0000:00:11.0: SE | ahci_init_one start
> [ 37.822040] ahci 0000:00:11.0: version 3.0
> [ 37.841606] xen: registering gsi 19 triggering 0 polarity 1
> [ 37.865577] xen: --> pirq=19 -> irq=19 (gsi=19)
> [ 37.913087] ahci 0000:00:11.0: SE | pcim_enable_device(pdev) rc:0
> [ 37.938519] ahci 0000:00:11.0: SE pcim_iomap_regions_request_all(pdev, 1
> << ahci_pci_bar, DRV_NAME) rc:0
> [ 37.974447] ahci 0000:00:11.0: xen_initdom_setup_msi_irqs nvec: 4 type:5
> [ 38.001806] ahci 0000:00:11.0: xen_initdom_setup_msi_irqs nvec: 1 type:5
> [ 38.029026] ahci 0000:00:11.0: SE pci_enable_msi_block_auto(pdev, &maxvec)
> rc:1
> [ 38.057960] ahci 0000:00:11.0: SE | n_msis: 1
> [ 38.078065] ahci 0000:00:11.0: SE | ahci_configure_dma_masks(pdev,
> hpriv->cap & HOST_CAP_64) rc:0
> [ 38.112045] ahci 0000:00:11.0: SE | ahci_pci_reset_controller(host) rc:0
> [ 38.139426] ahci 0000:00:11.0: AHCI 0001.0200 32 slots 4 ports 6 Gbps 0xf
> impl SATA mode
> [ 38.170664] ahci 0000:00:11.0: flags: 64bit ncq sntf ilck pm led clo pmp
> pio slum part
> [ 38.201684] ahci 0000:00:11.0: SE | me here 1
> [ 38.221977] ahci 0000:00:11.0: SE | me here 2
> [ 38.244756] scsi0 : ahci
> [ 38.259700] scsi1 : ahci
> [ 38.274411] scsi2 : ahci
> [ 38.289278] scsi3 : ahci
> [ 38.303718] ata1: SATA max UDMA/133 abar m1024@0xf96ff000 port 0xf96ff100
> irq 121
> [ 38.332566] ata2: SATA max UDMA/133 abar m1024@0xf96ff000 port 0xf96ff180
> irq 121
> [ 38.361366] ata3: SATA max UDMA/133 abar m1024@0xf96ff000 port 0xf96ff200
> irq 121
> [ 38.390080] ata4: SATA max UDMA/133 abar m1024@0xf96ff000 port 0xf96ff280
> irq 121
> [ 38.418787] really_probe: dev->bus->probe(0000:00:11.0) ret: 0
> [ 38.442420] really_probe: 0000:00:11.0 done ret: 1
>
>
> > We can future wise implement a better version of this to deal with
> > multiple MSIs, but lets make sure to first get it booting.
> >> --
> >> Sander
> >>
> >>
> >>
> >>
> >>>> --
> >>>> Sander
> >>>>
> >>>>> Jan
> >>>>
> >>>>
> >>>>
> >>>> _______________________________________________
> >>>> Xen-devel mailing list
> >>>> Xen-devel@xxxxxxxxxxxxx
> >>>> http://lists.xen.org/xen-devel
> >>>>
> >>
> >>
>
>
>
>
> _______________________________________________
> Xen-devel mailing list
> Xen-devel@xxxxxxxxxxxxx
> http://lists.xen.org/xen-devel
>
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel
|
![]() |
Lists.xenproject.org is hosted with RackSpace, monitoring our |