[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] PV-vNUMA issue: topology is misinterpreted by the guest



On Thu, Jul 16, 2015 at 11:56:50AM +0100, Andrew Cooper wrote:
> On 16/07/15 11:47, Jan Beulich wrote:
> >>>> On 16.07.15 at 12:32, <dario.faggioli@xxxxxxxxxx> wrote:
> >> root@test:~# numactl --hardware
> >> available: 2 nodes (0-1)
> >> node 0 cpus: 0 1
> >> node 0 size: 475 MB
> >> node 0 free: 382 MB
> >> node 1 cpus: 2 3
> >> node 1 size: 495 MB
> >> node 1 free: 475 MB
> >> node distances:
> >> node   0   1 
> >>   0:  10  10 
> >>   1:  20  10
> >>
> >> root@test:~# cat 
> >> /sys/devices/system/cpu/cpu0/topology/thread_siblings_list 
> >> 0-1
> >> root@test:~# cat /sys/devices/system/cpu/cpu0/topology/core_siblings_list 
> >> 0-3
> >> root@test:~# cat 
> >> /sys/devices/system/cpu/cpu2/topology/thread_siblings_list 
> >> 2-3
> >> root@test:~# cat /sys/devices/system/cpu/cpu2/topology/core_siblings_list 
> >> 0-3
> >>
> >> So the complain during boot seems to be against 'core_siblings' (which
> >> was not what I expected, but perhaps I misremember the meaning of
> >> "core_siblings" VS. "thread_siblings" VS. smt-siblings in Linux; I'll
> >> double check).
> >>
> >> Anyway, is there anything we can do to fix or workaround things?
> > Make the guest honor topology also at the CPUID layer. Whether
> > that's by not wrongly consuming the respective CPUID bits (i.e. a
> > guest side change) or reflecting PV state in what the hypervisor
> > returns I'm not sure about. While the latter might be more clean,
> > I'd be afraid this might get in the way of what the tool stack wants
> > to see.
> 
> Xen's CPUID handling currently has no concept of per-core and
> per-package data in the cpuid policy.  The guest sees the information

Can / Will Xen have that concept in the future?

Wei.

> from the pcpu which happened to be running libxc at the time the policy
> was decided, with a Xen-level fudge factor applied.
> 
> There is also the regular problem of (failing to) trap cpuid
> instructions in PV guests, so any "numa aware" software which doesn't
> use the force override prefix will still fail in the above way.
> 
> Lots of issues in this area are discussed in
> http://xenbits.xen.org/people/andrewcoop/feature-levelling/feature-levelling-E.pdf
> , especially the extra work section at the end.
> 
> ~Andrew

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.