[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] vpmu=1 and running 'perf top' within a PVHVM guest eventually hangs dom0 and hypervisor has stuck vCPUS. Romley-EP (model=45, stepping=2)

To: "Konrad Rzeszutek Wilk" <konrad.wilk@xxxxxxxxxx>
From: "Jan Beulich" <JBeulich@xxxxxxxx>
Date: Wed, 13 Mar 2013 08:33:15 +0000
Cc: suravee.suthikulpanit@xxxxxxx, qing.he@xxxxxxxxx, eddie.dong@xxxxxxxxx, dietmar.hahn@xxxxxxxxxxxxxx, xen-devel <xen-devel@xxxxxxxxxxxxx>, jun.nakajima@xxxxxxxxx, jiongxi.li@xxxxxxxxx, boris.ostrovsky@xxxxxxxxxx
Delivery-date: Wed, 13 Mar 2013 08:33:35 +0000
List-id: Xen developer discussion <xen-devel.lists.xen.org>

>>> On 12.03.13 at 18:30, Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx> wrote:
> This issue I am encountering seems to only happen on multi-socket
> machines.
> 
> It also does not help that the only multi-socket box I have is
> an Romley-EP (so two socket SandyBridge CPUs). The other
> SandyBridge boxes I've (one socket) are not showing this. Granted
> they are also a different model (42).
> 
> The problem is that when I run 'perf top' within an SMP PVHVM
> guest, after a couple of seconds or minutes the guest hangs.
> Hypervisor ends up stuck too looping, and then the dom0 ends
> up hanging as well.
> 
> Dumping the cpu registers (Ctrl-A x3, then 'd'
> shows that the guest is pretty firmly stuck in vmx_vmexit_handler:
> 
> (XEN)    [<ffff82c4c01d386f>] vmx_vmexit_handler+0x22f/0x174
> 
> and if I let this stay for some time, dom0 detects that some
> of its VCPUs are hanged and it resorts to sending NMI. NMI
> is not implemented in pv-ops and then dom0 wedges. In some
> cases it also wedges itself when doing 'xl list' or any up-calls
> to the hypervisor.

Did you try running Xen with its watchdog (and perhaps Dom0
without)?

> Anyhow, following 'Ctrl-A x3, then 'v' tells me:
> 
> (XEN) Virtual processor ID = 0x0c02
> .. snip..
> (XEN) Virtual processor ID = 0x0fc4
> (XEN)   VCPU 3
> 
> and stays stuck there. Doing the 'Ctrl-A x3' and 'd' to
> see where it is stuck tells me:

Perhaps sending 'd' without first sending 'v' might better show where
the original hang is?

Jan


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel

Follow-Ups:
- Re: [Xen-devel] vpmu=1 and running 'perf top' within a PVHVM guest eventually hangs dom0 and hypervisor has stuck vCPUS. Romley-EP (model=45, stepping=2)
  - From: Konrad Rzeszutek Wilk

References:
- [Xen-devel] vpmu=1 and running 'perf top' within a PVHVM guest eventually hangs dom0 and hypervisor has stuck vCPUS. Romley-EP (model=45, stepping=2)
  - From: Konrad Rzeszutek Wilk

Prev by Date: Re: [Xen-devel] vpmu=1 and running 'perf top' within a PVHVM guest eventually hangs dom0 and hypervisor has stuck vCPUS. Romley-EP (model=45, stepping=2)
Next by Date: Re: [Xen-devel] [PATCH v2] VTD/Intremap: Disable Intremap on Chipset 5500/5520/X58 due to errata
Previous by thread: Re: [Xen-devel] vpmu=1 and running 'perf top' within a PVHVM guest eventually hangs dom0 and hypervisor has stuck vCPUS. Romley-EP (model=45, stepping=2)
Next by thread: Re: [Xen-devel] vpmu=1 and running 'perf top' within a PVHVM guest eventually hangs dom0 and hypervisor has stuck vCPUS. Romley-EP (model=45, stepping=2)
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.