Re: [Xen-devel] Hypervisor crash(!) on xl cpupool-numa-split
On 01/28/11 12:07, Andre Przywara wrote:
> Juergen Gross wrote:
>> On 01/28/11 00:18, Andre Przywara wrote:
>>> Hi,
>>>
>>> when I boot my machine without restricting Dom0 (dom0_mem=
>>> dom0_max_vcpus=) I get a _hypervisor_ crash when I run
>>> # xl cpupool-numa-split
>>> If Dom0's resources are limited on the Xen cmdline, everything
>>> works fine.
>>> The crashdump points to a scheduling problem with weights, so I
>>> assume the NUMA distribution algorithm somehow fools the hypervisor
>>> completely.
>>> I will investigate this further tomorrow, but maybe someone has a
>>> good idea.
>>
>> I've seen this once with an older cpupool version on a 24 processor
>> machine. It was NOT related to NUMA, but occurred only on reboot
>> after a Dom0 panic. The machine had an init script creating a cpupool
>> and populating it with cpus. The machine was then stuck in a panic
>> loop due to the BUG in sched_acct until it was reset manually. After
>> the reset the problem was gone.
>> As I was never able to reproduce the problem later (the same software
>> is running on dozens of machines!), I assumed there was a problem
>> related to the first Dom0 panic, maybe some corrupted BIOS tables.
>>
>> Can the crash be reproduced easily?
>
> Yes. If I don't specify dom0_max_vcpus= and dom0_mem= on the Xen
> cmdline, I can reliably trigger the crash with xl cpupool-numa-split.
> Omitting only dom0_max_vcpus does not suffice.

Do I understand correctly?
No crash with only dom0_max_vcpus= and no crash with only dom0_mem= ?

Could you try this patch?

diff -r b59f04eb8978 xen/common/schedule.c
--- a/xen/common/schedule.c	Fri Jan 21 18:06:23 2011 +0000
+++ b/xen/common/schedule.c	Fri Jan 28 12:42:46 2011 +0100
@@ -1301,7 +1301,9 @@ void schedule_cpu_switch(unsigned int cp
     idle = idle_vcpu[cpu];
     ppriv = SCHED_OP(new_ops, alloc_pdata, cpu);
+    BUG_ON(ppriv == NULL);
     vpriv = SCHED_OP(new_ops, alloc_vdata, idle, idle->domain->sched_priv);
+    BUG_ON(vpriv == NULL);
     pcpu_schedule_lock_irqsave(cpu, flags);

-- 
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6              Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions  e-mail: juergen.gross@xxxxxxxxxxxxxx
Domagkstr. 28                 Internet: ts.fujitsu.com
D-80807 Muenchen              Company details: ts.fujitsu.com/imprint.html

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel