[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] [PATCH v3] x86: correct socket_cpumask allocation
On Fri, 2015-07-10 at 16:13 +0100, Jan Beulich wrote: > >>> On 10.07.15 at 16:57, <dario.faggioli@xxxxxxxxxx> wrote: > > ... > > (XEN) Preparing system for ACPI S5 state. > > (XEN) Disabling non-boot CPUs ... > > (XEN) Broke affinity for irq 9 > > (XEN) cpu=1 cpu_to_socket=4294967295 > > (XEN) ----[ Xen-4.6-unstable x86_64 debug=y Tainted: C ]---- > > (XEN) CPU: 0 > > (XEN) RIP: e008:[<ffff82d0801886c2>] cpu_smpboot_free+0x43/0x28b > > ... > > > > I.e., it looks like phys_proc_id has already been reset to > > XEN_INVALID_SOCKET_ID, as we're kind-of racing with > > remove_siblinginfo(). > > Right. We have > > cpu_down() > stop_machine_run(take_cpu_down, ...) > notifier_call_chain(&cpu_chain, CPU_DYING, ...) > __cpu_disable() > remove_siblinginfo() > __cpu_die() > notifier_call_chain(&cpu_chain, CPU_DEAD, ...) > cpu_smpboot_free() > > I.e. a clear use-after-invalidate. > Exactly. I don't have a box with CAT, but on one, I expect similar problems to happen in: psr_cpu_fini() cat_cpu_fini() --> unsigned int socket = cpu_to_socket(cpu); as that also runs from CPU_DEAD. :-/ I guess this haven't seen any "let's shut the host down" kind of testing... Dario -- <<This happens because I choose it to happen!>> (Raistlin Majere) ----------------------------------------------------------------- Dario Faggioli, Ph.D, http://about.me/dario.faggioli Senior Software Engineer, Citrix Systems R&D Ltd., Cambridge (UK) Attachment:
signature.asc _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx http://lists.xen.org/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |