[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] stub domain crash related to bind_interdomain
>>> On 20.06.17 at 19:36, <srn@xxxxxxxxx> wrote: > On 06/20/2017 01:24 AM, Jan Beulich wrote: >>>>> On 20.06.17 at 01:39, <srn@xxxxxxxxx> wrote: >>> I have gotten messages like this sporadically in the qemu-dm log for stub >>> domains, both at domain start and domain reboot: >>> >>> evtchn_open() -> 7 >>> ERROR: bind_interdomain failed with rc=-22xenevtchn_bind_interdomain(121, 0) >>> = -22 >>> bind interdomain ioctl error 22 >>> Unable to find x86 CPU definition >>> close(0) >>> >>> It is not always remote port 0 that fails but typically is so. >> >> But I'm afraid this is a relevant distinction, and hence you may be >> seeing two different issues. Have you been able to find out where >> that remote port is coming from? I ask because port 0 is never a >> valid one (see evtchn_init() setting it to ECS_RESERVED). > > By inspection I think it is > shared_page->vcpu_ioreq[i].vp_eport used in helper2.c:cpu_x86_init because > otherwise I should see another message like > > xc_evtchn_bind_interdomain(21, 3) = 0 > first, and I only see one message from xc_evtchn_bind_interdomain. So perhaps a race between the setting up of that field and its consumption for binding? With most of the involved code in qemu being the same between use in Dom0 and in stubdom, it may simply be a race that happens to never be lost in the former case (and as you say it's rare enough in the latter). Otoh I'm not sure qemu-dm uses multiple threads in the first place, and if it doesn't I can't see ways for such an occasionally lost race. In any event - I'm not a qemu-dm specialist at all, so I'll defer further analysis on that side to people who are. Jan _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx https://lists.xen.org/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |