[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [Xen-devel] [PATCH 1/2] libxl: datacopier: Avoid eof/POLLHUP race
When the bootloader exits, several things change, all at once: (a) The master pty fd (held by libxl) starts to signal POLLHUP and maybe also POLLIN. (b) The child exits (so that the SIGCHLD self-pipe signals POLLIN, which will be handled by the libxl child process code. (c) reads on the master pty fd start to return EOF From the point of view of the datacopier these might happen in any order. (c) can be detected only after a previous POLLIN without POLLHUP and that previous POLLIN would be associated with data which was read, which must therefore have ended up in the dc's buffer. But nothing stops the dc from writing that data into the output fd and reporting eof before it calls poll again. This race is unlikely. But nevertheless it should be fixed. We solve the race with a poll of the reading fd, to double-check, when we detect eof via read. (This is only necessary if the caller has specified callback_pollhup, as otherwise POLLHUP|POLLIN - and, presumably, POLLIN followed perhaps by POLLHUP|POLLIN, is to be treated as eof anyway.) With a testing patch supplied by me, Roger Pau Monnà has reproduced the failure on FreeBSD and verified that this patch fixes the problem. Signed-off-by: Ian Jackson <Ian.Jackson@xxxxxxxxxxxxx> Tested-by: Roger Pau Monnà <roger.pau@xxxxxxxxxx> Acked-by: Ian Campbell <ian.campbell@xxxxxxxxxx> CC: Andrew Cooper <andrew.cooper3@xxxxxxxxxx> CC: Ross Lagerwall <ross.lagerwall@xxxxxxxxxx> CC: Wei Liu <wei.liu2@xxxxxxxxxx> --- tools/libxl/libxl_aoutils.c | 16 ++++++++++++++++ 1 file changed, 16 insertions(+) diff --git a/tools/libxl/libxl_aoutils.c b/tools/libxl/libxl_aoutils.c index 3942634..ddbe6ae 100644 --- a/tools/libxl/libxl_aoutils.c +++ b/tools/libxl/libxl_aoutils.c @@ -250,6 +250,22 @@ static void datacopier_readable(libxl__egc *egc, libxl__ev_fd *ev, return; } if (r == 0) { + if (dc->callback_pollhup) { + /* It might be that this "eof" is actually a HUP. If + * the caller cares about the difference, + * double-check using poll(2). */ + struct pollfd hupchk; + hupchk.fd = ev->fd; + hupchk.events = POLLIN; + hupchk.revents = 0; + r = poll(&hupchk, 1, 0); + if (r < 0) + LIBXL__EVENT_DISASTER(egc, + "unexpected failure polling fd for datacopier eof hup check", + errno, 0); + if (datacopier_pollhup_handled(egc, dc, hupchk.revents, 0)) + return; + } libxl__ev_fd_deregister(gc, &dc->toread); break; } -- 1.7.10.4 _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx http://lists.xen.org/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |