|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] [PATCH] tools/libxl: Fixes to stream v2 task joining logic
On 24/07/15 12:41, Ian Jackson wrote:
> Andrew Cooper writes ("[PATCH] tools/libxl: Fixes to stream v2 task joining
> logic"):
>> During review of the libxl migration v2 series, I changes the task
>> joining logic, but clearly didn't think the result through
>> properly. This patch fixes several errors.
> This would have been much easier to review if it had been split into 3
> patches. I have gone mostly by the commit message because it was hard
> to see hunk belonged to what.
I have split them up.
>
>> 3) Avoid stacking of check_all_finished() via synchronous teardown of
>> tasks. If the _abort() functions call back synchronously,
>> stream->completion_callback() ends up getting called twice, as first and
>> last check_all_finished() frames observe each task being finished.
> I think this part of the patch is fine.
>
>
>> 1) Do not call check_all_finished() in the success cases of
>> libxl__xc_domain_{save,restore}_done(). It serves no specific purpose
>> as the save helper state will report itself as inactive by this point,
>> and avoids triggering a second stream->completion_callback() in the case
>> that write_toolstack_record()/stream_continue() record errors
>> synchronously themselves.
> "Serves no specific purpose" other than having a single exit path,
> which makes matters much less confusing.
"Serves no specific purpose" in so far as what check_all_finished()
would do in the success case.
>
> I think the problem may be that libxl__xc_domain_{save,restore}_done
> fail to "return" after "write_toolstack_record" and "stream_continue".
> That seems like simply a bug. I'm sorry that I didn't notice it in
> review.
After some more thought, I don't believe that my fix is necessarily
correct. If a condition were to exist where the stream had recorded an
error and abort()'ed the save helper, but the save helper was already
exiting with a success condition, then the callback wouldn't be fired at
all. I have a proposed alternate solution.
>
> In general each callback function should set up exactly one other
> callback. If it does anything else then reentrancy hazards arise.
The entire point of this logic is that there are multiple operations
going on in parallel. It is not guaranteed that a save helper will ever
be spawned on the read side (although this would be a very useless
stream). The state of the libxl stream read/write object is
deliberately separate from the save helper.
>
> Also, it is confusing and perhaps wrong that write_toolstack_record
> calls stream_complete. What if there are other threads of control
> outstanding ?
This is exactly the problem which check_all_finished() is supposed to
solve, but currently doesn't.
>
>
>> 2) Only ever set stream->rc in stream_complete(). The first version of
>> the migration v2 series had separate rc and joined_rc parameters, where
>> this logic worked. However when combining the two, the teardown path
>> fails to trigger if stream_done() records stream->rc itself. A side
>> effect of this is that stream_done() needs to take an rc parameter.
> "the teardown path fails to trigger if stream_done() records
> stream->rc itself" but in the code I am looking at neither of the
> functions stream_done assign to rc.
I have no idea why I wrote what I did. The code was correct but the
description was wrong. I have fixed it in the split version of this patch.
~Andrew
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel
|
![]() |
Lists.xenproject.org is hosted with RackSpace, monitoring our |