[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] [PATCH v7 12/18] tools/libx{l, c}: add back channel to libxc
On 01/30/2016 12:38 AM, Konrad Rzeszutek Wilk wrote: > On Fri, Jan 29, 2016 at 01:27:28PM +0800, Wen Congyang wrote: >> In COLO mode, both VMs are running, and are considered in sync if the >> visible network traffic is identical. After some time, they fall out of >> sync. >> >> At this point, the two VMs have definitely diverged. Lets call the >> primary dirty bitmap set A, while the secondary dirty bitmap set B. >> >> Sets A and B are different. >> >> Under normal migration, the page data for set A will be sent from the >> primary to the secondary. >> >> However, the set difference B - A (the one in B but not in A, lets >> call this C) is out-of-date on the secondary (with respect to the >> primary) and will not be sent by the primary (to secondary), as it >> was not memory dirtied by the primary. The secondary needs C page data >> to reconstruct an exact copy of the primary at the checkpoint. >> >> The secondary cannot calculate C as it doesn't know A. Instead, the >> secondary must send B to the primary, at which point the primary >> calculates the union of A and B (lets call this D) which is all the >> pages dirtied by both the primary and the secondary, and sends all page >> data covered by D. >> >> In the general case, D is a superset of both A and B. Without the >> backchannel dirty bitmap, a COLO checkpoint can't reconstruct a valid >> copy of the primary. >> >> We transfer the dirty bitmap on libxc side, so we need to introduce back >> channel to libxc. >> >> Note: it is different from the paper. We change the original design to >> the current one, according to our following concerns: >> 1. The original design needs extra memory on Secondary host. When there's >> multiple backups on one host, the memory cost is high. >> 2. The memory cache code will be another 1k+, it will make the review >> more time consuming. >> >> Note: the back channel will be used in the patch >> libxc/restore: send dirty pfn list to primary when checkpoint under COLO >> to send dirty pfn list from secondary to primary. The patch is posted in >> another series. >> >> Signed-off-by: Yang Hongyang <hongyang.yang@xxxxxxxxxxxx> >> Signed-off-by: Andrew Cooper <andrew.cooper3@xxxxxxxxxx> >> CC: Ian Campbell <Ian.Campbell@xxxxxxxxxx> >> CC: Ian Jackson <Ian.Jackson@xxxxxxxxxxxxx> >> CC: Wei Liu <wei.liu2@xxxxxxxxxx> > > It is a bit confusing to have 'back_fd' and then 'send_fd'. > > Could you change the 'send_fd' (in this patch) to be called > 'send_back_fd' so that the connection between: > tools/libxl: Add back channel to allow migration target send data back > and this patch is clear? > > Or perhaps also add it in the commit description that you are using > the 'send_fd' provided by ' tools/libxl: Add back channel to allow migration > target send data back' Before this series: In libxl: we have send_fd/recv_fd(libxl_domain_remus_start()), and only have restore_fd(libxl_domain_create_restore()) In libxc: We have io_fd(xc_domain_save()/xc_domain_restore()) The fd in libxc is provided by libxl. I think after this series, we can add the following fd: 1. add a send_back_fd in libxl_domain_create_restore() 2. add a recv_fd in xc_domain_save() 3. add a send_back_fd in xc_domain_restore() What about this? Thanks Wen Congyang > > Otherwise: Reviewed-by: Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx> > > > . > _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx http://lists.xen.org/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |