
RE: [Xen-API] [RFC][PATCH] Deadlock in RRD handling during master transition



Hi Ted,

Thanks for diagnosing this problem and sending the patch.

I think this might be the same deadlock that is fixed by:

http://xenbits.xensource.com/XCP/xen-api.hg?rev/793489e20822;style=rev%3DCA

Although I think the problem is now fixed in trunk, it seems to me that this 
problem might recur: it's very easy to forget you have the RRD mutex held and 
accidentally query the master's db. So I think it's probably best to adopt a 
'belt-and-braces' approach and take your patch too.

Thanks,
Dave

> -----Original Message-----
> From: xen-api-bounces@xxxxxxxxxxxxxxxxxxx
> [mailto:xen-api-bounces@xxxxxxxxxxxxxxxxxxx] On Behalf Of Ted Lin
> Sent: 09 August 2010 15:52
> To: xen-api@xxxxxxxxxxxxxxxxxxx
> Subject: [Xen-API] [RFC][PATCH] Deadlock in RRD handling during master
> transition
> 
> The codebase that this XAPI patch is based on is a bit old (5.5.0;
> George), but I think some form of it still exists in the latest
> codebase.
> 
> What we're seeing is that if the master host goes away during slave RRD
> processing, it will prevent the slave XAPI from transitioning to master.
> The patch adds a try-lock in the RRD backup so if the lock is still
> held, the backup is skipped.
> 
> 
> I'm wondering if this is a safe change (at least for 5.5.0), and if
> anyone has ever run into this before?  (An unreliable way of generating
> the situation is to pull the management link during a migration.)
> 
> 
> 
> Thanks for the help,
> 
> --
> Ted M Lin
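
For readers following the thread, the try-lock idea described in the quoted message can be sketched roughly as below. This is only an illustration in Python, not the actual patch (xapi itself is written in OCaml); the names rrd_lock and backup_rrds are hypothetical and do not come from the xapi source. The point is that the backup path never blocks on the RRD mutex: if another thread still holds it, the backup is skipped rather than waiting.

```python
import threading

# Hypothetical stand-in for the RRD mutex discussed in the thread.
rrd_lock = threading.Lock()


def backup_rrds(do_backup):
    """Run do_backup() only if the RRD lock is free.

    Returns True if the backup ran, False if it was skipped because
    the lock was already held by someone else.
    """
    # Non-blocking acquire: returns immediately instead of waiting.
    if not rrd_lock.acquire(blocking=False):
        return False  # lock is busy, so skip this backup
    try:
        do_backup()
        return True
    finally:
        # Always release, even if do_backup() raises.
        rrd_lock.release()
```

The trade-off in a scheme like this is that a backup may occasionally be missed, in exchange for the caller never getting stuck behind a thread that holds the lock while waiting on an unreachable master.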

