
[Xen-API] unkillable deadbeef-dead-beef domain and xapi host self disabling


  • To: xen-api@xxxxxxxxxxxxxxxxxxx
  • From: George Shuklin <george.shuklin@xxxxxxxxx>
  • Date: Tue, 19 Apr 2011 20:28:56 +0400
  • Delivery-date: Tue, 19 Apr 2011 09:29:19 -0700
  • List-id: Discussion of API issues surrounding Xen <xen-api.lists.xensource.com>

Good day.

On XCP 0.5, I hit this bug on a host running PV-only VMs:

Xen suddenly became unable to kill a domain (I still don't know why). The
domain has status 'd' (dying), and it could not be killed even with
xc.domain_destroy() from a Python script.
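
For anyone who wants to check for the same condition, here is a minimal
sketch of such a script. It assumes the xen.lowlevel.xc binding, whose
domain_getinfo() returns one dict per domain with 'domid' and 'dying'
keys; the helper names are made up for illustration, and the script
needs to run in dom0 as root.

```python
import sys


def dying_domids(infos):
    """Return the domids of all entries whose 'dying' flag is set."""
    return [info["domid"] for info in infos if info.get("dying")]


def destroy_dying_domains(xc_handle):
    """Ask Xen to destroy every dying domain.

    Returns {domid: None} on success or {domid: error text} on failure.
    xc_handle is assumed to behave like xen.lowlevel.xc.xc().
    """
    results = {}
    for domid in dying_domids(xc_handle.domain_getinfo()):
        try:
            xc_handle.domain_destroy(domid)
            results[domid] = None
        except Exception:
            # Keep the error text so a stuck domain is visible in the result.
            results[domid] = str(sys.exc_info()[1])
    return results


if __name__ == "__main__":
    try:
        import xen.lowlevel.xc as xenctrl
    except ImportError:
        print("xen.lowlevel.xc is not available; run this in dom0")
    else:
        print(destroy_dying_domains(xenctrl.xc()))
```

A destroy call that returns without error does not prove the domain is
gone, so it is worth calling domain_getinfo() again afterwards and
checking whether the same domid is still listed as dying.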

When xapi starts, it checks for stray domains. If it finds any, it changes
their UUID to a deadbeef value and keeps trying to kill them until they die.

During this time the host sets its own state to 'disabled' in the xapi
database. If a domain is unkillable, the host stays disabled, which even
prevents VM migration off the host (because the host stays in emergency
mode until it is able to kill all stray domains).
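
To make the consequence concrete, here is a toy model of that startup
behaviour as I understand it. It is not xapi's actual code (xapi is
written in OCaml); the function and parameter names are hypothetical,
and the attempt limit exists only so the example terminates, whereas
xapi as described above retries without giving up.

```python
import time


def clear_stray_domains(list_strays, destroy, max_attempts=5, delay=0.0):
    """Try to kill stray domains; return True if the host may be re-enabled.

    list_strays: callable returning the domids of the remaining stray domains.
    destroy: callable that attempts to destroy one domid.
    """
    for _ in range(max_attempts):
        strays = list_strays()
        if not strays:
            return True  # nothing left to kill, host can be enabled
        for domid in strays:
            destroy(domid)
        time.sleep(delay)
    # Still enabled only if the last round of destroys actually worked.
    return not list_strays()


if __name__ == "__main__":
    killable = set([7])
    print(clear_stray_domains(lambda: sorted(killable), killable.discard))

    stuck = set([7])
    # destroy() has no effect here, like the domain stuck in state 'd'.
    print(clear_stray_domains(lambda: sorted(stuck), lambda domid: None))
```

With a single domain that never dies, the function never returns True,
which matches what I saw: the host stays disabled for as long as the
dying domain exists.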

I ran into this problem last week. I was able to fool xapi by editing the
database (creating a fake deadbeef VM with resident-on and power-state set
correctly), and was even able to migrate a few machines off the damaged
host. But the domains listed after the 'bad one' could not be migrated:
after the suspend, xapi was unable to destroy the domain and the migration
process hung...


The log is attached.

I think this is clearly a Xen bug, but I am unable to reproduce it, so I
cannot report it properly.

The other question is xapi's behavior with stray (or unkillable) domains. I
think we should allow some kind of 'emergency migration' procedure: we
allow migration and put the domain into an endless 'pause' state (we cannot
kill it, but we also cannot let it continue to run).

---
wBR, George.

Attachment: deadbeef-dead-beef-dead.txt
Description: Text document


 

