[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] Questions about GPLPV stability tests



On Tue, Nov 29, 2011 at 9:12 AM, Andreas Kinzler <ml-xen-devel@xxxxxx> wrote:
> On 29.11.2011 16:48, Roderick Colenbrander wrote:
>>
>> Your tests are heavily stressing DomU. During any of your tests, have
>> you seen DomU crashing in such a nasty way that Dom0 went down? What
>> about in production? Our servers use similar software to you
>> (initially Xen 4.0.1, but now Xen 4.1.1 and a Linux 2.6.32-pvops
>> kernel), but a few percent of our servers go down in a very nasty way
>> every day. Dom0 becomes unresponsive, it feels it 'hung' and we have
>> to force reboot the boxes. Do issues like this sound familiar?
>
>
> Not in this year of my stability tests. In this year I am always
> experiencing crashes of domU only. dom0 was always stable.
>
> But last year, I hunted a very serious problem which causes nasty
> hangs/crashes in dom0 (which crashes domU as a consequence). See this
> mailing list post:
> http://lists.xen.org/archives/html/xen-devel/2010-09/msg00556.html
>
> In my tests it clearly shows that if you have a CPU without ARAT and you
> don't have the patch from my post, your Xen 4.0.1 or 4.1.1 will crash under
> load and/or after a while. What is your CPU?
>
> Regards Andreas

Most of our machines use i7 950 CPUs. They don't seem to have ARAT.
Some other machines use Xeon CPUs with ARAT support. We never had
issues on the Xeon systems, so we may actually be suffering from the
ARAT issue. Are you still using the patch you linked to in a
production environment? I wonder why a cleaned up patch like that
never made it into core.

I'm going to do some testing (may take a while).

Thanks,
Roderick

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.