
Re: [Xen-devel] [PATCH 0/2] xen: credit2: fix vcpu starvation due to too few credits

On Thu, 2020-03-12 at 16:27 +0000, Andrew Cooper wrote:
> On 12/03/2020 15:51, Jürgen Groß wrote:
> > - Could it be the vcpu is busy for very long time in the hypervisor?
> >   So either fighting with another vcpu for a lock, doing a long
> >   running hypercall, ...
> Using watchdog=2 might catch that.  (There is a counting issue which
> I've not had time to fix yet, which makes the watchdog more fragile
> with a smaller timeout, but 2 should be ok.)
Done (actually, I used "watchdog=1 watchdog_timeout=2"), both with and
without these patches applied.

I don't see any watchdog firing.

While I was there, I tried 1 too. Nothing either.

> > - The timer used is not reliable.
> > 
> > - The time base is not reliable (tsc or whatever is used for getting
> >   the time has jumped 2 seconds into the future).
> Worth instrumenting the TSC rendezvous for unexpectedly large jumps?
Yes, we should look into that.
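
As a rough illustration of what such instrumentation could check, here
is a minimal standalone sketch that remembers the previous timestamp and
flags any step that goes backwards or exceeds a threshold. It is not Xen
code: struct jump_detector, jump_detector_init() and
jump_detector_sample() are hypothetical names, and the threshold is an
assumption the caller would have to pick.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical helper (not part of Xen): remembers the previous
 * timestamp and flags steps that are backwards or larger than
 * max_step_ns.
 */
struct jump_detector {
    uint64_t last_ns;      /* previous sample, valid only once primed */
    uint64_t max_step_ns;  /* largest forward step considered normal */
    bool primed;           /* false until the first sample is recorded */
};

static void jump_detector_init(struct jump_detector *d, uint64_t max_step_ns)
{
    assert(d != NULL);
    d->last_ns = 0;
    d->max_step_ns = max_step_ns;
    d->primed = false;
}

/*
 * Record a new sample. Returns true if the step from the previous
 * sample is suspicious. If delta_ns is non-NULL it receives the signed
 * step (0 for the very first sample).
 */
static bool jump_detector_sample(struct jump_detector *d, uint64_t now_ns,
                                 int64_t *delta_ns)
{
    bool suspicious = false;
    int64_t delta = 0;

    assert(d != NULL);

    if ( d->primed )
    {
        delta = (int64_t)(now_ns - d->last_ns);
        suspicious = delta < 0 || (uint64_t)delta > d->max_step_ns;
    }

    d->last_ns = now_ns;
    d->primed = true;

    if ( delta_ns != NULL )
        *delta_ns = delta;

    return suspicious;
}
```

With a threshold of one second, a 2 second forward jump like the one
hypothesised above would be reported, together with its size, on the
first sample taken after the jump.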

> > - System management mode has kicked in.
> There are non-architectural SMI_COUNT MSRs (0x34 on Intel, can't
> remember AMD off the top of my head) which can be used to see if any
> have occurred, and this has proved useful in the past for debugging.
I'll have to look up how to check and dump these from where I need
them. Let's see if I can do that... Hints, code pointers, or anything
else you may have handy would be much appreciated. :-)
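
For reference, one possible way to sample the counter from userspace is
sketched below. It assumes a Linux system with the msr driver loaded, so
that /dev/cpu/<N>/msr exists, and an Intel CPU where SMI_COUNT is MSR
0x34 as mentioned above. The helper names (read_msr(), smi_delta(),
open_msr_dev()) are made up for this example. Note that when running
under Xen, dom0's MSR reads are handled by the hypervisor, so this may
not return the hardware value; a check inside Xen would need the
hypervisor's own MSR accessors instead.

```c
#define _XOPEN_SOURCE 700  /* for pread() */
#include <assert.h>
#include <fcntl.h>
#include <stdint.h>
#include <stdio.h>
#include <unistd.h>

/* Intel, non-architectural; the AMD equivalent is not covered here. */
#define MSR_SMI_COUNT 0x34u

/*
 * Read one 64-bit MSR through an already open msr device fd. The msr
 * driver uses the file offset as the MSR index.
 * Returns 0 on success, -1 on failure.
 */
static int read_msr(int fd, uint32_t msr, uint64_t *val)
{
    ssize_t n;

    assert(val != NULL);
    n = pread(fd, val, sizeof(*val), (off_t)msr);

    return n == (ssize_t)sizeof(*val) ? 0 : -1;
}

/*
 * Number of SMIs between two samples. Only the low 32 bits hold the
 * count, so compute the difference in 32-bit arithmetic, which also
 * copes with a single wrap-around.
 */
static uint32_t smi_delta(uint64_t before, uint64_t after)
{
    return (uint32_t)after - (uint32_t)before;
}

/* Open /dev/cpu/<cpu>/msr read-only; returns an fd, or -1 on error. */
static int open_msr_dev(unsigned int cpu)
{
    char path[64];

    snprintf(path, sizeof(path), "/dev/cpu/%u/msr", cpu);

    return open(path, O_RDONLY);
}
```

Sampling the counter on each CPU before and after reproducing the
problem, and passing the two values to smi_delta(), would show whether
any SMIs happened in between.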

Thanks and Regards
Dario Faggioli, Ph.D
Virtualization Software Engineer
SUSE Labs, SUSE https://www.suse.com/
<<This happens because _I_ choose it to happen!>> (Raistlin Majere)
