
Re: [Xen-devel] [PATCH RFC V4 0/5] kvm : Paravirt-spinlock support for KVM guests

On 01/17/2012 05:29 AM, Jeremy Fitzhardinge wrote:
On 01/16/2012 07:55 PM, Avi Kivity wrote:
On 01/16/2012 08:40 AM, Jeremy Fitzhardinge wrote:
That means we spin for n cycles, then notify the spinlock holder that
we'd like to get kicked, and go to sleep. While I'm pretty sure that improves
the situation, it doesn't solve all of the issues we have.

Imagine we have an idle host. All vcpus can freely run and everyone can fetch 
the lock as fast as on real machines. We don't need to / want to go to sleep 
here. Locks that take too long are bugs that need to be solved on real hw just 
as well, so all we do is possibly incur overhead.
I'm not quite sure what your concern is.  The lock is under contention, so 
there's nothing to do except spin; all this patch adds is a variable 
decrement/test to the spin loop, but that's not going to waste any more CPU 
than the non-counting case.  And once it falls into the blocking path, it's a 
win because the VCPU isn't burning CPU any more.
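The spin-then-block slowpath described above can be sketched in user-space C. `SPIN_THRESHOLD` and `halt_until_kicked()` are illustrative stand-ins for the guest-side threshold and the halt/kick hypercall pair, not the actual patch's code:

```c
#include <stdatomic.h>

#define SPIN_THRESHOLD 1024  /* assumed tuning knob, not a fixed constant */

static atomic_flag lock_word = ATOMIC_FLAG_INIT;
static int halt_calls;       /* counts how often we would have blocked */

/* Stand-in for the hypercall that halts the vcpu until the lock
 * holder kicks it; here it is just a stub that counts invocations. */
static void halt_until_kicked(void)
{
    halt_calls++;
}

/* Spin for up to SPIN_THRESHOLD attempts; if the lock is still
 * contended, fall into the blocking path instead of burning CPU. */
static void spin_lock_slowpath(void)
{
    for (;;) {
        for (int n = 0; n < SPIN_THRESHOLD; n++)
            if (!atomic_flag_test_and_set_explicit(&lock_word,
                                                   memory_order_acquire))
                return;                 /* got the lock while spinning */
        halt_until_kicked();            /* woken by the releaser's kick */
    }
}
```

The counting adds only a decrement/test per spin iteration, which is the point made above: the fast path costs essentially nothing extra.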
The wakeup path is slower though.  The previous lock holder has to
hypercall, and the new lock holder has to be scheduled, and transition
from halted state to running (a vmentry).  So it's only a clear win if
we can do something with the cpu other than go into the idle loop.

Not burning power is a win too.

Actually what you want is something like "if you preempt a VCPU while
it's spinning in a lock, then push it into the slowpath and don't
reschedule it without a kick".  But I think that interface would have a
lot of fiddly corners.
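As a rough illustration of that interface, a hypothetical host-side preemption hook might look like the following; the `struct vcpu` fields and `on_preempt()` are invented for the sketch, not an existing KVM API:

```c
/* Hypothetical per-vcpu state visible to the host scheduler. */
struct vcpu {
    int in_spin_loop;   /* set by the guest while spinning on a lock */
    int in_slowpath;    /* once set, the vcpu waits for an explicit kick */
    int runnable;
};

/* On preemption, a spinning vcpu is pushed straight into the blocking
 * slowpath so the scheduler will not waste a slice on fruitless spinning;
 * it stays off the run queue until the lock holder kicks it. */
static void on_preempt(struct vcpu *v)
{
    if (v->in_spin_loop) {
        v->in_slowpath = 1;
        v->runnable = 0;
    }
}
```

The fiddly corners show up in the races this glosses over, e.g. the guest leaving the spin loop at the same moment the host inspects `in_spin_loop`.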

Yes, the wakeup path is a little slower, but better than burning CPU, no?

Suppose we have 16 vcpus:
vcpu 1 - lock holder (preempted).
vcpus 2-8 - in the slowpath.

If the scheduler schedules vcpu 1, that is most favourable for lock progress.
But if vcpus 9-16 or something else get scheduled, it's still a win (we are doing some useful work), though lock progress is
again a little slower.

The optimization areas of interest are perhaps:
(1) Suppose vcpu 1 is running and is about to release the lock, and the next
vcpu in the queue just goes to halt(). This pushes us to tune SPIN_THRESHOLD correctly, and to have a mechanism to determine whether the lock holder is running so we can keep spinning. Identifying whether the lock holder is running is the easier task and could be the next optimization step.
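A minimal sketch of point (1), assuming the host could export a per-vcpu `preempted` flag to the guest (the `struct vcpu_state` layout here is invented for illustration):

```c
#include <stdbool.h>

/* Hypothetical per-vcpu state the host could share with the guest,
 * e.g. via a page of shared memory updated by the host scheduler. */
struct vcpu_state {
    int preempted;  /* set by the host when the vcpu is descheduled */
};

/* Keep spinning only while the lock holder is actually running: once
 * the holder is preempted, spinning cannot make progress, so the waiter
 * should halt immediately instead of burning the full SPIN_THRESHOLD. */
static bool worth_spinning(const struct vcpu_state *holder)
{
    return !holder->preempted;
}
```

With such a check, a waiter whose holder is about to release the lock keeps spinning and avoids the needless halt()/kick round trip described above.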

(2) The much-discussed one: identifying lock-holder preemption and doing a directed yield to that vcpu.

But I am not sure how complicated the yield_to() implementation is once we have identified the exact preempted vcpu (the lock holder).
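A toy version of such a directed yield, with an invented run-queue representation; the real yield_to() lives in the host scheduler and is far more involved:

```c
#include <stddef.h>

/* Hypothetical run-queue entry; all fields are invented for the sketch. */
struct rq_vcpu {
    int id;
    int preempted;       /* set when the host descheduled this vcpu */
    unsigned long boost; /* slices donated to it by directed yields */
};

/* Directed yield: once the preempted lock holder is identified, donate
 * our time slice to it so it can release the lock sooner.  Returns the
 * vcpu we yielded to, or NULL if the holder is running (nothing to do). */
static struct rq_vcpu *yield_to_holder(struct rq_vcpu *rq, int n,
                                       int holder_id)
{
    for (int i = 0; i < n; i++) {
        if (rq[i].id == holder_id && rq[i].preempted) {
            rq[i].boost++;  /* a real scheduler would reschedule it here */
            return &rq[i];
        }
    }
    return NULL;
}
```

The hard part this hides is the first half of the problem, identifying which vcpu actually holds the lock; the yield itself is only useful once that is known.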


Xen-devel mailing list