[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [PATCH 04/11] qspinlock: Extract out the exchange of tail code word

To: Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx>
From: Waiman Long <waiman.long@xxxxxx>
Date: Wed, 18 Jun 2014 11:46:04 -0400
Cc: linux-arch@xxxxxxxxxxxxxxx, Peter Zijlstra <a.p.zijlstra@xxxxxxxxx>, kvm@xxxxxxxxxxxxxxx, oleg@xxxxxxxxxx, boris.ostrovsky@xxxxxxxxxx, scott.norton@xxxxxx, raghavendra.kt@xxxxxxxxxxxxxxxxxx, paolo.bonzini@xxxxxxxxx, linux-kernel@xxxxxxxxxxxxxxx, gleb@xxxxxxxxxx, virtualization@xxxxxxxxxxxxxxxxxxxxxxxxxx, Peter Zijlstra <peterz@xxxxxxxxxxxxx>, chegu_vinod@xxxxxx, david.vrabel@xxxxxxxxxx, xen-devel@xxxxxxxxxxxxxxxxxxxx, Paolo Bonzini <pbonzini@xxxxxxxxxx>, tglx@xxxxxxxxxxxxx, paulmck@xxxxxxxxxxxxxxxxxx, torvalds@xxxxxxxxxxxxxxxxxxxx, mingo@xxxxxxxxxx
Delivery-date: Wed, 18 Jun 2014 15:46:17 +0000
List-id: Xen developer discussion <xen-devel.lists.xen.org>

On 06/18/2014 09:50 AM, Konrad Rzeszutek Wilk wrote:

On Wed, Jun 18, 2014 at 01:37:45PM +0200, Paolo Bonzini wrote:

Il 17/06/2014 22:55, Konrad Rzeszutek Wilk ha scritto:

On Sun, Jun 15, 2014 at 02:47:01PM +0200, Peter Zijlstra wrote:

From: Waiman Long<Waiman.Long@xxxxxx>

This patch extracts the logic for the exchange of new and previous tail
code words into a new xchg_tail() function which can be optimized in a
later patch.

And also adds a third try on acquiring the lock. That I think should
be a seperate patch.

It doesn't really add a new try, the old code is:


-       for (;;) {
-               new = _Q_LOCKED_VAL;
-               if (val)
-                       new = tail | (val&  _Q_LOCKED_PENDING_MASK);
-
-               old = atomic_cmpxchg(&lock->val, val, new);
-               if (old == val)
-                       break;
-
-               val = old;
-       }

        /*
-        * we won the trylock; forget about queueing.
         */
-       if (new == _Q_LOCKED_VAL)
-               goto release;

The trylock happens if the "if (val)" hits the else branch.

What the patch does is change it from attempting two transition with a
single cmpxchg:

-        * 0,0,0 ->  0,0,1 ; trylock
-        * p,y,x ->  n,y,x ; prev = xchg(lock, node)

to first doing the trylock, then the xchg.  If the trylock passes and the
xchg returns prev=0,0,0, the next step of the algorithm goes to the
locked/uncontended state

+       /*
+        * claim the lock:
+        *
+        * n,0 ->  0,1 : lock, uncontended

Similar to your suggestion of patch 3, it's expected that the xchg will
*not* return prev=0,0,0 after a failed trylock.

I do like your explanation. I hope that Peter will put it in the
description as it explains the change quite well.

However, I *do* agree with you that it's simpler to just squash this patch
into 01/11.

Uh, did I say that? Oh I said why don't make it right the first time!

I meant in terms of seperating the slowpath (aka the bytelock on the pending
bit) from the queue (MCS code). Or renaming the function to be called
'complex' instead of 'slowpath' as it is getting quite hairy.

The #1 patch is nice by itself - as it lays out the foundation of the
MCS-similar code - and if Ingo decides he does not want this pending
byte-lock bit business - it can be easily reverted or dropped.

The pending bit code is needed for performance parity with ticketspinlock for light load. My own measurement indicates that the queuingoverhead will cause the queue spinlock to be slower than ticket spinlockwith 2-4 contending tasks. The pending bit solves the performanceproblem with 2 contending tasks, leave only the 3-4 tasks cases being abit slower than the ticket spinlock which should be more thancompensated by its superior performance with heavy contention andslightly better performance with no contention.


-Longman

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel

Follow-Ups:
- Re: [Xen-devel] [PATCH 04/11] qspinlock: Extract out the exchange of tail code word
  - From: Konrad Rzeszutek Wilk
- Re: [Xen-devel] [PATCH 04/11] qspinlock: Extract out the exchange of tail code word
  - From: Paolo Bonzini

References:
- [Xen-devel] [PATCH 00/11] qspinlock with paravirt support
  - From: Peter Zijlstra
- [Xen-devel] [PATCH 04/11] qspinlock: Extract out the exchange of tail code word
  - From: Peter Zijlstra
- Re: [Xen-devel] [PATCH 04/11] qspinlock: Extract out the exchange of tail code word
  - From: Konrad Rzeszutek Wilk
- Re: [Xen-devel] [PATCH 04/11] qspinlock: Extract out the exchange of tail code word
  - From: Paolo Bonzini
- Re: [Xen-devel] [PATCH 04/11] qspinlock: Extract out the exchange of tail code word
  - From: Konrad Rzeszutek Wilk

Prev by Date: Re: [Xen-devel] [PATCH v9 5/9] libxl/xl: push VCPU affinity pinning down to libxl
Next by Date: Re: [Xen-devel] Bug: USB Bridge Will Not PassThru
Previous by thread: Re: [Xen-devel] [PATCH 04/11] qspinlock: Extract out the exchange of tail code word
Next by thread: Re: [Xen-devel] [PATCH 04/11] qspinlock: Extract out the exchange of tail code word
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.