
Re: [Xen-devel] [PATCH RFC v6 09/11] pvqspinlock, x86: Add qspinlock para-virtualization support



On 03/13/2014 09:57 AM, Paolo Bonzini wrote:
Il 13/03/2014 12:21, David Vrabel ha scritto:
On 12/03/14 18:54, Waiman Long wrote:
This patch adds para-virtualization support to the queue spinlock in
the same way as was done in the PV ticket lock code. In essence, the
lock waiters will spin for a specified number of times (QSPIN_THRESHOLD
= 2^14) and then halt themselves. The queue head waiter will spin
2*QSPIN_THRESHOLD times before halting itself. After it has spun
QSPIN_THRESHOLD times, the queue head will assume that the lock
holder may have been scheduled out and attempt to kick the lock
holder's CPU if it has the CPU number on hand.
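
A rough sketch of the spin-then-halt scheme described above, using names
that appear later in this thread (__queue_kick_cpu, __queue_hibernate,
qlock->lock); the loop itself and the PV_KICK_LOCK_HOLDER reason code are
illustrative stand-ins, not the patch's actual code:

    #define QSPIN_THRESHOLD     (1U << 14)

    /* Queue-head wait loop: spin, kick the suspected-preempted holder
     * after QSPIN_THRESHOLD iterations, halt after 2*QSPIN_THRESHOLD. */
    static void pv_head_wait(struct qspinlock *qlock, int holder_cpu)
    {
            unsigned int loops;

            for (loops = 0; loops < 2 * QSPIN_THRESHOLD; loops++) {
                    if (!ACCESS_ONCE(qlock->lock))
                            return;         /* lock was released */
                    cpu_relax();

                    /* Halfway point: the holder may have been scheduled
                     * out, so kick its CPU if we know which one it is. */
                    if (loops == QSPIN_THRESHOLD && holder_cpu >= 0)
                            __queue_kick_cpu(holder_cpu, PV_KICK_LOCK_HOLDER);
            }

            /* Still not free: halt and wait to be kicked. */
            __queue_hibernate();
    }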

I don't really understand the reasoning for kicking the lock holder.

I agree. If the lock holder isn't running, there's probably a good reason for that and going to sleep will not necessarily convince the scheduler to give more CPU to the lock holder. I think there are two choices:

1) use yield_to to donate part of the waiter's quantum to the lock holder? For this we probably need a new, separate hypercall interface. For KVM it would be the same as hlt in the guest but with an additional yield_to in the host.

2) do nothing, just go to sleep.

Could you get (or do you have) numbers for (2)?
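
To make the two choices concrete, here is a guest-side sketch; the
HYPERVISOR_yield_to() name is a made-up placeholder for the new hypercall
interface mentioned in (1), and __queue_hibernate() is the halt path that
appears later in this thread:

    /* Option 1: donate the rest of the waiter's quantum to the holder.
     * On KVM this would be like hlt in the guest plus an additional
     * yield_to(holder's vCPU) in the host. */
    static void pv_wait_yield(int holder_cpu)
    {
            HYPERVISOR_yield_to(holder_cpu);
    }

    /* Option 2: do nothing special, just go to sleep until kicked. */
    static void pv_wait_halt(void)
    {
            __queue_hibernate();
    }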

I will take out the lock holder kick portion from the patch. I will also try to collect more test data.


More important, I think a barrier is missing:

    Lock holder ---------------------------------------

    // queue_spin_unlock
    barrier();
    ACCESS_ONCE(qlock->lock) = 0;
    barrier();


This is not the unlock code that is used when PV spinlocks are enabled. The correct unlock code is:

        if (static_key_false(&paravirt_spinlocks_enabled)) {
                /*
                 * Need to atomically clear the lock byte to avoid racing
                 * with the queue head waiter trying to set
                 * _QSPINLOCK_LOCKED_SLOWPATH.
                 */
                if (likely(cmpxchg(&qlock->lock, _QSPINLOCK_LOCKED, 0)
                                == _QSPINLOCK_LOCKED))
                        return;
                else
                        queue_spin_unlock_slowpath(lock);

        } else {
                __queue_spin_unlock(lock);
        }

    // pv_kick_node:
    if (pv->cpustate != PV_CPU_HALTED)
        return;
    ACCESS_ONCE(pv->cpustate) = PV_CPU_KICKED;
    __queue_kick_cpu(pv->mycpu, PV_KICK_QUEUE_HEAD);

        Waiter -------------------------------------------

        // pv_head_spin_check
        ACCESS_ONCE(pv->cpustate) = PV_CPU_HALTED;
        lockval = cmpxchg(&qlock->lock,
                  _QSPINLOCK_LOCKED,
                  _QSPINLOCK_LOCKED_SLOWPATH);
        if (lockval == 0) {
            /*
             * Can exit now as the lock is free
             */
            ACCESS_ONCE(pv->cpustate) = PV_CPU_ACTIVE;
            *count = 0;
            return;
        }
        __queue_hibernate();

Nothing guarantees that the write to qlock->lock is ordered before the read of pv->cpustate, leading to this:

    Lock holder            Waiter
    ---------------------------------------------------------------
    read pv->cpustate
        (it is PV_CPU_ACTIVE)
                    pv->cpustate = PV_CPU_HALTED
                    lockval = cmpxchg(...)
                    hibernate()
    qlock->lock = 0
    if (pv->cpustate != PV_CPU_HALTED)
        return;


The lock holder will read cpustate only if the lock byte has been changed to _QSPINLOCK_LOCKED_SLOWPATH, so the setting of the lock byte synchronizes the two threads. The only case I am not certain about is when the waiter is trying to go to sleep while, at the same time, the lock holder is trying to kick it. Will there be a missed wakeup because of this timing issue?
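
For reference, the usual way to close that halt-versus-kick window is the
"announce, re-check, then block" idiom sketched below. This is a generic
sketch built from the names already quoted above, not the patch's actual
code, and the final PV_CPU_KICKED re-check is one possible fix among
several (another is having the halt hypercall return immediately when a
kick is already pending):

        /* Waiter: announce intent to halt, then re-check the lock.
         * The cmpxchg is a full barrier, so the PV_CPU_HALTED store
         * is globally visible before the lock byte is examined. */
        ACCESS_ONCE(pv->cpustate) = PV_CPU_HALTED;
        if (cmpxchg(&qlock->lock, _QSPINLOCK_LOCKED,
                    _QSPINLOCK_LOCKED_SLOWPATH) == 0) {
                ACCESS_ONCE(pv->cpustate) = PV_CPU_ACTIVE;
                return;                 /* lock became free, don't sleep */
        }
        /*
         * A kick can still arrive between the cmpxchg above and the
         * hibernate call below; making the kick sticky and re-checking
         * for it here keeps that from turning into a lost wakeup.
         */
        if (ACCESS_ONCE(pv->cpustate) != PV_CPU_KICKED)
                __queue_hibernate();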

-Longman


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel


 

