|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [Xen-devel] [PATCH v10 11/20] x86/VPMU: Interface for setting PMU mode and flags
>>> On 11.09.14 at 18:10, <boris.ostrovsky@xxxxxxxxxx> wrote:
> On 09/11/2014 10:59 AM, Jan Beulich wrote:
>>>>> On 11.09.14 at 16:12, <boris.ostrovsky@xxxxxxxxxx> wrote:
>>> On 09/11/2014 02:44 AM, Jan Beulich wrote:
>>>>>>> On 10.09.14 at 19:37, <boris.ostrovsky@xxxxxxxxxx> wrote:
>>>>> On 09/10/2014 11:05 AM, Jan Beulich wrote:
>>>>>>>>> On 04.09.14 at 05:41, <boris.ostrovsky@xxxxxxxxxx> wrote:
>>>>>>> + cont_wait:
>>>>>>> + /*
>>>>>>> + * Note that we may fail here if a CPU is hot-(un)plugged while we
>>>>>>> are
>>>>>>> + * waiting. We will then time out.
>>>>>>> + */
>>>>>>> + while ( atomic_read(&vpmu_sched_counter) != allbutself_num )
>>>>>>> + {
>>>>>>> + /* Give up after 5 seconds */
>>>>>>> + if ( NOW() > start + SECONDS(5) )
>>>>>>> + {
>>>>>>> + printk(XENLOG_WARNING
>>>>>>> + "vpmu_force_context_switch: failed to sync\n");
>>>>>>> + ret = -EBUSY;
>>>>>>> + break;
>>>>>>> + }
>>>>>>> + cpu_relax();
>>>>>>> + if ( hypercall_preempt_check() )
>>>>>>> + return hypercall_create_continuation(
>>>>>>> + __HYPERVISOR_xenpmu_op, "ih", XENPMU_mode_set, arg);
>>>>>>> + }
>>>>>> I wouldn't complain about this not being synchronized with CPU
>>>>>> hotplug if there wasn't this hypercall continuation and relatively
>>>>>> long timeout. Much of the state you latch in static variables will
>>>>>> cause this operation to time out if in between a CPU got brought
>>>>>> down.
>>>>> It seemed to me that if we were to correctly deal with CPU hotplug it
>>>>> would add a bit too much complexity to the code. So I felt that letting
>>>>> the operation timeout would be a better way out.
>>>> The please at least add a code comment making this explicit to
>>>> future readers.
>>> Is the comment above 'while' keyword not sufficient?
>> Oh, it is of course. Must have not scrolled back enough...
>>
>>>>>> And as already alluded to, all this looks rather fragile anyway,
>>>>>> even if I can't immediately spot any problems with it anymore.
>>>>> The continuation is really a carry-over from earlier patch version when
>>>>> I had double loops over domain and VCPUs to explicitly unload VPMUs. At
>>>>> that time Andrew pointed out that these loops may take really long time
>>>>> and so I added continuations.
>>>>>
>>>>> Now that I changed that after realizing that having each PCPU go through
>>>>> a context switch is sufficient perhaps I don't need it any longer. Is
>>>>> the worst case scenario of being stuck here for 5 seconds (chosen
>>>>> somewhat arbitrary) acceptable without continuation?
>>>> 5 seconds is _way_ too long for doing this without continuation.
>>> Then I am also adding back your other comment from this thread
>>>
>>> > > +long do_xenpmu_op(int op, XEN_GUEST_HANDLE_PARAM(xen_pmu_params_t)
>>> arg)
>>> > > +{
>>> > > + int ret = -EINVAL;
>>> > > + xen_pmu_params_t pmu_params;
>>> > > +
>>> > > + switch ( op )
>>> > > + {
>>> > > + case XENPMU_mode_set:
>>> > > + {
>>> > > + static DEFINE_SPINLOCK(xenpmu_mode_lock);
>>> > > + uint32_t current_mode;
>>> > > +
>>> > > + if ( !is_control_domain(current->domain) )
>>> > > + return -EPERM;
>>> > > +
>>> > > + if ( copy_from_guest(&pmu_params, arg, 1) )
>>> > > + return -EFAULT;
>>> > > +
>>> > > + if ( pmu_params.val & ~XENPMU_MODE_SELF )
>>> > > + return -EINVAL;
>>> > > +
>>> > > + /*
>>> > > + * Return error is someone else is in the middle of changing
> mode ---
>>> > > + * this is most likely indication of two system
>>> administrators
>>> > > + * working against each other
>>> > > + */
>>> > > + if ( !spin_trylock(&xenpmu_mode_lock) )
>>> > > + return -EAGAIN;
>>> >
>>> > So what happens if you can't take the lock in a continuation? If
>>> > returning -EAGAIN in that case is not a problem, what do you
>>> > need the continuation for in the first place?
>>>
>>> EAGAIN this case means that the caller was not able to initiate the
>>> operation. Continuation will allow the caller to finish operation in
>>> progress.
>> But that's only what you want, not what the code does. Also now
>> that I look again I don't think the comment really applies to this if().
>
> Oh, I see. Then both first and second will fail.
>
> I can make the second caller reset everything so that when continuation
> gets to run it will start anew. And if it (i.e. the first caller) did
> get -EAGAIN while trying to get the lock then it's just as well --- the
> state will be clean when user tries this again.
>
> As for the question why continuation is needed in the firs place ---
> it's to make sure this hypercall doesn't prevent other unrelated
> operations from executing. Not to manage simultaneous execution of this
> hypercall from multiple VCPUs (if this is what you were asking).
No, that's not what I was asking. The point I'm trying to make is - if
the caller is in need of dealing with -EAGAIN anyway (i.e. you
require it to retry), why can't you simply return -EAGAIN also for
the case where you currently use a continuation?
Jan
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel
|
![]() |
Lists.xenproject.org is hosted with RackSpace, monitoring our |