[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] rcu_sched self-detect stall when disable vif device

To: Wei Liu <wei.liu2@xxxxxxxxxx>
From: Julien Grall <julien.grall@xxxxxxxxxx>
Date: Wed, 28 Jan 2015 16:45:29 +0000
Cc: David Vrabel <david.vrabel@xxxxxxxxxx>, Ian Campbell <ian.campbell@xxxxxxxxxx>, xen-devel <xen-devel@xxxxxxxxxxxxx>
Delivery-date: Wed, 28 Jan 2015 16:46:09 +0000
List-id: Xen developer discussion <xen-devel.lists.xen.org>

On 27/01/15 16:53, Wei Liu wrote:
> On Tue, Jan 27, 2015 at 04:47:45PM +0000, Julien Grall wrote:
>> On 27/01/15 16:45, Wei Liu wrote:
>>> On Tue, Jan 27, 2015 at 04:03:52PM +0000, Julien Grall wrote:
>>>> Hi,
>>>>
>>>> While I'm working on support for 64K page in netfront, I got
>>>> an rcu_sced self-detect message. It happens when netback is
>>>> disabling the vif device due to an error.
>>>>
>>>> I'm using Linux 3.19-rc5 on seattle (ARM64). Any idea why
>>>> the processor is stucked in xenvif_rx_queue_purge?
>>>>
>>>
>>> When you try to release a SKB, core network driver need to enter some
>>> RCU cirital region to clean up. dst_release for one, calls call_rcu.
>>
>> But this message shouldn't happen in normal condition or because of
>> netfront. Right?
>>
> 
> Never saw  report like this before, even in the case that netfront is
> buggy.

This is only happening when preemption is not enabled (i.e
CONFIG_PREEMPT_NONE in the config file) in the backend kernel.

When the vif is disabled, the loop in xenvif_kthread_guest_rx turned
into an infinite loop. In my case, the code executed looks like:


 1. for (;;) {
 2.     xenvif_wait_for_rx_work(queue);
 3.
 4.     if (kthread_should_stop())
 5.         break;
 6.
 7.     if (unlikely(vif->disabled && queue->id == 0) {
 8.             xenvif_carrier_off(vif);
 9.             xenvif_rx_queue_purge(queue);
10.             continue;
11.     }
12. }

The wait on line 2 will return directly because the vif is disabled
(see xenvif_have_rx_work)

We are on queue 0, so the condition on line 7 is true. Therefore we will
loop on line 10. And so on...

On platform where preemption is not enabled, this thread will never
yield/give the hand to another thread (unless the domain is destroyed).

Regards,

-- 
Julien Grall

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel

Follow-Ups:
- Re: [Xen-devel] rcu_sched self-detect stall when disable vif device
  - From: David Vrabel

References:
- [Xen-devel] rcu_sched self-detect stall when disable vif device
  - From: Julien Grall
- Re: [Xen-devel] rcu_sched self-detect stall when disable vif device
  - From: Wei Liu
- Re: [Xen-devel] rcu_sched self-detect stall when disable vif device
  - From: Julien Grall
- Re: [Xen-devel] rcu_sched self-detect stall when disable vif device
  - From: Wei Liu

Prev by Date: Re: [Xen-devel] [PATCH v4 18/21] libxlu: rework internal representation of setting
Next by Date: Re: [Xen-devel] [PATCH v4 21/21] xl: vNUMA support
Previous by thread: Re: [Xen-devel] rcu_sched self-detect stall when disable vif device
Next by thread: Re: [Xen-devel] rcu_sched self-detect stall when disable vif device
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.