[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-users] 2.6.31 xenified kernel - not ready for production



On Mon, Nov 09, 2009 at 06:20:01PM +0000, Andrew Lyon wrote:
> On Sun, Nov 8, 2009 at 8:32 PM, Pasi Kärkkäinen <pasik@xxxxxx> wrote:
> > On Sun, Nov 08, 2009 at 08:40:43PM +0100, Peter Braun wrote:
> >> Hi,
> >>
> >> I just want to know if somebody use 2.6.31.4 xenified kernel (aka
> >> OpenSUSE) in production?
> >>
> >> We have been testing it on new Nehalem Xeon server for few weeks w/o
> >> any problem.
> >> But as soon we tried it on production machine - after several
> >> production domUs started - hard OS failure.
> >> We had to switch back to 2.6.18.8 - xen stock kernel.
> >>
> >
> > What kind of failure?
> >
> > I hope you have a serial console set up so you can capture the (possible)
> > BUG/OOPS/stacktrace or just the error messages..
> >
> > -- Pasi
> >
> >
> 
> Yes I've had problems with 2.6.31 crashing with null pointer
> dereference, I am going to install opensuse 11.2 next week and see if
> I can replicate the problem using the opensuse kernel so that I can
> get some help from Jan to fix it.
> 

Hmm.. dunno if this helps, it's from a user (from #xen) having problems with the
opensuse 2.6.31 patches:

http://hachi.kuiki.net/software_problems/20091110-xen-i7-panic/panic2.txt

[  257.758664] BUG: unable to handle kernel paging request at ffff88016f521000
[  257.766783] IP: [<ffffffff80391feb>] swiotlb_bounce+0x35/0x3a
[  257.773349] PGD 1fd4067 PUD 29da067 PMD 2b55067 PTE 0
[  257.779308] Thread overran stack, or stack corrupted
[  257.784979] Oops: 0002 [#1] SMP 
[  257.789046] last sysfs file: 
/sys/devices/xen-backend/vbd-1-2059/statistics/wr_sect
[  257.797632] CPU 0 
[  257.800027] Modules linked in: xt_tcpudp xt_physdev iptable_filter ip_tables 
x_tables bridge nls_utf8 nls_cp437 vfat fat 8021q garp stp bonding ipv6 lm85 
hwmon_vid i2c_amd756 i2c_i801 i2c_core pl2303 usbserial button pcspkr processor 
evdev ext3 jbd dm_mod raid456 raid6_pq async_xor async_memcpy async_tx xor 
raid1 raid0 md_mod sd_mod ata_generic sata_promise ata_piix libata uhci_hcd 
scsi_mod ide_pci_generic ide_core ehci_hcd e1000e thermal fan thermal_sys 
configfs e100 mii [last unloaded: scsi_wait_scan]
[  257.853871] Pid: 0, comm: swapper Not tainted 2.6.31.5 #2 X8ST3
[  257.860462] RIP: e030:[<ffffffff80391feb>]  [<ffffffff80391feb>] 
swiotlb_bounce+0x35/0x3a
[  257.869634] RSP: e02b:ffffc90000003d38  EFLAGS: 00010002
[  257.875562] RAX: 0000000000002000 RBX: 0000000000007748 RCX: 0000000000002000
[  257.883513] RDX: 0000000000002000 RSI: ffff88000e823000 RDI: ffff88016f521000
[  257.891459] RBP: 0000000000000002 R08: ffff88000e823000 R09: 000000016f521000
[  257.899379] R10: 000020208065dc20 R11: ffffc90000003e50 R12: 0000000000007748
[  257.907331] R13: 0000000000002000 R14: ffff8801718bd080 R15: 0000000000000050
[  257.915284] FS:  00007fb30d7106f0(0000) GS:ffffc90000000000(0000) 
knlGS:0000000000000000
[  257.924274] CS:  e033 DS: 0000 ES: 0000 CR0: 000000008005003b
[  257.930694] CR2: ffff88016f521000 CR3: 0000000170aa8000 CR4: 0000000000002660
[  257.938606] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  257.946541] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  257.954466] Process swapper (pid: 0, threadinfo ffffffff805de000, task 
ffffffff8062b490)
[  257.963447] Stack:
[  257.965740]  ffffffff802b0358 ffffffff80392066 ffffffff802b0358 
ffff8800f10220c0
[  257.974011] <0> 0000000000000001 0000000000000002 0000000000000002 
ffffffff80392197
[  257.982945] <0> ffff88016f2f80d0 ffff88016f2f8000 0000000000000002 
ffff8800f10220c0
[  257.997334] Call Trace:
[  258.000107]  <IRQ> 
[  258.002580]  [<ffffffff802b0358>] ? kfree+0x68/0x1a0
[  258.008121]  [<ffffffff80392066>] ? unmap_single+0x76/0x172
[  258.014315]  [<ffffffff802b0358>] ? kfree+0x68/0x1a0
[  258.019857]  [<ffffffff80392197>] ? swiotlb_unmap_sg_attrs+0x35/0x4e
[  258.026939]  [<ffffffffa00d3253>] ? ata_sg_clean+0x8a/0xa2 [libata]
[  258.033938]  [<ffffffffa00d32dd>] ? __ata_qc_complete+0x72/0xe1 [libata]
[  258.041406]  [<ffffffffa00e286f>] ? ata_sff_hsm_move+0x660/0x68b [libata]
[  258.048956]  [<ffffffffa004c7c1>] ? e1000_clean_tx_irq+0xc5/0x2d9 [e1000e]
[  258.056599]  [<ffffffffa00e2977>] ? ata_sff_host_intr+0xdd/0x128 [libata]
[  258.064161]  [<ffffffffa00e2af4>] ? ata_sff_interrupt+0x85/0xbf [libata]
[  258.071609]  [<ffffffff8026dc17>] ? handle_IRQ_event+0x74/0x147
[  258.078197]  [<ffffffff80238932>] ? __do_softirq+0x171/0x1b8
[  258.084498]  [<ffffffff8026f4d8>] ? handle_level_irq+0x9e/0x104
[  258.091081]  [<ffffffff8020b601>] ? handle_irq+0x17/0x1d
[  258.097052]  [<ffffffff804087f8>] ? evtchn_do_upcall+0x12d/0x1fd
[  258.103741]  [<ffffffff80209a4e>] ? do_hypervisor_callback+0x1e/0x30
[  258.110808]  <EOI> 
[  258.113281]  [<ffffffff8020bdef>] ? xen_safe_halt+0xa2/0xb7
[  258.119483]  [<ffffffff8020f2c5>] ? xen_idle+0x5e/0xbc
[  258.125213]  [<ffffffff802087fe>] ? cpu_idle+0x46/0x82
[  258.130932]  [<ffffffff8068324b>] ? start_kernel+0x37b/0x387
[  258.137262] Code: 89 f0 48 89 d0 75 13 48 be 00 00 00 00 00 88 ff ff 48 8d 
34 37 4c 89 c7 eb 0e 48 bf 00 00 00 00 00 88 ff ff 49 8d 3c 39 48 89 c1 <f3> a4 
41 58 c3 41 55 49 89 d5 41 54 55 89 cd 53 48 89 f3 48 83 
[  258.163334] RIP  [<ffffffff80391feb>] swiotlb_bounce+0x35/0x3a
[  258.169904]  RSP <ffffc90000003d38>
[  258.173813] CR2: ffff88016f521000
[  258.177542] ---[ end trace f10a55534d9fba8d ]---
[  258.182794] Kernel panic - not syncing: Fatal exception in interrupt
[  258.189843] Pid: 0, comm: swapper Tainted: G      D    2.6.31.5 #2
[  258.196716] Call Trace:
[  258.199489]  <IRQ>  [<ffffffff804d7710>] ? panic+0x86/0x14c
[  258.205811]  [<ffffffff8024afe0>] ? up+0xe/0x36
[  258.210876]  [<ffffffff802335d6>] ? release_console_sem+0x1e6/0x21b
[  258.217839]  [<ffffffff8020d251>] ? oops_end+0xbe/0xcb
[  258.223571]  [<ffffffff80217480>] ? no_context+0x1fc/0x20b
[  258.229693]  [<ffffffffa030596b>] ? br_handle_frame_finish+0x127/0x148 
[bridge]
[  258.237833]  [<ffffffff8021763f>] ? __bad_area_nosemaphore+0x1b0/0x1d4
[  258.245128]  [<ffffffffa0309a50>] ? br_nf_pre_routing_finish+0x0/0x2d2 
[bridge]
[  258.253257]  [<ffffffff8047b3f0>] ? nf_hook_slow+0x62/0xc3
[  258.259377]  [<ffffffffa0309a50>] ? br_nf_pre_routing_finish+0x0/0x2d2 
[bridge]
[  258.267512]  [<ffffffffa0304e93>] ? __br_forward+0x88/0x9d [bridge]
[  258.274506]  [<ffffffff8021786c>] ? do_page_fault+0xa2/0x27a
[  258.280810]  [<ffffffff804da4f8>] ? page_fault+0x28/0x30
[  258.286727]  [<ffffffff80391feb>] ? swiotlb_bounce+0x35/0x3a
[  258.293059]  [<ffffffff802b0358>] ? kfree+0x68/0x1a0
[  258.298637]  [<ffffffff80392066>] ? unmap_single+0x76/0x172
[  258.304856]  [<ffffffff802b0358>] ? kfree+0x68/0x1a0
[  258.310408]  [<ffffffff80392197>] ? swiotlb_unmap_sg_attrs+0x35/0x4e
[  258.317472]  [<ffffffffa00d3253>] ? ata_sg_clean+0x8a/0xa2 [libata]
[  258.324456]  [<ffffffffa00d32dd>] ? __ata_qc_complete+0x72/0xe1 [libata]
[  258.331948]  [<ffffffffa00e286f>] ? ata_sff_hsm_move+0x660/0x68b [libata]
[  258.339493]  [<ffffffffa004c7c1>] ? e1000_clean_tx_irq+0xc5/0x2d9 [e1000e]
[  258.347171]  [<ffffffffa00e2977>] ? ata_sff_host_intr+0xdd/0x128 [libata]
[  258.354732]  [<ffffffffa00e2af4>] ? ata_sff_interrupt+0x85/0xbf [libata]
[  258.362187]  [<ffffffff8026dc17>] ? handle_IRQ_event+0x74/0x147
[  258.368778]  [<ffffffff80238932>] ? __do_softirq+0x171/0x1b8
[  258.375069]  [<ffffffff8026f4d8>] ? handle_level_irq+0x9e/0x104
[  258.381658]  [<ffffffff8020b601>] ? handle_irq+0x17/0x1d
[  258.387582]  [<ffffffff804087f8>] ? evtchn_do_upcall+0x12d/0x1fd
[  258.394268]  [<ffffffff80209a4e>] ? do_hypervisor_callback+0x1e/0x30
[  258.401358]  <EOI>  [<ffffffff8020bdef>] ? xen_safe_halt+0xa2/0xb7
[  258.408316]  [<ffffffff8020f2c5>] ? xen_idle+0x5e/0xbc
[  258.414054]  [<ffffffff802087fe>] ? cpu_idle+0x46/0x82
[  258.419816]  [<ffffffff8068324b>] ? start_kernel+0x37b/0x387
[  258.426378] Rebooting in 60 seconds..  
s <SpaceBar> to update BIOS.


-- Pasi


_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.