[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Xen-users] lots of cycles in i/o wait state

To: "xen-users@xxxxxxxxxxxxxxxxxxx" <xen-users@xxxxxxxxxxxxxxxxxxx>
From: Miles Fidelman <mfidelman@xxxxxxxxxxxxxxxx>
Date: Sat, 05 Jun 2010 18:59:51 -0400
Delivery-date: Sat, 05 Jun 2010 16:01:23 -0700
List-id: Xen user discussion <xen-users.lists.xensource.com>

Hi Folks,

I've been doing some experimenting to see how far I can push some oldhardware into a virtualized environment - partially to see how much useI can get out of the hardware, and partially to learn more about thebehavior of, and interactions between, software RAID, LVM, DRBD, and Xen.


Basic configuration:

- two machines, 4 disk drives each, two 1G ethernet ports (1 each to theoutside world, 1 each as a cross-connect)

- each machine runs Xen 3 on top of Debian Lenny (the basic install)

- very basic Dom0s - just running the hypervisor and i/o (including diskmanagement)

---- software RAID6 (md)
---- LVM
---- DRBD
---- heartbeat to provide some failure migration

- dom0, on each machine, runs directly on md RAID volumes (RAID1 forboot, RAID6 for root and swap)

- each Xen VM uses 2 DRBD volumes - one for root, one for swap
- one of the VMs has a third volume, used for backup copies of files

One domU, on one machine, runs a medium volume mail/list server. Thisused to run non-virtualized on one of the machines, and I moved it intoa domU. Before virtualization, everything just hummed along (98% idletime as reported by top). Virtualized, the machine is mostly idle, butnow top reports a lot of i/o wait time, usually in the 20-25% range).

As I've started experimenting with adding additional domUs, in variousconfigurations, I've found that my mail server can get into a statewhere it's spending almost all of its cycles in an i/o wait state (95%and higher as reported by top). This is particularly noticeable when Irun a backup job (essentially a large tar job that reads from the rootvolume and writes to the backup volume). The domU grinds to halt.


So I've been trying to track down the bottlenecks.

At first, I thought this was probably a function of pushing my diskstack beyond reasonable limits - what with multiple domUs on top of DRBDvolumes, on top of LVM volumes, on top of software RAID6 (md). Ifigured I was seeing a lot of disk churning.

But... after running some disk benchmarks, what I'm seeing is somethingelse:


- I took one machine, turned off all the domUs, and turned off DRBD

- I ran a disk benchmark (bonnie++) on dom0, which reported 50MB/sec to90MB/sec of throughput depending on the test (not exactly sure what thismeans, but it's a baseline)- I then brought up DRBD and various combinations of domUs, and ran thebenchmark in various places- the most interesting result, running in the same domU as the mailserver: 34M-60M depending on the test (not much degredation from runningdirectly on the RAID volume- but.... while running, the benchmark, the baseline i/o wait percentagejumps from 25% to the 70-90% range

So... the question becomes, if it's not disk churning, what's causingall those i/o wait cycles? I'm starting to think it might involvebuffering or other interactions in the hypervisor.

Any thoughts or suggestions regarding diagnostics and/or tuning? (Otherthan "throw hardware at it" of course :-).


Thanks very much,

Miles Fidelman





_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users

Follow-Ups:
- Re: [Xen-users] lots of cycles in i/o wait state
  - From: Pasi Kärkkäinen
- Re: [Xen-users] lots of cycles in i/o wait state
  - From: Florian Manschwetus

Prev by Date: [Xen-users] How many guests
Next by Date: Re: [Xen-users] lots of cycles in i/o wait state
Previous by thread: [Xen-users] How many guests
Next by thread: Re: [Xen-users] lots of cycles in i/o wait state
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.