|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: Network driver domain broken
On Fri, Mar 04, 2022 at 01:05:55PM +0100, Andrea Stevanato wrote:
> On 3/4/2022 12:52 PM, Roger Pau Monné wrote:
> > On Thu, Mar 03, 2022 at 01:08:31PM -0500, Jason Andryuk wrote:
> > > On Thu, Mar 3, 2022 at 11:34 AM Roger Pau Monné <roger.pau@xxxxxxxxxx>
> > > wrote:
> > > >
> > > > On Thu, Mar 03, 2022 at 05:01:23PM +0100, Andrea Stevanato wrote:
> > > > > On 03/03/2022 15:54, Andrea Stevanato wrote:
> > > > > > Hi all,
> > > > > >
> > > > > > according to the conversation that I had with royger, aa67b97ed34
> > > > > > broke the driver domain support.
> > > > > >
> > > > > > What I'm trying to do is to setup networking between guests using
> > > > > > driver domain. Therefore, the guest (driver) has been started with
> > > > > > the following cfg.
> > > > > >
> > > > > > name = "guest0"
> > > > > > kernel = "/media/sd-mmcblk0p1/Image"
> > > > > > ramdisk = "/media/sd-mmcblk0p1/rootfs.cpio.gz"
> > > > > > extra = "console=hvc0 rdinit=/sbin/init root=/dev/ram0"
> > > > > > memory = 1024 vcpus = 2
> > > > > > driver_domain = 1
> > > > > >
> > > > > > On guest0 I created the bridge, assigned a static IP and started
> > > > > > the udhcpd on xenbr0 interface.
> > > > > > While the second guest has been started with the following cfg:
> > > > > >
> > > > > > name = "guest1"
> > > > > > kernel = "/media/sd-mmcblk0p1/Image"
> > > > > > ramdisk = "/media/sd-mmcblk0p1/rootfs.cpio.gz"
> > > > > > extra = "console=hvc0 rdinit=/sbin/init root=/dev/ram0"
> > > > > > memory = 1024 vcpus = 2
> > > > > > vcpus = 2
> > > > > > vif = [ 'bridge=xenbr0, backend=guest0' ]
> > > > > >
> > > > > > Follows the result of strace xl devd:
> > > > > >
> > > > > > # strace xl devd
> > > > > > execve("/usr/sbin/xl", ["xl", "devd"], 0xffffdf0420c8 /* 13 vars
> > > > > > */) = 0
> > >
> > > > > > ioctl(5, _IOC(_IOC_NONE, 0x50, 0, 0x30), 0xffffe6e41b40) = -1 EPERM
> > > > > > (Operation not permitted)
> > > > > > write(2, "libxl: ", 7libxl: ) = 7
> > > > > > write(2, "error: ", 7error: ) = 7
> > > > > > write(2, "libxl_utils.c:820:libxl_cpu_bitm"...,
> > > > > > 87libxl_utils.c:820:libxl_cpu_bitmap_alloc: failed to retrieve the
> > > > > > maximum number of cpus) = 87
> > > > > > write(2, "\n", 1
> > > > > > ) = 1
> > > > > > clone(child_stack=NULL,
> > > > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD,
> > > > > > child_tidptr=0xffff9ee7a0e0) = 814
> > > > > > wait4(814, [{WIFEXITED(s) && WEXITSTATUS(s) == 0}], 0, NULL) = 814
> > > > > > --- SIGCHLD {si_signo=SIGCHLD, si_code=CLD_EXITED, si_pid=814,
> > > > > > si_uid=0, si_status=0, si_utime=2, si_stime=2} ---
> > >
> > > xl devd is daemonizing, but strace is only following the first
> > > process. Use `strace xl devd -F` to prevent the daemonizing (or
> > > `strace -f xl devd` to follow children).
> >
> > Or as a first step try to see what kind of messages you get from `xl
> > devd -F` when trying to attach a device using the driver domain.
>
> Nothing has changed. On guest0 (the driver domain):
>
> # xl devd -F
> libxl: error: libxl_utils.c:820:libxl_cpu_bitmap_alloc: failed to retrieve
> the maximum number of cpus
> libxl: error: libxl_utils.c:820:libxl_cpu_bitmap_alloc: failed to retrieve
> the maximum number of cpus
> libxl: error: libxl_utils.c:820:libxl_cpu_bitmap_alloc: failed to retrieve
> the maximum number of cpus
> [ 696.805619] xenbr0: port 1(vif2.0) entered blocking state
> [ 696.810334] xenbr0: port 1(vif2.0) entered disabled state
> [ 696.824518] device vif2.0 entered promiscuous mode
Can you use `xl -vvv devd -F` here?
I assume the process doesn't die unexpectedly?
Thanks, Roger.
|
![]() |
Lists.xenproject.org is hosted with RackSpace, monitoring our |