Re: [Xen-devel] bnx2x DMA mapping errors cause iscsi problems
Hi Malcolm,

Thank you for your answer! We have already tried tuning several parameters to find a workaround for our problem:

- swiotlb size: we did not increase the swiotlb size. We found a similar case on the Citrix forums ( http://discussions.citrix.com/topic/324343-xenserver-61-bnx2x-sw-iommu/ ) where swiotlb=256 did not help, so we did not try it ourselves. Unfortunately, that thread does not mention a solution, only a patch for the bnx2x driver (Driver Disk for Broadcom bnx2x driver v1.74.22 for XenServer 6.1.0 with Hotfix XS61E018); I still have to verify whether it is related to our problem. I should mention that we see no "Out of SW-IOMMU space" error messages, but that may be due to the verbosity of the driver or the kernel. (A sketch of how the swiotlb size could be raised is appended after this message.)

- disable_tpa=1: this is already the case, since we disabled LRO (correct?). Here is the output of ethtool (the module-parameter and runtime equivalents are sketched after this message):

    root@xen2-pyth:~# ethtool -k eth4
    Features for eth4:
    rx-checksumming: on
    tx-checksumming: on
            tx-checksum-ipv4: on
            tx-checksum-unneeded: off [fixed]
            tx-checksum-ip-generic: off [fixed]
            tx-checksum-ipv6: on
            tx-checksum-fcoe-crc: off [fixed]
            tx-checksum-sctp: off [fixed]
    scatter-gather: on
            tx-scatter-gather: on
            tx-scatter-gather-fraglist: off [fixed]
    tcp-segmentation-offload: on
            tx-tcp-segmentation: on
            tx-tcp-ecn-segmentation: on
            tx-tcp6-segmentation: on
    udp-fragmentation-offload: off [fixed]
    generic-segmentation-offload: on
    generic-receive-offload: on
    large-receive-offload: off
    rx-vlan-offload: on [fixed]
    tx-vlan-offload: on
    ntuple-filters: off [fixed]
    receive-hashing: on
    highdma: on [fixed]
    rx-vlan-filter: off [fixed]
    vlan-challenged: off [fixed]
    tx-lockless: off [fixed]
    netns-local: off [fixed]
    tx-gso-robust: off [fixed]
    tx-fcoe-segmentation: off [fixed]
    fcoe-mtu: off [fixed]
    tx-nocache-copy: on
    loopback: off

- reducing the queues: we reduced the number of queues to 4 (the default was 11). When the problem happened again this week, we changed the parameter dynamically to num_queues=1 and were then able to carry on without rebooting the hypervisor. No more "Can't map rx data" messages so far... but for how long? Could setting the number of queues as low as 1 have a long-term effect? (How we set this is sketched after this message.)

I have read the draft you wrote to solve the problem. As far as I understand it (this is very complex for me), it could be the root cause of our problem. But how can we monitor the relevant parameters (DMA mappings, SW-IOMMU space, ...) while the problem is occurring, so that we can validate this assumption? (A rough log-watching loop is appended after this message.) BTW, what is the time frame for implementing the solution proposed in your draft? We run Xen 4.1.4: are there improvements related to this problem in newer versions?

Regards,
Patrick
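P.S. Below are rough sketches of the knobs discussed above, in case they help someone else hitting the same issue. They are illustrative, not copied from our machines.

Raising the swiotlb size means adding swiotlb=<n> to the dom0 kernel line of the boot entry. In mainline Linux, <n> counts 2 KB slabs, so the default 32768 corresponds to 64 MB and 65536 to 128 MB (units may differ in other kernels, so the swiotlb=256 from the Citrix thread is not necessarily comparable). A minimal sketch, assuming GRUB2 and hypothetical image names:

    multiboot /boot/xen-4.1.4.gz dom0_mem=2048M
    module /boot/vmlinuz-3.2.0-4-amd64 root=/dev/sda1 ro swiotlb=65536
    module /boot/initrd.img-3.2.0-4-amd64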
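Disabling TPA can be done persistently via the bnx2x module parameter, or at runtime per interface by turning off LRO (the latter is what we did); both are standard bnx2x/ethtool knobs:

    # /etc/modprobe.d/bnx2x.conf -- persistent across module reloads
    options bnx2x disable_tpa=1

    # runtime, per interface
    ethtool -K eth4 lro off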
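The queue reduction looks like this; the modprobe line is the persistent form, while the runtime line assumes a driver recent enough to support the ethtool channels API (I am not sure ours is; otherwise the module has to be reloaded with the new parameter):

    # /etc/modprobe.d/bnx2x.conf -- takes effect on the next module load
    options bnx2x num_queues=1

    # runtime alternative on newer drivers (hypothetical for our version)
    ethtool -L eth4 combined 1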
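As for monitoring while the problem is occurring, the best we have come up with is watching the dom0 and hypervisor logs; the grep patterns below are guesses at the relevant messages, not a confirmed list:

    # dom0 kernel log: swiotlb exhaustion and driver mapping failures
    watch -n 5 'dmesg | grep -iE "swiotlb|SW-IOMMU|map (rx|tx) data" | tail -n 20'

    # hypervisor log
    xl dmesg | tail -n 20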
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel