Bug 202983 - ixv driver in 11.0-CURRENT(10.1 & 10.2 RELEASE) doesn't pass traffic using XEN hypervisor(AWS EC2)
Summary: ixv driver in 11.0-CURRENT(10.1 & 10.2 RELEASE) doesn't pass traffic using XE...
Status: Closed FIXED
Alias: None
Product: Base System
Classification: Unclassified
Component: kern (show other bugs)
Version: CURRENT
Hardware: amd64 Any
: --- Affects Some People
Assignee: freebsd-net (Nobody)
URL:
Keywords: IntelNetworking
Depends on:
Blocks:
 
Reported: 2015-09-09 06:06 UTC by Jarrod Petz
Modified: 2016-01-07 22:47 UTC (History)
5 users (show)

See Also:


Attachments
10.2 dmesg (7.23 KB, text/plain)
2015-09-09 06:06 UTC, Jarrod Petz
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Jarrod Petz 2015-09-09 06:06:20 UTC
Created attachment 160852 [details]
10.2 dmesg

Similar problem to Bug 202875
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=202875

Only the Hypervisor host is Xen+Qemu/EC2

I have ready access to AWS for testing of a guest ixv(Xen DomU) http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/enhanced-networking.html.

The driver detects the hardware correctly and sets everything up. But DHCP fails as there is no received packets(created guest packet capture in rc.d) from the the host Dom0. Unfortunately I can't check the host system with a packet capture as this is a IaaS platform. But on the upside you can be pretty much be guaranteed the host is setup fine for the guest as Linux/Windows work fine.

I have spoken with both Colin Percival, Jack Vogel(Intel), George Neville-Neil and Jeffrey Pieper about this previously but never filed a bug for it.
https://lists.freebsd.org/pipermail/freebsd-xen/2014-May/002114.html

The last I heard on this was that it wass suspected to be a Xen+Qemu issue since KVM was working. However it seems more likely to be a that different host drivers as incompatible with certain guest drivers(based on below) and the fact KVM also has a bug report now(Bug 202875)?

------------------------------
>         Since KVM works and Xen does not, this is basically a qemu issue. I'll
>         cut and paste my original findings:
>
>         All,
>
>         I was able to reproduce the failure Colin was seeing on SRIOV with Xen
>         and FreeBSD 9.2/ixgbe-2.5.15. I used SLES11-SP3 w/ Xen. I could not
>         acquire a dhcp license and nor could I  pass traffic through the VF with
>         a manually assigned IP address.  I tested this  with the driver both
>         compiled in and out of the kernel. The interesting point is that the
>         exact same configuration works on RHEL 6.5 KVM.
>
>         The good news, however is that FreeBSD 10.0/ixgbe-2.5.18 DOES work, with
>         the driver compiled both in and out of the kernel. I am able to acquire
>         a dhcp license at boot as well as pass traffic with a manually assigned
>         IP address. Another interesting data point is that with both
>         9.2/ixgbe-2.5.18 and 10.0/2.5.18 both running in VMs and each assigned a
>         VF belonging to the same PF, 10.0/ixgbe-2.5.18 did not work either.
>
>         I also attempted to use our out-of-tree ixgbe-2.5.18 on 9.2, but I get
>          SIOCGIF: ix0 no such device errors when attempting to acquire a dhcp
>         address or even assign an address manually.
>
>
>         Jeff
------------------------------
Comment 1 Jeff Pieper 2015-10-22 18:34:13 UTC
We are able to reproduce this, but we will need logs from the host during the time of the failure.
Comment 2 Jeff Pieper 2015-11-02 21:18:20 UTC
We are able to reproduce this with a KVM hypervisor. This occurs when the PF has MTU set to 9000 and the VF has MTU set to default (1500). We are investigating.
Comment 3 Jeff Pieper 2015-11-19 16:41:58 UTC
After some additional testing using KVM, I've found that if the MTU on the PF is set to 9000 BEFORE the VF is created, ixv behaves as expected. If the MTU is changed AFTER the VF is created and then attached to ixv, then the issue is reproducible. We are continuing to investigate.
Comment 4 Jarrod Petz 2015-11-24 22:22:40 UTC
Have had feedback from other engineers who confirmed this patch fixes the issue.
https://reviews.freebsd.org/D4186

However there was some small issues with it. As detailed below.

-------------------------------------------------------------------------------------
I applied the changes from https://reviews.freebsd.org/D4186 to 11.0-CURRENT (which among other things adds the missing VF-PF API renegotiation on the reset path) and saw packets arriving in the instance, but tagged with vlan 2048.

# tcpdump -i ixv0 -e -vvv
tcpdump: listening on ixv0, link-type EN10MB (Ethernet), capture size 262144 bytes
10:39:07.551985 12:8d:18:b1:e5:6b (oui Unknown) > 12:39:94:73:0b:1d (oui Unknown), ethertype 802.1Q (0x8100), length 60: vlan 2048, p 0, ethertype ARP, Ethernet (len 6), IPv4 (len 4), Request who-has ip-10-0-3-114.ec2.internal tell ip-10-0-3-1.ec2.internal, length 42
10:39:08.552133 12:8d:18:b1:e5:6b (oui Unknown) > 12:39:94:73:0b:1d (oui Unknown), ethertype 802.1Q (0x8100), length 60: vlan 2048, p 0, ethertype ARP, Ethernet (len 6), IPv4 (len 4), Request who-has ip-10-0-3-114.ec2.internal tell ip-10-0-3-1.ec2.internal, length 42

After creating a vlan0 interface with ID 2048 on top of ixv0, I saw traffic passing and DHCP worked.

# ifconfig vlan0 create
# ifconfig vlan0 vlan 2048 vlandev ixv0
# tcpdump -i vlan0 -vvv -s65534 -n
tcpdump: listening on vlan0, link-type EN10MB (Ethernet), capture size 65534 bytes
10:42:00.342629 IP (tos 0x10, ttl 128, id 0, offset 0, flags [none], proto UDP (17), length 328)
    0.0.0.0.68 > 255.255.255.255.67: [udp sum ok] BOOTP/DHCP, Request from 12:39:94:73:0b:1d, length 300, xid 0x5d968cbb, Flags [none] (0x0000)
          Client-Ethernet-Address 12:39:94:73:0b:1d
          Vendor-rfc1048 Extensions
            Magic Cookie 0x63825363
            DHCP-Message Option 53, length 1: Discover
            Client-ID Option 61, length 7: ether 12:39:94:73:0b:1d
            Hostname Option 12, length 13: "ip-10-0-0-203"
            Parameter-Request Option 55, length 9:
              Subnet-Mask, BR, Time-Zone, Classless-Static-Route
              Default-Gateway, Domain-Name, Domain-Name-Server, Hostname
              Option 119
            END Option 255, length 0
            PAD Option 0, length 0, occurs 21
10:42:00.342916 IP (tos 0x10, ttl 16, id 0, offset 0, flags [none], proto UDP (17), length 337)
    10.0.3.1.67 > 10.0.3.114.68: [udp sum ok] BOOTP/DHCP, Reply, length 309, xid 0x5d968cbb, Flags [none] (0x0000)
          Your-IP 10.0.3.114
          Client-Ethernet-Address 12:39:94:73:0b:1d
          Vendor-rfc1048 Extensions
            Magic Cookie 0x63825363
            DHCP-Message Option 53, length 1: Offer
            Server-ID Option 54, length 4: 10.0.3.1
            Lease-Time Option 51, length 4: 3600
            Subnet-Mask Option 1, length 4: 255.255.255.0
            BR Option 28, length 4: 10.0.3.255
            Default-Gateway Option 3, length 4: 10.0.3.1
            Domain-Name Option 15, length 12: "ec2.internal"
            Domain-Name-Server Option 6, length 4: 10.0.0.2
            Hostname Option 12, length 13: "ip-10-0-3-114"
            END Option 255, length 0
10:42:02.365085 IP (tos 0x10, ttl 128, id 0, offset 0, flags [none], proto UDP (17), length 328)
    0.0.0.0.68 > 255.255.255.255.67: [udp sum ok] BOOTP/DHCP, Request from 12:39:94:73:0b:1d, length 300, xid 0x5d968cbb, Flags [none] (0x0000)
          Client-Ethernet-Address 12:39:94:73:0b:1d
          Vendor-rfc1048 Extensions
            Magic Cookie 0x63825363
            DHCP-Message Option 53, length 1: Request
            Server-ID Option 54, length 4: 10.0.3.1
            Requested-IP Option 50, length 4: 10.0.3.114
            Client-ID Option 61, length 7: ether 12:39:94:73:0b:1d
            Hostname Option 12, length 13: "ip-10-0-0-203"
            Parameter-Request Option 55, length 9:
              Subnet-Mask, BR, Time-Zone, Classless-Static-Route
              Default-Gateway, Domain-Name, Domain-Name-Server, Hostname
              Option 119
            END Option 255, length 0
            PAD Option 0, length 0, occurs 9
10:42:02.365274 IP (tos 0x10, ttl 16, id 0, offset 0, flags [none], proto UDP (17), length 337)
    10.0.3.1.67 > 10.0.3.114.68: [udp sum ok] BOOTP/DHCP, Reply, length 309, xid 0x5d968cbb, Flags [none] (0x0000)
          Your-IP 10.0.3.114
          Client-Ethernet-Address 12:39:94:73:0b:1d
          Vendor-rfc1048 Extensions
            Magic Cookie 0x63825363
            DHCP-Message Option 53, length 1: ACK
            Server-ID Option 54, length 4: 10.0.3.1
            Lease-Time Option 51, length 4: 3600
            Subnet-Mask Option 1, length 4: 255.255.255.0
            BR Option 28, length 4: 10.0.3.255
            Default-Gateway Option 3, length 4: 10.0.3.1
            Domain-Name Option 15, length 12: "ec2.internal"
            Domain-Name-Server Option 6, length 4: 10.0.0.2
            Hostname Option 12, length 13: "ip-10-0-3-114"
            END Option 255, length 0
10:42:02.370732 ARP, Ethernet (len 6), IPv4 (len 4), Request who-has 10.0.3.114 tell 10.0.3.114, length 28
10:42:16.345260 ARP, Ethernet (len 6), IPv4 (len 4), Request who-has 10.0.3.114 tell 10.0.3.1, length 42
10:42:16.345280 ARP, Ethernet (len 6), IPv4 (len 4), Reply 10.0.3.114 is-at 12:39:94:73:0b:1d, length 28
^C

So I added the following patch to the VF driver in the instance to force the VF into stripping VLAN tags on RX and now the instance is able to acquire a DHCP lease and pass traffic on the interface.

diff --git a/dev/ixgbe/if_ixv.c b/dev/ixgbe/if_ixv.c
index bd06492..a90b4f2 100644
--- a/dev/ixgbe/if_ixv.c
+++ b/dev/ixgbe/if_ixv.c
@@ -1700,6 +1700,7 @@ ixv_initialize_receive_units(struct adapter *adapter)
                /* Do the queue enabling last */
                rxdctl = IXGBE_READ_REG(hw, IXGBE_VFRXDCTL(i));
                rxdctl |= IXGBE_RXDCTL_ENABLE;
+               rxdctl |= IXGBE_RXDCTL_VME;
                IXGBE_WRITE_REG(hw, IXGBE_VFRXDCTL(i), rxdctl);
                for (int k = 0; k < 10; k++) {
                        if (IXGBE_READ_REG(hw, IXGBE_VFRXDCTL(i)) &

All this with an unmodified host driver. The patch probably breaks VLANs inside the instance in some way.
-------------------------------------------------------------------------------------
Comment 5 Jarrod Petz 2016-01-05 21:59:21 UTC
I didn't see any action from other on this, so have submitted a diff for review.
https://reviews.freebsd.org/D4788

This allows DHCP to work and obtain a lease. Be mindful though that if you use this that you should ensure the MTU is set correctly for AWS instances. See my diff for details on why.
Comment 6 Jarrod Petz 2016-01-07 22:47:22 UTC
I am resolving this, the commits below have fixed FreeBSD CURRENT/11 and 10 will be fixed if these get MFC'ed.

https://reviews.freebsd.org/D4186
https://reviews.freebsd.org/rS292674

https://reviews.freebsd.org/D4788
https://reviews.freebsd.org/rS293338