Bug 44355

Summary: After deletion of an IPv6 alias, the route to the whole subnet is removed too.
Product: Base System Reporter: Dmitry Y. Bordakov <dmitry_bordakov>
Component: kernAssignee: George V. Neville-Neil <gnn>
Status: Closed FIXED    
Severity: Affects Only Me    
Priority: Normal    
Version: 4.4-RELEASE   
Hardware: Any   
OS: Any   
Attachments:
Description Flags
in6.c.diff
none
in6_ifattach.c.diff none

Description Dmitry Y. Bordakov 2002-10-21 20:10:01 UTC
You configured several IPv6 aliases in one subnet (with the same prefix) on one network interface, for example, 3ffe::1, 3ffe::2, 3ffe::3, etc, all on fxp0.
During the first alias configuration, the network stack automatically configured an appropriate route to the network, in our example, 3ffe::/64 -> link#1 .
Deletion of one of the aliases causes deletion of the route and, therefore, unreachability of the remote addresses in the same subnet, for example, ping6 3ffe::123 returns "No route to host".

How-To-Repeat: ifconfig lo0 inet6 add 3ffe::1
ifconfig lo0 inet6 add 3ffe::2
netstat -nr | grep lo0
                          ...
                          3ffe::/64       fe80::1%lo0   Uc    lo0
                          3ffe::1         link#2        UHL   lo0
                          3ffe::2         link#2        UHL   lo0
                          ...
ifconfig lo0 inet6 delete 3ffe::1
netstat -nr | grep lo0
                          ...
                          3ffe::2         link#2        UHL   lo0
                          ...
Comment 1 Peter Pentchev 2002-10-21 20:13:37 UTC
On Mon, Oct 21, 2002 at 12:01:24PM -0700, Dmitry Y. Bordakov wrote:
> 
> >Number:         44355
> >Category:       kern
> >Synopsis:       After deletion of an IPv6 alias, the route to the whole subnet is removed too.
> >Originator:     Dmitry Y. Bordakov
> >Release:        4.4-RELEASE
> >Organization:
> Agilent Technologies, Inc.
> >Environment:
> FreeBSD qar1 4.4-RELEASE FreeBSD 4.4-RELEASE #14: Fri Oct 11 19:55:37 GMT 2002     root@qar1:/usr/src/sys/compile/QAR44GENERIC.znb  i386
> 
> >Description:
> You configured several IPv6 aliases in one subnet (with the same prefix) on one network interface, for example, 3ffe::1, 3ffe::2, 3ffe::3, etc, all on fxp0.
> During the first alias configuration, the network stack automatically configured an appropriate route to the network, in our example, 3ffe::/64 -> link#1 .
> Deletion of one of the aliases causes deletion of the route and, therefore, unreachability of the remote addresses in the same subnet, for example, ping6 3ffe::123 returns "No route to host".
> 
> >How-To-Repeat:
> ifconfig lo0 inet6 add 3ffe::1
> ifconfig lo0 inet6 add 3ffe::2
> netstat -nr | grep lo0
>                           ...
>                           3ffe::/64       fe80::1%lo0   Uc    lo0
>                           3ffe::1         link#2        UHL   lo0
>                           3ffe::2         link#2        UHL   lo0
>                           ...
> ifconfig lo0 inet6 delete 3ffe::1
> netstat -nr | grep lo0
>                           ...
>                           3ffe::2         link#2        UHL   lo0
>                           ...

Is there a reason for you not to set a prefixlen 128 on each alias, just
as you are supposed to set a netmask of 0xffffffff on each IPv4 alias in
the same subnet as another address on the interface?

Could you try setting a prefixlen 128 on all aliases except for the
"primary" address (yeah, yeah, I know.. but it helps to talk of a
"primary" address sometimes), and check if the problem persists?

G'luck,
Peter

-- 
Peter Pentchev	roam@ringlet.net	roam@FreeBSD.org
PGP key:	http://people.FreeBSD.org/~roam/roam.key.asc
Key fingerprint	FDBA FD79 C26F 3C51 C95E  DF9E ED18 B68D 1619 4553
I am not the subject of this sentence.
Comment 2 Peter Pentchev 2002-10-22 06:19:54 UTC
Adding the originator's response to the PR audit trail..

----- Forwarded message from dmitry_bordakov@agilent.com -----

From: dmitry_bordakov@agilent.com
To: roam@ringlet.net
Subject: RE: kern/44355: After deletion of an IPv6 alias, the route to the
	 whole subnet is removed too.
Date: Mon, 21 Oct 2002 14:38:30 -0600

Hello,


> Is there a reason for you not to set a prefixlen 128 on each alias, just
> as you are supposed to set a netmask of 0xffffffff on each IPv4 alias in
> the same subnet as another address on the interface?

Mmmm... I see no general reasons for this. It came from some non-standard usage of the network stack.

> Could you try setting a prefixlen 128 on all aliases except for the
> "primary" address (yeah, yeah, I know.. but it helps to talk of a
> "primary" address sometimes), and check if the problem persists?

No, there is no problem in this case (until you delete the "primary" address").

IPv4 stack also requires the same workaround (primary address + 32bit-masked aliases).
Is it by design?


Thank you,
Dmitry Bordakov.
Comment 3 Tilman Keskinoz freebsd_committer freebsd_triage 2004-08-27 13:38:53 UTC
Responsible Changed
From-To: freebsd-bugs->freebsd-net

Old patches against IPv6, over to freebsd-net to decide if this PR is still  
relevant
Comment 4 Jinmei Tatuya 2004-08-29 14:02:07 UTC
>>>>> On Fri, 27 Aug 2004 12:39:39 GMT, 
>>>>> Tilman Linneweh <arved@FreeBSD.org> said:

> Synopsis: After deletion of an IPv6 alias, the route to the whole subnet is removed too.
> Responsible-Changed-From-To: freebsd-bugs->freebsd-net
> Responsible-Changed-By: arved
> Responsible-Changed-When: Fri Aug 27 12:38:53 GMT 2004
> Responsible-Changed-Why: 
> Old patches against IPv6, over to freebsd-net to decide if this PR is still 
> relevant

> http://www.freebsd.org/cgi/query-pr.cgi?pr=44355

Hmm, this seems to be the same issue as that reported to the KAME
project almost two years ago.  The problem was then fixed there, but
the fix does not seem to be merged to the FreeBSD repository.

The attached diff below is a similar fix for 5.2.1R.  It cannot
be applied to 4.x directly, but I guess it's easy to modify.

It would be nice if some committers could review the diff and (if
appropriate) merge to the FreeBSD repository.

Thanks,

					JINMEI, Tatuya
					Communication Platform Lab.
					Corporate R&D Center, Toshiba Corp.
					jinmei@isl.rdc.toshiba.co.jp

Index: in6.c
===================================================================
RCS file: /home/ncvs/src/sys/netinet6/in6.c,v
retrieving revision 1.40
diff -u -r1.40 in6.c
--- in6.c	8 Nov 2003 23:36:32 -0000	1.40
+++ in6.c	29 Aug 2004 12:45:15 -0000
@@ -1,5 +1,5 @@
 /*	$FreeBSD: src/sys/netinet6/in6.c,v 1.40 2003/11/08 23:36:32 sam Exp $	*/
-/*	$KAME: in6.c,v 1.259 2002/01/21 11:37:50 keiichi Exp $	*/
+/*	$KAME: in6.c,v 1.334 2002/12/05 15:33:26 jinmei Exp $	*/
 
 /*
  * Copyright (C) 1995, 1996, 1997, and 1998 WIDE Project.
@@ -579,6 +579,14 @@
 		 */
 		if ((error = in6_update_ifa(ifp, ifra, ia)) != 0)
 			return (error);
+		if ((ia = in6ifa_ifpwithaddr(ifp, &ifra->ifra_addr.sin6_addr))
+		    == NULL) {
+		    	/*
+			 * this can happen when the user specify the 0 valid
+			 * lifetime.
+			 */
+			break;
+		}
 
 		/*
 		 * then, make the prefix on-link on the interface.
@@ -617,6 +625,15 @@
 		    ((ifra->ifra_flags & IN6_IFF_AUTOCONF) != 0);
 		pr0.ndpr_vltime = ifra->ifra_lifetime.ia6t_vltime;
 		pr0.ndpr_pltime = ifra->ifra_lifetime.ia6t_pltime;
+		if ((error = in6_init_prefix_ltimes(&pr0)) != 0) {
+			/*
+			 * Validation for lifetimes should have been done, so
+			 * this should always succeed.
+			 */
+			log(LOG_ERR, "in6_control: failed to initialize prefix"
+			    " lifetimes\n");
+			return (error);
+		}
 
 		/* add the prefix if not yet. */
 		if ((pr = nd6_prefix_lookup(&pr0)) == NULL) {
@@ -632,48 +649,40 @@
 				return (EINVAL); /* XXX panic here? */
 			}
 		}
-		if ((ia = in6ifa_ifpwithaddr(ifp, &ifra->ifra_addr.sin6_addr))
-		    == NULL) {
-		    	/* XXX: this should not happen! */
-			log(LOG_ERR, "in6_control: addition succeeded, but"
-			    " no ifaddr\n");
-		} else {
-			if ((ia->ia6_flags & IN6_IFF_AUTOCONF) != 0 &&
-			    ia->ia6_ndpr == NULL) { /* new autoconfed addr */
-				ia->ia6_ndpr = pr;
-				pr->ndpr_refcnt++;
-
-				/*
-				 * If this is the first autoconf address from
-				 * the prefix, create a temporary address
-				 * as well (when specified).
-				 */
-				if (ip6_use_tempaddr &&
-				    pr->ndpr_refcnt == 1) {
-					int e;
-					if ((e = in6_tmpifadd(ia, 1)) != 0) {
-						log(LOG_NOTICE, "in6_control: "
-						    "failed to create a "
-						    "temporary address, "
-						    "errno=%d\n", e);
-					}
-				}
-			}
+
+		/* relate the address to the prefix */
+		if (ia->ia6_ndpr == NULL) {
+			ia->ia6_ndpr = pr;
+			pr->ndpr_refcnt++;
 
 			/*
-			 * this might affect the status of autoconfigured
-			 * addresses, that is, this address might make
-			 * other addresses detached.
+			 * If this is the first autoconf address from the
+			 * prefix, create a temporary address as well
+			 * (when required).
 			 */
-			pfxlist_onlink_check();
+			if ((ia->ia6_flags & IN6_IFF_AUTOCONF) &&
+			    ip6_use_tempaddr && pr->ndpr_refcnt == 1) {
+				int e;
+				if ((e = in6_tmpifadd(ia, 1)) != 0) {
+					log(LOG_NOTICE, "in6_control: failed "
+					    "to create a temporary address, "
+					    "errno=%d\n", e);
+				}
+			}
 		}
+
+		/*
+		 * this might affect the status of autoconfigured addresses,
+		 * that is, this address might make other addresses detached.
+		 */
+		pfxlist_onlink_check();
+
 		break;
 	}
 
 	case SIOCDIFADDR_IN6:
 	{
-		int i = 0;
-		struct nd_prefix pr0, *pr;
+		struct nd_prefix *pr;
 
 		/*
 		 * If the address being deleted is the only one that owns
@@ -683,37 +692,12 @@
 		 * and the prefix management.  We do this, however, to provide
 		 * as much backward compatibility as possible in terms of
 		 * the ioctl operation.
+		 * Note that in6_purgeaddr() will decrement ndpr_refcnt.
 		 */
-		bzero(&pr0, sizeof(pr0));
-		pr0.ndpr_ifp = ifp;
-		pr0.ndpr_plen = in6_mask2len(&ia->ia_prefixmask.sin6_addr,
-					     NULL);
-		if (pr0.ndpr_plen == 128)
-			goto purgeaddr;
-		pr0.ndpr_prefix = ia->ia_addr;
-		pr0.ndpr_mask = ia->ia_prefixmask.sin6_addr;
-		for (i = 0; i < 4; i++) {
-			pr0.ndpr_prefix.sin6_addr.s6_addr32[i] &=
-				ia->ia_prefixmask.sin6_addr.s6_addr32[i];
-		}
-		/*
-		 * The logic of the following condition is a bit complicated.
-		 * We expire the prefix when
-		 * 1. the address obeys autoconfiguration and it is the
-		 *    only owner of the associated prefix, or
-		 * 2. the address does not obey autoconf and there is no
-		 *    other owner of the prefix.
-		 */
-		if ((pr = nd6_prefix_lookup(&pr0)) != NULL &&
-		    (((ia->ia6_flags & IN6_IFF_AUTOCONF) != 0 &&
-		      pr->ndpr_refcnt == 1) ||
-		     ((ia->ia6_flags & IN6_IFF_AUTOCONF) == 0 &&
-		      pr->ndpr_refcnt == 0))) {
-			pr->ndpr_expire = 1; /* XXX: just for expiration */
-		}
-
-	  purgeaddr:
+		pr = ia->ia6_ndpr;
 		in6_purgeaddr(&ia->ia_ifa);
+		if (pr && pr->ndpr_refcnt == 0)
+			prelist_remove(pr);
 		break;
 	}
 
@@ -1177,24 +1161,26 @@
 		in6_prefix_remove_ifid(iilen, oia);
 	}
 
-	/*
-	 * When an autoconfigured address is being removed, release the
-	 * reference to the base prefix.  Also, since the release might
-	 * affect the status of other (detached) addresses, call
-	 * pfxlist_onlink_check().
+  	/*
+	 * Release the reference to the base prefix.  There should be a
+	 * positive reference.
 	 */
-	if ((oia->ia6_flags & IN6_IFF_AUTOCONF) != 0) {
-		if (oia->ia6_ndpr == NULL) {
-			nd6log((LOG_NOTICE, "in6_unlink_ifa: autoconf'ed address "
-			    "%p has no prefix\n", oia));
-		} else {
-			oia->ia6_ndpr->ndpr_refcnt--;
-			oia->ia6_flags &= ~IN6_IFF_AUTOCONF;
-			oia->ia6_ndpr = NULL;
-		}
+	if (oia->ia6_ndpr == NULL) {
+		nd6log((LOG_NOTICE,
+		    "in6_unlink_ifa: autoconf'ed address "
+		    "%p has no prefix\n", oia));
+	} else {
+		oia->ia6_ndpr->ndpr_refcnt--;
+		oia->ia6_ndpr = NULL;
+	}
 
+	/*
+	 * Also, if the address being removed is autoconf'ed, call
+	 * pfxlist_onlink_check() since the release might affect the status of
+	 * other (detached) addresses. 
+	 */
+	if ((oia->ia6_flags & IN6_IFF_AUTOCONF))
 		pfxlist_onlink_check();
-	}
 
 	/*
 	 * release another refcnt for the link from in6_ifaddr.
Comment 5 George V. Neville-Neil freebsd_committer freebsd_triage 2004-10-14 12:34:13 UTC
Responsible Changed
From-To: freebsd-net->gnn@freebsd.org

Took responsibility for patching and testing this.
Comment 6 George V. Neville-Neil freebsd_committer freebsd_triage 2004-10-16 10:11:26 UTC
Here is a patch against CURRENT for this that I am awaiting approval
on:

Index: in6.c
===================================================================
RCS file: /Volumes/exported/FreeBSD-CVS/src/sys/netinet6/in6.c,v
retrieving revision 1.48
diff -u -r1.48 in6.c
--- in6.c	23 Aug 2004 03:00:26 -0000	1.48
+++ in6.c	16 Oct 2004 08:54:56 -0000
@@ -1,4 +1,4 @@
-/*	$FreeBSD$	*/
+/*	$FreeBSD: src/sys/netinet6/in6.c,v 1.48 2004/08/23 03:00:26 rwatson Exp $	*/
 /*	$KAME: in6.c,v 1.259 2002/01/21 11:37:50 keiichi Exp $	*/
 
 /*
@@ -575,6 +575,14 @@
 		 */
 		if ((error = in6_update_ifa(ifp, ifra, ia)) != 0)
 			return (error);
+		if ((ia = in6ifa_ifpwithaddr(ifp, &ifra->ifra_addr.sin6_addr))
+		    == NULL) {
+		    	/*
+			 * this can happen when the user specify the 0 valid
+			 * lifetime.
+			 */
+			break;
+		}
 
 		/*
 		 * then, make the prefix on-link on the interface.
@@ -628,41 +636,34 @@
 				return (EINVAL); /* XXX panic here? */
 			}
 		}
-		if ((ia = in6ifa_ifpwithaddr(ifp, &ifra->ifra_addr.sin6_addr))
-		    == NULL) {
-		    	/* XXX: this should not happen! */
-			log(LOG_ERR, "in6_control: addition succeeded, but"
-			    " no ifaddr\n");
-		} else {
-			if ((ia->ia6_flags & IN6_IFF_AUTOCONF) != 0 &&
-			    ia->ia6_ndpr == NULL) { /* new autoconfed addr */
-				ia->ia6_ndpr = pr;
-				pr->ndpr_refcnt++;
-
-				/*
-				 * If this is the first autoconf address from
-				 * the prefix, create a temporary address
-				 * as well (when specified).
-				 */
-				if (ip6_use_tempaddr &&
-				    pr->ndpr_refcnt == 1) {
-					int e;
-					if ((e = in6_tmpifadd(ia, 1)) != 0) {
-						log(LOG_NOTICE, "in6_control: "
-						    "failed to create a "
-						    "temporary address, "
-						    "errno=%d\n", e);
-					}
-				}
-			}
+
+		/* relate the address to the prefix */
+		if (ia->ia6_ndpr == NULL) {
+			ia->ia6_ndpr = pr;
+			pr->ndpr_refcnt++;
 
 			/*
-			 * this might affect the status of autoconfigured
-			 * addresses, that is, this address might make
-			 * other addresses detached.
+			 * If this is the first autoconf address from the
+			 * prefix, create a temporary address as well
+			 * (when required).
 			 */
-			pfxlist_onlink_check();
+			if ((ia->ia6_flags & IN6_IFF_AUTOCONF) &&
+			    ip6_use_tempaddr && pr->ndpr_refcnt == 1) {
+				int e;
+				if ((e = in6_tmpifadd(ia, 1)) != 0) {
+					log(LOG_NOTICE, "in6_control: failed "
+					    "to create a temporary address, "
+					    "errno=%d\n", e);
+				}
+			}
 		}
+
+		/*
+		 * this might affect the status of autoconfigured addresses,
+		 * that is, this address might make other addresses detached.
+		 */
+		pfxlist_onlink_check();
+
 		if (error == 0 && ia)
 			EVENTHANDLER_INVOKE(ifaddr_event, ifp);
 		break;
@@ -670,8 +671,7 @@
 
 	case SIOCDIFADDR_IN6:
 	{
-		int i = 0;
-		struct nd_prefix pr0, *pr;
+		struct nd_prefix *pr;
 
 		/*
 		 * If the address being deleted is the only one that owns
@@ -681,37 +681,12 @@
 		 * and the prefix management.  We do this, however, to provide
 		 * as much backward compatibility as possible in terms of
 		 * the ioctl operation.
+		 * Note that in6_purgeaddr() will decrement ndpr_refcnt.
 		 */
-		bzero(&pr0, sizeof(pr0));
-		pr0.ndpr_ifp = ifp;
-		pr0.ndpr_plen = in6_mask2len(&ia->ia_prefixmask.sin6_addr,
-					     NULL);
-		if (pr0.ndpr_plen == 128)
-			goto purgeaddr;
-		pr0.ndpr_prefix = ia->ia_addr;
-		pr0.ndpr_mask = ia->ia_prefixmask.sin6_addr;
-		for (i = 0; i < 4; i++) {
-			pr0.ndpr_prefix.sin6_addr.s6_addr32[i] &=
-				ia->ia_prefixmask.sin6_addr.s6_addr32[i];
-		}
-		/*
-		 * The logic of the following condition is a bit complicated.
-		 * We expire the prefix when
-		 * 1. the address obeys autoconfiguration and it is the
-		 *    only owner of the associated prefix, or
-		 * 2. the address does not obey autoconf and there is no
-		 *    other owner of the prefix.
-		 */
-		if ((pr = nd6_prefix_lookup(&pr0)) != NULL &&
-		    (((ia->ia6_flags & IN6_IFF_AUTOCONF) != 0 &&
-		      pr->ndpr_refcnt == 1) ||
-		     ((ia->ia6_flags & IN6_IFF_AUTOCONF) == 0 &&
-		      pr->ndpr_refcnt == 0))) {
-			pr->ndpr_expire = 1; /* XXX: just for expiration */
-		}
-
-	  purgeaddr:
+		pr = ia->ia6_ndpr;
 		in6_purgeaddr(&ia->ia_ifa);
+		if (pr && pr->ndpr_refcnt == 0)
+			prelist_remove(pr);
 		EVENTHANDLER_INVOKE(ifaddr_event, ifp);
 		break;
 	}
@@ -1171,23 +1146,26 @@
 	}
 
 	/*
-	 * When an autoconfigured address is being removed, release the
-	 * reference to the base prefix.  Also, since the release might
-	 * affect the status of other (detached) addresses, call
-	 * pfxlist_onlink_check().
+	 * Release the reference to the base prefix.  There should be a
+	 * positive reference.
 	 */
-	if ((oia->ia6_flags & IN6_IFF_AUTOCONF) != 0) {
-		if (oia->ia6_ndpr == NULL) {
-			nd6log((LOG_NOTICE, "in6_unlink_ifa: autoconf'ed address "
-			    "%p has no prefix\n", oia));
-		} else {
-			oia->ia6_ndpr->ndpr_refcnt--;
-			oia->ia6_flags &= ~IN6_IFF_AUTOCONF;
-			oia->ia6_ndpr = NULL;
-		}
+	if (oia->ia6_ndpr == NULL) {
+		nd6log((LOG_NOTICE,
+		    "in6_unlink_ifa: autoconf'ed address "
+		    "%p has no prefix\n", oia));
+	} else {
+		oia->ia6_ndpr->ndpr_refcnt--;
+		oia->ia6_ndpr = NULL;
+	}
 
+	/*
+	 * Also, if the address being removed is autoconf'ed, call
+	 * pfxlist_onlink_check() since the release might affect the status of
+	 * other (detached) addresses. 
+	 */
+	if ((oia->ia6_flags & IN6_IFF_AUTOCONF))
 		pfxlist_onlink_check();
-	}
+
 
 	/*
 	 * release another refcnt for the link from in6_ifaddr.
Comment 7 Mark Linimon freebsd_committer freebsd_triage 2005-08-06 19:29:04 UTC
Responsible Changed
From-To: gnn@freebsd.org->gnn

Switch to canonical assignment.
Comment 8 George V. Neville-Neil freebsd_committer freebsd_triage 2006-06-22 14:27:06 UTC
State Changed
From-To: open->feedback

Can those who saw this test to see if this is fixed?  The code has been  
in for a while.
Comment 9 George V. Neville-Neil freebsd_committer freebsd_triage 2006-07-10 14:14:09 UTC
State Changed
From-To: feedback->closed

Bug fix in HEAD and STABLE branches, no feedback.