public inbox for passt-dev@passt.top
 help / color / mirror / code / Atom feed
From: Stefano Brivio <sbrivio@redhat.com>
To: David Gibson <david@gibson.dropbear.id.au>
Cc: passt-dev@passt.top, Callum Parsey <callum@neoninteger.au>,
	me@yawnt.com, lemmi@nerd2nerd.org,
	Andrea Arcangeli <aarcange@redhat.com>
Subject: Re: [PATCH v2 03/10] netlink: Add functionality to copy routes from outer namespace
Date: Tue, 23 May 2023 08:14:07 +0200	[thread overview]
Message-ID: <20230523081345.4f0a0274@elisabeth> (raw)
In-Reply-To: <ZGwuJRvOnMDqwVGx@yekko>

On Tue, 23 May 2023 13:08:21 +1000
David Gibson <david@gibson.dropbear.id.au> wrote:

> On Mon, May 22, 2023 at 11:58:51AM +0200, Stefano Brivio wrote:
> > On Mon, 22 May 2023 18:42:01 +1000
> > David Gibson <david@gibson.dropbear.id.au> wrote:
> >   
> > > On Mon, May 22, 2023 at 01:42:17AM +0200, Stefano Brivio wrote:  
> > > > Instead of just fetching the default gateway and configuring a single
> > > > equivalent route in the target namespace, on 'pasta --config-net', it
> > > > might be desirable in some cases to copy the whole set of routes
> > > > corresponding to a given output interface.
> > > > 
> > > > For instance, in:
> > > >   https://github.com/containers/podman/issues/18539
> > > >   IPv4 Default Route Does Not Propagate to Pasta Containers on Hetzner VPSes
> > > > 
> > > > configuring the default gateway won't work without a gateway-less
> > > > route (specifying the output interface only), because the default
> > > > gateway is, somewhat dubiously, not on the same subnet as the
> > > > container.
> > > > 
> > > > This is a similar case to the one covered by commit 7656a6f88882
> > > > ("conf: Adjust netmask on mismatch between IPv4 address/netmask and
> > > > gateway"), and I'm not exactly proud of that workaround.
> > > > 
> > > > We also have:
> > > >   https://bugs.passt.top/show_bug.cgi?id=49
> > > >   pasta does not work with tap-style interface
> > > > 
> > > > for which, eventually, we should be able to configure a gateway-less
> > > > route in the target namespace.
> > > > 
> > > > Introduce different operation modes for nl_route(), including a new
> > > > NL_DUP one, not exposed yet, which simply parrots back to the kernel
> > > > the route dump for a given interface from the outer namespace, fixing
> > > > up flags and interface indices on the way, and requesting to add the
> > > > same routes in the target namespace, on the interface we manage.
> > > > 
> > > > For n routes we want to duplicate, send n identical netlink requests
> > > > including the full dump: routes might depend on each other and the
> > > > kernel processes RTM_NEWROUTE messages sequentially, not atomically,
> > > > and repeating the full dump naturally resolves dependencies without
> > > > the need to actually calculate them.
> > > > 
> > > > I'm not kidding, it actually works pretty well.    
> > > 
> > > If there's a way to detect whether the kernel rejected some of the
> > > routes, it would be nice to cut that loop short as soon as all the
> > > routes are inserted.  Obviously that could be a followup improvement,
> > > though.  
> > 
> > Yes, there's a way, but to keep things asynchronous in a simple way we
> > process errors from nl_req() only at the next request.
> > 
> > This part doesn't really need to be asynchronous, though: we could add
> > a flag for nl_req() saying that we want to know about NLMSG_ERROR right
> > away. This looks relatively straightforward, and already an improvement
> > in the sense you mentioned.
> > 
> > Actually parsing the error and finding out the offending route is a bit
> > more complicated, though.  
> 
> Right, but we don't necessarily need to do that: all we need is that
> if there are *no* errors we can stop the loop early.

Yes yes, that's what I meant with the paragraph before.

By the way, note that in general we'll get EEXIST in the "extended ACK"
for any message we send, because we just inserted addresses that
already created their prefix routes.

We could think of setting the IFA_F_NOPREFIXROUTE flag on addresses, on
NL_DUP in nl_addr(), or even always, to avoid this.

-- 
Stefano


  reply	other threads:[~2023-05-23  6:14 UTC|newest]

Thread overview: 23+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-05-21 23:42 [PATCH v2 00/10] Optionally copy all routes and addresses for pasta, allow gateway-less routes Stefano Brivio
2023-05-21 23:42 ` [PATCH v2 01/10] netlink: Fix comment about response buffer size for nl_req() Stefano Brivio
2023-05-21 23:42 ` [PATCH v2 02/10] pasta: Improve error handling on failure to join network namespace Stefano Brivio
2023-05-21 23:42 ` [PATCH v2 03/10] netlink: Add functionality to copy routes from outer namespace Stefano Brivio
2023-05-22  8:42   ` David Gibson
2023-05-22  9:58     ` Stefano Brivio
2023-05-23  3:08       ` David Gibson
2023-05-23  6:14         ` Stefano Brivio [this message]
2023-05-27  2:06           ` David Gibson
2023-05-21 23:42 ` [PATCH v2 04/10] conf: --config-net option is for pasta mode only Stefano Brivio
2023-05-21 23:42 ` [PATCH v2 05/10] conf, pasta: With --config-net, copy all routes by default Stefano Brivio
2023-05-22  8:44   ` David Gibson
2023-05-22  9:59     ` Stefano Brivio
2023-05-21 23:42 ` [PATCH v2 06/10] Revert "conf: Adjust netmask on mismatch between IPv4 address/netmask and gateway" Stefano Brivio
2023-05-21 23:42 ` [PATCH v2 07/10] conf: Don't exit if sourced default route has no gateway Stefano Brivio
2023-05-22  8:48   ` David Gibson
2023-05-22 15:29     ` Stefano Brivio
2023-05-21 23:42 ` [PATCH v2 08/10] netlink: Add functionality to copy addresses from outer namespace Stefano Brivio
2023-05-22  8:51   ` David Gibson
2023-05-21 23:42 ` [PATCH v2 09/10] conf, pasta: With --config-net, copy all addresses by default Stefano Brivio
2023-05-22  8:53   ` David Gibson
2023-05-22 15:30     ` Stefano Brivio
2023-05-21 23:42 ` [PATCH v2 10/10] passt.h: Fix description of pasta_ifi in struct ctx Stefano Brivio

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20230523081345.4f0a0274@elisabeth \
    --to=sbrivio@redhat.com \
    --cc=aarcange@redhat.com \
    --cc=callum@neoninteger.au \
    --cc=david@gibson.dropbear.id.au \
    --cc=lemmi@nerd2nerd.org \
    --cc=me@yawnt.com \
    --cc=passt-dev@passt.top \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://passt.top/passt

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for IMAP folder(s).