From: David Gibson <david@gibson.dropbear.id.au>
To: Stefano Brivio <sbrivio@redhat.com>
Cc: passt-dev@passt.top
Subject: Re: [PATCH v3 4/4] fwd: Direct inbound spliced forwards to the guest's external address
Date: Thu, 10 Oct 2024 16:57:32 +1100
Message-ID: <ZwdszOBQxWf1Njx0@zatzit.fritz.box>
In-Reply-To: <20241009224433.7fc28fc7@elisabeth>
On Wed, Oct 09, 2024 at 10:44:33PM +0200, Stefano Brivio wrote:
> On Wed, 9 Oct 2024 15:07:21 +0200
> Stefano Brivio <sbrivio@redhat.com> wrote:
>
> > On Wed, 2 Oct 2024 15:48:26 +1000
> > David Gibson <david@gibson.dropbear.id.au> wrote:
> >
> > > In pasta mode, where addressing permits we "splice" connections, forwarding
> > > directly from host socket to guest/container socket without any L2 or L3
> > > processing. This gives us a very large performance improvement when it's
> > > possible.
> > >
> > > Since the traffic is from a local socket within the guest, it will go over
> > > the guest's 'lo' interface, and accordingly we set the guest side address
> > > to be the loopback address. However this has a surprising side effect:
> > > sometimes guests will run services that are only supposed to be used within
> > > the guest and are therefore bound to only 127.0.0.1 and/or ::1. pasta's
> > > forwarding exposes those services to the host, which isn't generally what
> > > we want.
> > >
> > > Correct this by instead forwarding inbound "splice" flows to the guest's
> > > external address.
> > >
> > > Link: https://github.com/containers/podman/issues/24045
> > >
> > > Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
> > > ---
> > > conf.c | 9 +++++++++
> > > fwd.c | 31 +++++++++++++++++++++++--------
> > > passt.1 | 23 +++++++++++++++++++----
> > > passt.h | 2 ++
> > > 4 files changed, 53 insertions(+), 12 deletions(-)
> > >
> > > diff --git a/conf.c b/conf.c
> > > index 6e62510..b5318f3 100644
> > > --- a/conf.c
> > > +++ b/conf.c
> > > @@ -908,6 +908,9 @@ pasta_opts:
> > > " -U, --udp-ns SPEC UDP port forwarding to init namespace\n"
> > > " SPEC is as described above\n"
> > > " default: auto\n"
> > > + " --host-lo-to-ns-lo DEPRECATED:\n"
> > > + " Translate host-loopback forwards to\n"
> > > + " namespace loopback\n"
> > > " --userns NSPATH Target user namespace to join\n"
> > > " --netns PATH|NAME Target network namespace to join\n"
> > > " --netns-only Don't join existing user namespace\n"
> > > @@ -1284,6 +1287,7 @@ void conf(struct ctx *c, int argc, char **argv)
> > > {"netns-only", no_argument, NULL, 20 },
> > > {"map-host-loopback", required_argument, NULL, 21 },
> > > {"map-guest-addr", required_argument, NULL, 22 },
> > > + {"host-lo-to-ns-lo", no_argument, NULL, 23 },
> > > { 0 },
> > > };
> > > const char *logname = (c->mode == MODE_PASTA) ? "pasta" : "passt";
> > > @@ -1461,6 +1465,11 @@ void conf(struct ctx *c, int argc, char **argv)
> > > conf_nat(optarg, &c->ip4.map_guest_addr,
> > > &c->ip6.map_guest_addr, NULL);
> > > break;
> > > + case 23:
> > > + if (c->mode != MODE_PASTA)
> > > + die("--host-lo-to-ns-lo is for pasta mode only");
> > > + c->host_lo_to_ns_lo = 1;
> > > + break;
> > > case 'd':
> > > c->debug = 1;
> > > c->quiet = 0;
> > > diff --git a/fwd.c b/fwd.c
> > > index a505098..c71f5e1 100644
> > > --- a/fwd.c
> > > +++ b/fwd.c
> > > @@ -447,20 +447,35 @@ uint8_t fwd_nat_from_host(const struct ctx *c, uint8_t proto,
> > > (proto == IPPROTO_TCP || proto == IPPROTO_UDP)) {
> > > /* spliceable */
> > >
> > > - /* Preserve the specific loopback adddress used, but let the
> > > - * kernel pick a source port on the target side
> > > + /* The traffic will go over the guest's 'lo' interface, but by
> > > + * default use its external address, so we don't inadvertently
> > > + * expose services that listen only on the guest's loopback
> > > + * address. That can be overridden by --host-lo-to-ns-lo which
> > > + * will instead forward to the loopback address in the guest.
> > > + *
> > > + * In either case, let the kernel pick the source address to
> > > + * match.
> > > */
> > > - tgt->oaddr = ini->eaddr;
> > > + if (inany_v4(&ini->eaddr)) {
> > > + if (c->host_lo_to_ns_lo)
> > > + tgt->eaddr = inany_loopback4;
> > > + else
> > > + tgt->eaddr = inany_from_v4(c->ip4.addr_seen);
> > > + tgt->oaddr = inany_any4;
> > > + } else {
> > > + if (c->host_lo_to_ns_lo)
> > > + tgt->eaddr = inany_loopback6;
> > > + else
> > > + tgt->eaddr.a6 = c->ip6.addr_seen;
> >
> > Either this...
> >
> > > + tgt->oaddr = inany_any6;
> >
> > or this (and not something before this patch, up to 3/4) make the
> > "TCP/IPv6: host to ns (spliced): big transfer" test in pasta/tcp hang,
> > sometimes (about one in three/four runs), that's what I mistakenly
> > reported as coming from Laurent's series at:
Huh, interesting. Just got back from my leave and ran that group of
tests in a loop this afternoon, but didn't manage to reproduce. I
have administrivia that will probably fill the rest of this week, but
I'll look into this as soon as I can.
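
For reference while reading the quoted fwd.c hunk, the destination
selection it introduces can be sketched in isolation roughly as below.
This is only an illustration under stated assumptions: struct splice_cfg
and splice_target() are hypothetical stand-ins and do not exist in passt,
which uses struct ctx and union inany_addr instead. The sketch also leaves
out the source address, which the patch sets to the unspecified address so
that the kernel picks one to match.

```c
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdbool.h>
#include <string.h>

/* Hypothetical stand-in for the relevant fields of passt's struct ctx */
struct splice_cfg {
	bool host_lo_to_ns_lo;		/* --host-lo-to-ns-lo was given */
	struct in_addr addr4_seen;	/* guest's observed IPv4 address */
	struct in6_addr addr6_seen;	/* guest's observed IPv6 address */
};

/*
 * Hypothetical helper: choose the guest-side destination address for an
 * inbound spliced flow.  @dst must point to a struct in_addr for AF_INET
 * or a struct in6_addr for AF_INET6.  Returns 0 on success, -1 if the
 * address family is not supported.
 */
int splice_target(const struct splice_cfg *cfg, int af, void *dst)
{
	if (af == AF_INET) {
		/* Default: the guest's external address */
		struct in_addr a = cfg->addr4_seen;

		/* Deprecated behaviour: forward to the guest's loopback */
		if (cfg->host_lo_to_ns_lo)
			a.s_addr = htonl(INADDR_LOOPBACK);

		memcpy(dst, &a, sizeof(a));
		return 0;
	}

	if (af == AF_INET6) {
		struct in6_addr a = cfg->addr6_seen;

		if (cfg->host_lo_to_ns_lo)
			a = in6addr_loopback;

		memcpy(dst, &a, sizeof(a));
		return 0;
	}

	return -1;
}
```

With host_lo_to_ns_lo unset, a service bound only to 127.0.0.1 or ::1 in
the guest never sees the forwarded connection, because the destination is
the guest's external address rather than loopback.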
--
David Gibson (he or they) | I'll have my music baroque, and my code
david AT gibson.dropbear.id.au | minimalist, thank you, not the other way
| around.
http://www.ozlabs.org/~dgibson