From: Stefano Brivio <sbrivio@redhat.com>
To: David Gibson <david@gibson.dropbear.id.au>
Cc: passt-dev@passt.top
Subject: Re: [PATCH 13/14] tap: Split tap_ip4_send() into UDP and ICMP variants
Date: Tue, 18 Oct 2022 14:27:04 +0200 [thread overview]
Message-ID: <20221018142704.11ee38ac@elisabeth> (raw)
In-Reply-To: <Y06XHtUTMqxBwq96@yekko>
On Tue, 18 Oct 2022 23:07:58 +1100
David Gibson <david@gibson.dropbear.id.au> wrote:
> On Tue, Oct 18, 2022 at 05:06:34AM +0200, Stefano Brivio wrote:
> > On Mon, 17 Oct 2022 19:58:06 +1100
> > David Gibson <david@gibson.dropbear.id.au> wrote:
> >
> > > tap_ip4_send() has special case logic to compute the checksums for UDP
> > > and ICMP packets, which is a mild layering violation. By using a suitable
> > > helper we can split it into tap_udp4_send() and tap_icmp4_send() functions
> > > without greatly increasing the code size, this removing that layering
> > > violation.
> > >
> > > We make some small changes to the interface while there. In both cases
> > > we make the destination IPv4 address a parameter, which will be useful
> > > later. For the UDP variant we make it take just the UDP payload, and it
> > > will generate the UDP header. For the ICMP variant we pass in the ICMP
> > > header as before. The inconsistency is because that's what seems to be
> > > the more natural way to invoke the function in the callers in each case.
> > >
> > > Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
> > > ---
> > > icmp.c | 3 ++-
> > > tap.c | 75 +++++++++++++++++++++++++++++++++++++++++-----------------
> > > tap.h | 7 ++++--
> > > 3 files changed, 60 insertions(+), 25 deletions(-)
> > >
> > > diff --git a/icmp.c b/icmp.c
> > > index 6493ea9..233acf9 100644
> > > --- a/icmp.c
> > > +++ b/icmp.c
> > > @@ -124,7 +124,8 @@ void icmp_sock_handler(const struct ctx *c, union epoll_ref ref,
> > > icmp_id_map[V4][id].seq = seq;
> > > }
> > >
> > > - tap_ip4_send(c, sr4->sin_addr.s_addr, IPPROTO_ICMP, buf, n);
> > > + tap_icmp4_send(c, sr4->sin_addr.s_addr, tap_ip4_daddr(c),
> > > + buf, n);
> > > }
> > > }
> > >
> > > diff --git a/tap.c b/tap.c
> > > index 274f4ba..5792880 100644
> > > --- a/tap.c
> > > +++ b/tap.c
> > > @@ -127,20 +127,10 @@ static void *tap_l2_hdr(const struct ctx *c, void *buf, uint16_t proto)
> > > return eh + 1;
> > > }
> > >
> > > -/**
> > > - * tap_ip4_send() - Send IPv4 packet, with L2 headers, calculating L3/L4 checksums
> > > - * @c: Execution context
> > > - * @src: IPv4 source address
> > > - * @proto: L4 protocol number
> > > - * @in: Payload
> > > - * @len: L4 payload length
> > > - */
> > > -void tap_ip4_send(const struct ctx *c, in_addr_t src, uint8_t proto,
> > > - const char *in, size_t len)
> >
> > I understand why you return ip(4)h + 1 here because I've just reviewed
> > 9/14, I wouldn't know otherwise:
> >
> > /**
> > * tap_ip4_hdr() - Build IPv4 header for inbound packet, with checksum
> > * @c: Execution context
> > * @src: IPv4 source address, network order
> > * @dst: IPv4 destination address, network order
> > * @len: L4 payload length
> > * @proto: L4 protocol number
> > *
> > * Return: pointer to write payload to
> > */
>
> Oops, yes, forgot to add a function comment. Done.
>
> > > +static void *tap_ip4_hdr(char *buf, in_addr_t src, in_addr_t dst,
> > > + size_t len, uint8_t proto)
> > > {
> > > - char buf[USHRT_MAX];
> > > - struct iphdr *ip4h = (struct iphdr *)tap_l2_hdr(c, buf, ETH_P_IP);
> > > - char *data = (char *)(ip4h + 1);
> > > + struct iphdr *ip4h = (struct iphdr *)buf;
> > >
> > > ip4h->version = 4;
> > > ip4h->ihl = sizeof(struct iphdr) / 4;
> > > @@ -151,20 +141,61 @@ void tap_ip4_send(const struct ctx *c, in_addr_t src, uint8_t proto,
> > > ip4h->ttl = 255;
> > > ip4h->protocol = proto;
> > > ip4h->saddr = src;
> > > - ip4h->daddr = tap_ip4_daddr(c);
> > > + ip4h->daddr = dst;
> > > csum_ip4_header(ip4h);
> > > + return ip4h + 1;
> > > +}
> > > +
> > > +/**
> > > + * tap_udp4_send() - Send UDP over IPv4 packet
> > > + * @c: Execution context
> > > + * @src: IPv4 source address
> > > + * @sport: UDP source port
> > > + * @dst: IPv4 destination address
> > > + * @dport: UDP destination port
> > > + * @in: UDP payload contents (not including UDP header)
> > > + * @len: UDP payload length (not including UDP header)
> > > + */
> > > +/* cppcheck-suppress unusedFunction */
> > > +void tap_udp4_send(const struct ctx *c, in_addr_t src, in_port_t sport,
> > > + in_addr_t dst, in_port_t dport,
> > > + const void *in, size_t len)
> > > +{
> > > + size_t udplen = len + sizeof(struct udphdr);
> > > + char buf[USHRT_MAX];
> > > + void *ip4h = tap_l2_hdr(c, buf, ETH_P_IP);
> > > + void *uhp = tap_ip4_hdr(ip4h, src, dst, udplen, IPPROTO_UDP);
> >
> > Two observations:
> >
> > - this saves one line and one cast, but it's really a bit unnatural that
> > tap_ip4_hdr() doesn't point to the header it just made, or to nothing.
> >
> > I would rather have to +1 the return value or the original pointer
> > instead or having this trick
> >
> > > + struct udphdr *uh = (struct udphdr *)uhp;
> > > + char *data = (char *)(uh + 1);
> >
> > - it's longer, but in my opinion clearer, if we split a bit more clearly
> > the components of the packet, that is, something like (untested):
>
> I don't really want to change this. Yes, it's a bit counterintuitive
> at first blush, but there's a reason for this approach.
>
> This style of a function which generates a header then points *after*
> it works even if the header it generates is of variable length.
> Advancing to the payload in the caller doesn't (at least not without
> breaking the abstraction I'm trying to generate with these helpers).
>
> That's not just theoretical, because at some point I'd like to extend
> the l2_hdr function to also allocate space for the qemu socket length
> header.
>
> I'm certainly open to name changes to make this behaviour more
> obvious, but I think returning the payload pointer not the header
> pointer makes for a better abstraction here.
Hmm, yes, I think the variable length case is a very valid point, and
also in terms of abstraction I see the advantage. There are just two
things I can think of:
- passing the end pointer as argument (not as practical as your
solution, though)
- naming it tap_ip4_push_hdr(), tap_ip4_hdr_after(),
tap_ip4_hdr_goto_next(), tap_ip4_leave_header_behind()... I can't
think of anything better at this point. I'll keep thinking, but at the
moment I'd be fine even with the current name.
--
Stefano
next prev parent reply other threads:[~2022-10-18 12:27 UTC|newest]
Thread overview: 36+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-10-17 8:57 [PATCH 00/14] Clean up checksum and header generation for inbound packets David Gibson
2022-10-17 8:57 ` [PATCH 01/14] Add csum_icmp6() helper for calculating ICMPv6 checksums David Gibson
2022-10-18 3:01 ` Stefano Brivio
2022-10-18 12:05 ` David Gibson
2022-10-17 8:57 ` [PATCH 02/14] Add csum_icmp4() helper for calculating ICMPv4 checksums David Gibson
2022-10-18 3:01 ` Stefano Brivio
2022-10-18 12:06 ` David Gibson
2022-10-18 12:28 ` Stefano Brivio
2022-10-17 8:57 ` [PATCH 03/14] Add csum_udp6() helper for calculating UDP over IPv6 checksums David Gibson
2022-10-18 3:02 ` Stefano Brivio
2022-10-18 12:06 ` David Gibson
2022-10-17 8:57 ` [PATCH 04/14] Add csum_udp4() helper for calculating UDP over IPv4 checksums David Gibson
2022-10-18 3:03 ` Stefano Brivio
2022-10-18 12:06 ` David Gibson
2022-10-17 8:57 ` [PATCH 05/14] Add csum_ip4_header() helper to calculate IPv4 header checksums David Gibson
2022-10-18 3:03 ` Stefano Brivio
2022-10-18 12:07 ` David Gibson
2022-10-17 8:57 ` [PATCH 06/14] Add helpers for normal inbound packet destination addresses David Gibson
2022-10-18 3:04 ` Stefano Brivio
2022-10-18 12:07 ` David Gibson
2022-10-17 8:58 ` [PATCH 07/14] Remove support for TCP packets from tap_ip_send() David Gibson
2022-10-17 8:58 ` [PATCH 08/14] tap: Remove unhelpeful vnet_pre optimization from tap_send() David Gibson
2022-10-18 3:05 ` Stefano Brivio
2022-10-18 12:07 ` David Gibson
2022-10-17 8:58 ` [PATCH 09/14] Split tap_ip_send() into IPv4 and IPv6 specific functions David Gibson
2022-10-18 3:06 ` Stefano Brivio
2022-10-18 12:07 ` David Gibson
2022-10-17 8:58 ` [PATCH 10/14] tap: Split tap_ip6_send() into UDP and ICMP variants David Gibson
2022-10-17 8:58 ` [PATCH 11/14] ndp: Remove unneeded eh_source parameter David Gibson
2022-10-17 8:58 ` [PATCH 12/14] ndp: Use tap_icmp6_send() helper David Gibson
2022-10-17 8:58 ` [PATCH 13/14] tap: Split tap_ip4_send() into UDP and ICMP variants David Gibson
2022-10-18 3:06 ` Stefano Brivio
2022-10-18 12:07 ` David Gibson
2022-10-18 12:27 ` Stefano Brivio [this message]
2022-10-18 23:54 ` David Gibson
2022-10-17 8:58 ` [PATCH 14/14] dhcp: Use tap_udp4_send() helper in dhcp() David Gibson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20221018142704.11ee38ac@elisabeth \
--to=sbrivio@redhat.com \
--cc=david@gibson.dropbear.id.au \
--cc=passt-dev@passt.top \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this public inbox
https://passt.top/passt
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for IMAP folder(s).