public inbox for passt-dev@passt.top
 help / color / mirror / code / Atom feed
From: Stefano Brivio <sbrivio@redhat.com>
To: David Gibson <david@gibson.dropbear.id.au>
Cc: passt-dev@passt.top, Matej Hrica <mhrica@redhat.com>
Subject: Re: [PATCH 3/4] util, lineread, tap: Overflow checks on long signed sums and subtractions
Date: Thu, 27 Jun 2024 22:46:45 +0200	[thread overview]
Message-ID: <20240627224645.6cad668d@elisabeth> (raw)
In-Reply-To: <20240627095537.4d3f20f8@elisabeth>

On Thu, 27 Jun 2024 09:55:46 +0200
Stefano Brivio <sbrivio@redhat.com> wrote:

> On Thu, 27 Jun 2024 11:13:14 +1000
> David Gibson <david@gibson.dropbear.id.au> wrote:
> 
> > On Thu, Jun 27, 2024 at 01:45:35AM +0200, Stefano Brivio wrote:  
> > > Potential sum and subtraction overflows were reported by Coverity in a
> > > few places where we use size_t and ssize_t types.
> > > 
> > > Strictly speaking, those are not false positives even if I don't see a
> > > way to trigger those: for instance, if we receive up to n bytes from a
> > > socket up, the return value from recv() is already constrained and
> > > can't overflow for the usage we make in tap_handler_passt().    
> > 
> > Actually, I think they are false positives.  In a bunch of cases the
> > reasoning for that does rely on assuming the kernel will never return
> > a value greater than the buffer size for read(), write() or similar.  
> 
> No, that was exactly my point: return values are constrained by the
> kernel, but a static checker doesn't necessarily have to assume a kernel
> that's properly functioning.
> 
> In general, static checkers do, especially if POSIX has a clear
> definition of a system call, and for what matters to us, they should.
> 
> But here Coverity is ignoring that, and I'm not sure we should call it
> a false positive. It's kind of arbitrary really. I think Coverity in
> these cases just prefers to "blindly" apply CERT C INT32-C locally,
> which is not necessarily a bad choice, because "false positives" are
> not so much of a nuisance.
> 
> > So possibly just ASSERT()ing that would suffice.  
> 
> In some cases yes, but as we have built-ins in gcc and Clang that aim
> at keeping the cost of the checks down by, quoting gcc documentation,
> using "hardware instructions to implement these built-in functions where
> possible", and they already implement the operation, open-coding our own
> checks for assertions looks redundant and might result in slower code.

I tried, but in most cases, not even open-coding the whole thing, as an
ASSERT() or an early return, say:

			if ((rc > 0 && lr->count < LONG_MIN + rc) ||
			    (rc < 0 && lr->count > LONG_MAX + rc))
				return -1;

or its ASSERT() equivalent is enough. That's because, without using
built-in functions, we can't have (of course) "infinite precision"
types. From:

  https://gcc.gnu.org/onlinedocs/gcc/Integer-Overflow-Builtins.html

  These built-in functions promote the first two operands into infinite
  precision signed type and perform addition on those promoted operands

And it looks like Coverity wants to see operations executed in those
types: size_t or unsigned long long is not enough.

In two cases, I could make Coverity happy with some rather bulky
asserts, if I turn operations into size_t plus ssize_t, in size_t
domain. But the result looks really bulky.

I managed to change a couple of things, though:

> > > In any case, take care of those by adding two functions that
> > > explicitly check for overflows in sums and subtractions of long signed
> > > values, and using them.
> > > 
> > > Signed-off-by: Stefano Brivio <sbrivio@redhat.com>
> > > ---
> > >  lineread.c |  5 +++--
> > >  tap.c      | 21 +++++++++++++++------
> > >  util.c     |  7 +++++--
> > >  util.h     | 46 +++++++++++++++++++++++++++++++++++++++++++++-
> > >  4 files changed, 68 insertions(+), 11 deletions(-)
> > > 
> > > diff --git a/lineread.c b/lineread.c
> > > index 0387f4a..12f2d24 100644
> > > --- a/lineread.c
> > > +++ b/lineread.c
> > > @@ -17,6 +17,7 @@
> > >  #include <string.h>
> > >  #include <stdbool.h>
> > >  #include <unistd.h>
> > > +#include <errno.h>
> > >  
> > >  #include "lineread.h"
> > >  #include "util.h"
> > > @@ -102,8 +103,8 @@ ssize_t lineread_get(struct lineread *lr, char **line)
> > >  
> > >  		if (rc == 0)
> > >  			eof = true;
> > > -		else
> > > -			lr->count += rc;    
> > 
> > From the construction of the read, lr->count + rc can never exceed
> > LINEREAD_BUFFER_SIZE - lr->next_line, so this can't overflow.  
> 
> Sure. But, especially as package maintainer, in this case I prefer to
> have a useless check than carrying suppressions around.
> 
> > > +		else if (saddl_overflow(lr->count, rc, &lr->count))
> > > +			return -ERANGE;
> > >  	}
> > >  
> > >  	*line = lr->buf + lr->next_line;
> > > diff --git a/tap.c b/tap.c
> > > index ec994a2..7f8c26d 100644
> > > --- a/tap.c
> > > +++ b/tap.c
> > > @@ -1031,7 +1031,11 @@ redo:
> > >  		 */
> > >  		if (l2len > n) {
> > >  			rem = recv(c->fd_tap, p + n, l2len - n, 0);
> > > -			if ((n += rem) != l2len)    
> > 
> > Similarly, rem <= l2len - n, and therefore n + rem <= l2len.  
> 
> Same here.
> 
> > > +
> > > +			if (saddl_overflow(n, rem, &n))
> > > +				return;
> > > +
> > > +			if (n != l2len)
> > >  				return;
> > >  		}
> > >  
> > > @@ -1046,7 +1050,9 @@ redo:
> > >  
> > >  next:
> > >  		p += l2len;
> > > -		n -= l2len;    
> > 
> > We already checked that l2len <= n, so this one can't overflow either.  
> 
> Same here.
> 
> > Not sure why Coverity can't see that itself, though :/.  Possibly it
> > doesn't understand gotos well enough to see that the only goto next is
> > after that check.  
> 
> It sees that, that's the path it takes in reporting a potential
> overflow here. I think here, again, it's just blindly requesting
> INT32-C from CERT C rules, locally.
> 
> > > +
> > > +		if (ssubl_overflow(n, l2len, &n))
> > > +			return;
> > >  	}
> > >  
> > >  	tap_handler(c, now);
> > > @@ -1077,17 +1083,20 @@ redo:
> > >  	tap_flush_pools();
> > >  restart:
> > >  	while ((len = read(c->fd_tap, pkt_buf + n, TAP_BUF_BYTES - n)) > 0) {
> > > -
> > >  		if (len < (ssize_t)sizeof(struct ethhdr) ||
> > >  		    len > (ssize_t)ETH_MAX_MTU) {
> > > -			n += len;    
> > 
> > Here n+len can't exceed TAP_BUF_BYTES, so again, no overflow.  
> 
> Same here.
> 
> > > +			if (saddl_overflow(n, len, &n))
> > > +				return;
> > > +
> > >  			continue;
> > >  		}
> > >  
> > > -
> > >  		tap_add_packet(c, len, pkt_buf + n);
> > >  
> > > -		if ((n += len) == TAP_BUF_BYTES)    
> > 
> > Same here.  
> 
> Same here.
> 
> > > +		if (saddl_overflow(n, len, &n))
> > > +			return;
> > > +
> > > +		if (n == TAP_BUF_BYTES)
> > >  			break;
> > >  	}
> > >  
> > > diff --git a/util.c b/util.c
> > > index dd2e57f..a72d6c5 100644
> > > --- a/util.c
> > > +++ b/util.c
> > > @@ -567,7 +567,7 @@ int do_clone(int (*fn)(void *), char *stack_area, size_t stack_size, int flags,
> > >   *
> > >   * #syscalls write writev
> > >   */
> > > -int write_remainder(int fd, const struct iovec *iov, int iovcnt, size_t skip)
> > > +int write_remainder(int fd, const struct iovec *iov, int iovcnt, ssize_t skip)    
> > 
> > I don't love this change, since negative skip values make no sense.

Dropped, by:

> > >  {
> > >  	int i;
> > >  	size_t offset;
> > > @@ -585,7 +585,10 @@ int write_remainder(int fd, const struct iovec *iov, int iovcnt, size_t skip)
> > >  		if (rc < 0)
> > >  			return -1;
> > >  
> > > -		skip += rc;    
> > 
> > Ok, here it's not a false positive.  I believe this really could
> > overflow if you had an iov where the sum of the iov_len exceeded a
> > size_t.
> >   
> > > +		if (saddl_overflow(skip, rc, &skip)) {
> > > +			errno = -ERANGE;
> > > +			return -1;
> > > +		}    
> > 
> > If you leave skip an unsigned, you've already checked for negative rc,
> > so this is essentially an unsigned add.  Checking for overflow on an
> > unsigned addition is simpler than the logic of saddl_overflow().  
> 
> I'm fairly sure I tried that and it looked rather bulky, because I
> couldn't use __builtin_uaddl_overflow() if I recall correctly, I can try
> again.

...checking for overflow in the unsigned sum, without any built-in. Same
lines of code, but the logic is simpler.

> > >  	}
> > >  
> > >  	return 0;
> > > diff --git a/util.h b/util.h
> > > index eebb027..497d2fd 100644
> > > --- a/util.h
> > > +++ b/util.h
> > > @@ -161,7 +161,7 @@ void pidfile_write(int fd, pid_t pid);
> > >  int __daemon(int pidfile_fd, int devnull_fd);
> > >  int fls(unsigned long x);
> > >  int write_file(const char *path, const char *buf);
> > > -int write_remainder(int fd, const struct iovec *iov, int iovcnt, size_t skip);
> > > +int write_remainder(int fd, const struct iovec *iov, int iovcnt, ssize_t skip);
> > >  
> > >  /**
> > >   * af_name() - Return name of an address family
> > > @@ -223,6 +223,50 @@ static inline bool mod_between(unsigned x, unsigned i, unsigned j, unsigned m)
> > >  	return mod_sub(x, i, m) < mod_sub(j, i, m);
> > >  }
> > >  
> > > +/**
> > > + * saddl_overflow() - Sum with overflow check for long signed values
> > > + * @a:		First value
> > > + * @b:		Second value
> > > + * @sum:	Pointer to result of sum, if it doesn't overflow
> > > + *
> > > + * Return: true if the sum would overflow, false otherwise
> > > + */
> > > +static inline bool saddl_overflow(long a, long b, long *sum)    
> > 
> > These take long, but you're often calling them with ssize_t.  That's
> > _probably_ the same thing, but not necessarily.  
> 
> Right, yes, ssize_t can be long or int, even though I'm fairly sure it's
> always long on all the architectures we are able to build for.
> 
> There's no integer overflow built-in for ssize_t, but I'll probably
> need to add a macro conditional for the whole thing anyway, based on
> the type of ssize_t.

It was a bit simpler than I thought: first off, gcc, Clang and icc all
have __builtin_{add,sub}_overflow(type1 a, type2 b, type3 res).

If those built-ins are not available, all we need to check is what
SSIZE_MIN corresponds to. Now, glibc goes with a __WORDSIZE == 32 to
find that out, but it doesn't look very portable. Checking if SSIZE_MAX
is LONG_MAX, instead, should be.

> > > +{
> > > +#if __GNUC__
> > > +	return __builtin_saddl_overflow(a, b, sum);
> > > +#else
> > > +	if ((a > 0 && a > LONG_MAX - b) ||
> > > +	    (b < 0 && a < LONG_MIN - b))
> > > +		return true;
> > > +
> > > +	*sum = a + b;
> > > +	return false;
> > > +#endif
> > > +}
> > > +
> > > +/**
> > > + * saddl_overflow() - Subtraction with overflow check for long signed values    
> > 
> > s/saddl_overflow/ssubl_overflow/  
> 
> Oops, fixed.
> 
> > > + * @a:		Minuend
> > > + * @b:		Subtrahend
> > > + * @sum:	Pointer to result of subtraction, if it doesn't overflow
> > > + *
> > > + * Return: true if the subtraction would overflow, false otherwise
> > > + */
> > > +static inline bool ssubl_overflow(long a, long b, long *diff)
> > > +{
> > > +#if __GNUC__
> > > +	return __builtin_ssubl_overflow(a, b, diff);
> > > +#else
> > > +	if ((b > 0 && a < LONG_MIN + b) ||
> > > +	    (b < 0 && a > LONG_MAX + b))
> > > +		return true;
> > > +
> > > +	*diff = a - b;
> > > +	return false;
> > > +#endif
> > > +}
> > > +
> > >  /*
> > >   * Workarounds for https://github.com/llvm/llvm-project/issues/58992
> > >   *    
> >   
> 

-- 
Stefano


  reply	other threads:[~2024-06-27 20:47 UTC|newest]

Thread overview: 21+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-06-26 23:45 [PATCH 0/4] Small, assorted "hardening" fixes Stefano Brivio
2024-06-26 23:45 ` [PATCH 1/4] conf: Copy up to MAXDNSRCH - 1 bytes, not MAXDNSRCH Stefano Brivio
2024-06-27  0:45   ` David Gibson
2024-06-27  7:27     ` Stefano Brivio
2024-06-27 10:11       ` David Gibson
2024-06-26 23:45 ` [PATCH 2/4] tcp_splice: Check return value of setsockopt() for SO_RCVLOWAT Stefano Brivio
2024-06-27  0:46   ` David Gibson
2024-06-26 23:45 ` [PATCH 3/4] util, lineread, tap: Overflow checks on long signed sums and subtractions Stefano Brivio
2024-06-27  1:13   ` David Gibson
2024-06-27  7:55     ` Stefano Brivio
2024-06-27 20:46       ` Stefano Brivio [this message]
2024-06-28  7:15         ` David Gibson
2024-06-28  7:11       ` David Gibson
2024-06-28  7:55         ` Stefano Brivio
2024-06-28 18:30           ` Stefano Brivio
2024-07-08 13:01             ` Stefano Brivio
2024-06-26 23:45 ` [PATCH 4/4] tap: Drop frames from guest whose length is more than remaining buffer Stefano Brivio
2024-06-27  1:30   ` David Gibson
2024-06-27  8:21     ` Stefano Brivio
2024-06-28  7:19       ` David Gibson
2024-06-28  7:56         ` Stefano Brivio

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20240627224645.6cad668d@elisabeth \
    --to=sbrivio@redhat.com \
    --cc=david@gibson.dropbear.id.au \
    --cc=mhrica@redhat.com \
    --cc=passt-dev@passt.top \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://passt.top/passt

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for IMAP folder(s).