Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date: Thu, 16 May 2019 09:22:46 -0400
From: Rich Felker <dalias@...c.org>
To: musl@...ts.openwall.com
Subject: Re: [PATCH] fix tls offsets when p_vaddr%p_align != 0 for
 TLS_ABOVE_TP

On Thu, May 16, 2019 at 09:48:50AM +0200, Szabolcs Nagy wrote:
> * Rich Felker <dalias@...c.org> [2019-05-15 20:20:51 -0400]:
> > On Tue, May 14, 2019 at 04:01:31AM +0200, Szabolcs Nagy wrote:
> > > this came up because lld changed tls alignment on aarch64 as a
> > > workaround for a bionic abi issue https://reviews.llvm.org/D53906
> > > but lld does not handle p_vaddr%p_align!=0 right so it broke on glibc
> > > https://reviews.llvm.org/D61824
> > > 
> > > the patch is untested (bfd linker cannot seem to create problematic
> > > elf objects), but at least there are no regressions with libc-test.
> > 
> > > >From 8c94fcbc9faeb8b07132506757c3d3973652420e Mon Sep 17 00:00:00 2001
> > > From: Szabolcs Nagy <nsz@...t70.net>
> > > Date: Mon, 13 May 2019 18:47:11 +0000
> > > Subject: [PATCH] fix tls offsets when p_vaddr%p_align != 0 for TLS_ABOVE_TP
> > > 
> > > currently the bfd linker does not seem to create tls segments where
> > > p_vaddr%p_align != 0, but this is valid in ELF and then the runtime
> > > computed tls offset must satisfy
> > > 
> > >   offset%p_align == (base+p_vaddr)%p_align
> > > 
> > > and in case of local exec tls (main executable) the smallest such
> > > offset must be used (otherwise it is incompatible with the offset
> > > computed by the static linker). the !TLS_ABOVE_TP case handled this
> > > correctly (the offset is negative then in the formula).
> > > 
> > > the ldso code for TLS_ABOVE_TP is changed so the static tls offset
> > > of each module satisfies the formula and tls_offset always points
> > > to the end of the allocated static tls area (and not aligned up to
> > > tls_align or MIN_TLS_ALIGN).
> > 
> > I guess this saves some wasted memory?
> > 
> > > the tls_offset computation was wrong
> > > when multiple modules were loaded with static tls and in some the
> > > tls segment p_memsz%p_align != 0.
> > 
> > I don't understand this part. Are you saying we're currently
> > misaligning TLS for some libraries now? 
> 
> any time there are 'alignment gaps' in the static tls area
> for shared libs, since tls_offset += dso.tls.size is the
> only update to tls_offset which can be misaligned, it is
> not updated when dso.tls.offset is aligned up.
> 
> the only exception is tls in the application: tls_offset
> is aligned up after the app tls. (which my introduce
> bigger gap than necessary)
> 
> i can make the tls_offset fix a separate change from
> the p_vaddr%p_align!=0 fix.

I'd like that, both because it was non-obvious to me that this was
another major change, and because it fixes a separate, major
user-facing bug (this is what was breaking rust, and could break lots
of other things) rather than the new issue with lld's Bionic hack.

> 
> > > ---
> > >  ldso/dynlink.c       | 13 ++++++-------
> > >  src/env/__init_tls.c |  3 ++-
> > >  2 files changed, 8 insertions(+), 8 deletions(-)
> > > 
> > > diff --git a/ldso/dynlink.c b/ldso/dynlink.c
> > > index 42a5470d..6dc39483 100644
> > > --- a/ldso/dynlink.c
> > > +++ b/ldso/dynlink.c
> > > @@ -1126,9 +1126,9 @@ static struct dso *load_library(const char *name, struct dso *needed_by)
> > >  		p->tls_id = ++tls_cnt;
> > >  		tls_align = MAXP2(tls_align, p->tls.align);
> > >  #ifdef TLS_ABOVE_TP
> > > -		p->tls.offset = tls_offset + ( (tls_align-1) &
> > > -			-(tls_offset + (uintptr_t)p->tls.image) );
> > > -		tls_offset += p->tls.size;
> > > +		p->tls.offset = tls_offset + ( (p->tls.align-1) &
> > > +			(-tls_offset + (uintptr_t)p->tls.image) );
> > > +		tls_offset = p->tls.offset + p->tls.size;
> > 
> > Is there a motivation for the seemingly independent change from use of
> > tls_align to use of p->tls.align here?
> 
> to 'satisfy the formula' which uses p_align, using the max
> alignment seemed unnecessary overalignment.
> 
> if all modules follow the same formula and static linking
> too then you only have to prove that one thing to work.

OK, this sounds good. After reading it less tired it makes sense to me
too. Thanks for the explanation.

Rich

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.