Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date: Tue, 3 Jan 2017 18:53:19 -0500
From: Rich Felker <dalias@...c.org>
To: musl@...ts.openwall.com
Subject: Re: libstdc++ namespace pollution

On Tue, Jan 03, 2017 at 06:33:21PM -0500, Rich Felker wrote:
> On Tue, Jan 03, 2017 at 06:17:28PM -0500, Rich Felker wrote:
> > On Tue, Jan 03, 2017 at 10:52:19PM +0000, Justin Cormack wrote:
> > > On 3 January 2017 at 21:35, Rich Felker <dalias@...c.org> wrote:
> > > > On Tue, Jan 03, 2017 at 09:16:29PM +0000, Justin Cormack wrote:
> > > >> On 3 January 2017 at 18:29, Rich Felker <dalias@...c.org> wrote:
> > > >> > On Tue, Jan 03, 2017 at 05:44:47PM +0000, Justin Cormack wrote:
> > > >> >> I have been trying to build a C++ program recently, and came across
> > > >> >> the issue that
> > > >> >>
> > > >> >> 1. libstdc++ always defines _GNU_SOURCE see
> > > >> >> https://gcc.gnu.org/onlinedocs/libstdc++/faq.html#faq.predefined
> > > >> >> 2. Musl defines pretty much everything once _GNU_SOURCE is defined
> > > >> >
> > > >> > So does glibc. :)
> > > >>
> > > >> This particular issue only happens with Musl, it includes more...
> > > >
> > > > That might be a bug/unwanted behavior on musl's side then. Could you
> > > > help me check? I'd be happy to remove namespace-polluting cruft that's
> > > > not actually needed to meet what applications can reasonably expect
> > > > from _GNU_SOURCE.
> > > >
> > > 
> > > This minimal test case compiles with c++ on Debian but not on Alpine:
> > > 
> > > #include <signal.h>
> > > 
> > > class ArgumentParser_x64 {
> > >   enum Register {
> > >     REG_A,
> > >     REG_B,
> > >     REG_C,
> > >     REG_D,
> > >     REG_SI,
> > >     REG_DI,
> > >     REG_BP,
> > >     REG_SP,
> > >     REG_8,
> > >     REG_9,
> > >     REG_10,
> > >     REG_11,
> > >     REG_12,
> > >     REG_13,
> > >     REG_14,
> > >     REG_15,
> > >     REG_RIP,
> > >   };
> > > };
> > > 
> > > main() {}
> > 
> > I see. It's a bit of luck that it happens to work on glibc, I think --
> > they define the REG_* identifiers as enum constants and then #define
> > them to themselves in order to satisfy programs which are checking for
> > their presence with #ifdef. So while the above code has macros
> > clashing with the identifier names it wants to use, they end up being
> > benign because they're defined to themselves.
> > 
> > In general I don't do this (the enum approach) in musl because (1) I
> > don't like enums, and (2) it breaks things that want to use the macros
> > in preprocessor #if conditionals. However for macros like this that
> > aren't specified by any standard and which are fundamentally namespace
> > pollution, it seems like a better approach, so I'm not opposed to
> > switching. We should probably do the same on all affected archs if we
> > do.
> 
> After a quick glance, looks like this issue only affects x86[_64].
> I'll see if I can prepare a simple patch with a clean idiom we can
> repeat elsewhere if needed.

Does this sed script, run on bits/signal.h, work for you?

sed 's/#define  *\(REG_[A-Z_]\{1,\}\)  *\([0-9]\{1,\}\)/enum { \1 = \2 };\n#define \1 \1/'

I like this approach better than trying to pretty-format a big enum by
hand because it doesn't risk mistakes. If a single big enum would be
preferred, maybe we could do a similar sed for the body of the enum,
but I actually don't really like the GNU-style interleaved
enum-and-#define.

Rich

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.