|
|
Message-ID: <4798297.Es0HkHt82z@nimes>
Date: Tue, 14 Nov 2023 14:22:46 +0100
From: Bruno Haible <bruno@...sp.org>
To: musl@...ts.openwall.com
Subject: *printf %lc of L'\0'
Hi,
On 2023-03-21 I noticed a bug with %lc in most libcs:
<https://lists.gnu.org/archive/html/bug-gnulib/2023-03/msg00080.html>.
On 2023-03-28 Eric Blake opened a defect with POSIX, with the intent that
both ISO C and POSIX make the four *printf cases consistent:
<https://austingroupbugs.net/view.php?id=1647>
This issue was then submitted in the ISO C 23 ballot as GB-141,
and in the meeting from 2023-06-20 to 2023-06-23 it was decided upon:
<https://www.open-std.org/JTC1/sc22/wg14/www/docs/n3167.pdf>
page 23, 24. The decision ("option 1") is detailed in
<https://www.open-std.org/JTC1/sc22/wg14/www/docs/n3148.doc>:
"Option 1 (require a NUL) - change the text to:
If an l length modifier is present, the wint_t argument is converted
as if by a call to the wcrtomb function with a pointer to storage of
at least MB_CUR_MAX bytes, the wint_t argument converted to wchar_t,
and an initial shift state."
So, ISO C changed, and POSIX will follow suit.
The bug in most libcs is thus no longer a bug.
musl libc, which had it correct, now has a bug.
Test case:
===============================================================================
#include <stdio.h>
#include <string.h>
#include <wchar.h>
int main ()
{
{
char buf[12] = { 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD };
wchar_t two_nuls[2] = { 0, 0 };
int ret = snprintf (buf, 12, "a%lsz", two_nuls);
printf ("ret = %d, buf[0] = 0x%x, buf[1] = 0x%x, buf[2] = 0x%x, buf[3] = 0x%x\n",
ret,
(unsigned char) buf[0], (unsigned char) buf[1],
(unsigned char) buf[2], (unsigned char) buf[3]);
}
{
char buf[12] = { 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD, 0xDD };
int ret = snprintf (buf, 12, "a%lcz", 0);
printf ("ret = %d, buf[0] = 0x%x, buf[1] = 0x%x, buf[2] = 0x%x, buf[3] = 0x%x\n",
ret,
(unsigned char) buf[0], (unsigned char) buf[1],
(unsigned char) buf[2], (unsigned char) buf[3]);
}
return 0;
}
/*
glibc, *BSD, macOS, AIX, Solaris - all correct now:
ret = 2, buf[0] = 0x61, buf[1] = 0x7a, buf[2] = 0x0, buf[3] = 0xdd
ret = 3, buf[0] = 0x61, buf[1] = 0x0, buf[2] = 0x7a, buf[3] = 0x0
musl libc - now incorrect:
ret = 2, buf[0] = 0x61, buf[1] = 0x7a, buf[2] = 0x0, buf[3] = 0xdd
ret = 2, buf[0] = 0x61, buf[1] = 0x7a, buf[2] = 0x0, buf[3] = 0xdd
*/
===============================================================================
Best regards,
Bruno
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.