Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20250629123903.530387f8801e0772126be9f2@zhasha.com>
Date: Sun, 29 Jun 2025 12:39:03 +0200
From: Joakim Sindholt <opensource@...sha.com>
To: musl@...ts.openwall.com
Cc: Alexander Monakov <amonakov@...ras.ru>
Subject: Re: Paintcans for reverse iterating strings

On Sun, 29 Jun 2025 10:58:11 +0300 (MSK), Alexander Monakov <amonakov@...ras.ru> wrote:
> 
> On Sat, 28 Jun 2025, Rich Felker wrote:
> 
> > One thing we're going to need for LC_COLLATE in locales where
> > second-level weights are applied in reverse order (diacritic marks
> > later in the string weigh more than earlier ones) is the ability to
> > traverse (& live transform to NFD) the input string in reverse.
> 
> Apologies if I'm forgetting some essential context, but what is the
> encoding of the input string, is it not always UTF-8? Reverse iteration
> of valid UTF-8 is easy (unless you need something more specific than
> just reverse iteration).

As I understand the problem it's not a matter of iterating UTF-8 in
reverse but the normalized NFD codepoint stream it transforms into in
reverse.

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.