Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <ECF3C79A-6A8F-4DD0-820D-6FD5D056C7B4@aevum.de>
Date: Thu, 29 May 2025 11:45:01 +0200
From: Nick Wellnhofer <wellnhofer@...um.de>
To: musl@...ts.openwall.com
Subject: Re: Collation, IDN, and Unicode normalization

On May 29, 2025, at 04:37, Rich Felker <dalias@...c.org> wrote:
> Top-level table (indexed by codepoint>>8) to select a table: 1 byte
> per entry, for 512 bytes.
> 
> Second-level tables (indexed by codepoint&255):

You could also try different bit shifts that might yield smaller tables. Another option to compress the data further is to use third-level tables. I have some old code somewhere that brute-forces all combinations of shift values to find the smallest tables.

Nick

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.