![]() |
|
Message-ID: <d3b94e19-7754-48f0-60f7-28ae19009d3f@mirbsd.de> Date: Thu, 19 Jun 2025 00:42:50 +0200 (CEST) From: Thorsten Glaser <tg@...bsd.de> To: musl@...ts.openwall.com cc: Pablo Correa Gomez <pabloyoyoista@...tmarketos.org> Subject: Re: Planned locale work and community thoughts On Wed, 18 Jun 2025, Rich Felker wrote: >Theoretically it's possible the textual grep missed things if there is >inconsistent json formatting anywhere, so if anyone familiar with jq >wants to conduct a search using it instead to confirm, go ahead. I My jq-foo is not very good, but I managed this: tg@...p:/tmp/u/cldr-numbers-full/main $ cat */numbers.json | jq 'paths(.decimal?|scalars) as $p | [">>" + ($p | join(".")), getpath($p).decimal]' | sed 's/">>/>>/' | grep -e '^ "[^.,]"' -e '^ ".[^"]' | uniq "٫" So yes, U+066B is the only other one, and no multi-char ones. tg@...p:/tmp/u/cldr-numbers-full/main $ cat */numbers.json | jq 'paths(.decimal?|scalars) as $p | [">>" + ($p | join(".")), getpath($p).decimal]' | sed 's/">>/>>/' | grep -B 1 -e '^ "[^.,]"' -e '^ ".[^"]' … shows all the occurrences, but a quick filter shows that we have both symbols-numberSystem-arabext and symbols-numberSystem-arab but assuming both are out of scope… tg@...p:/tmp/u/cldr-numbers-full/main $ cat */numbers.json | jq 'paths(.decimal?|scalars) as $p | [">>" + ($p | join(".")), getpath($p).decimal]' | sed 's/">>/>>/' | grep -B 1 -e '^ "[^.,]"' -e '^ ".[^"]' | fgrep '>>' | fgrep -v -e '.symbols-numberSystem-arabext"' -e '.symbols-numberSystem-arab"' >>main.bgn-AE.numbers.symbols-numberSystem-latn", >>main.bgn-AF.numbers.symbols-numberSystem-latn", >>main.bgn-IR.numbers.symbols-numberSystem-latn", >>main.bgn-OM.numbers.symbols-numberSystem-latn", >>main.bgn.numbers.symbols-numberSystem-latn", … leaves us with this; bgn/numbers.json examplary: { "main": { "bgn": { "numbers": { "symbols-numberSystem-arabext": { "decimal": "٫", "group": "٬", "list": "؛", … }, "symbols-numberSystem-latn": { "decimal": "٫", "group": "،", "list": ";", … So, if the bgn locales are ever going to be relevant… unsure what that exactly is, but my acronyms database says… [ISO 639-3] Western Balochi (cf. bal) … which seems to fit. bye, //mirabilos -- <ch> you introduced a merge commit │<mika> % g rebase -i HEAD^^ <mika> sorry, no idea and rebasing just fscked │<mika> Segmentation <ch> should have cloned into a clean repo │ fault (core dumped) <ch> if I rebase that now, it's really ugh │<mika:#grml> wuahhhhhh
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.