Date: Sun, 6 Jan 2013 11:32:07 +0100
From: Frank Dittrich
Subject: Re: Markov UTF-8 magic

Hi magnum,

I wasn't fully awake (not enough coffee) when I sent my previous mail.
I hope you can still parse most of it.

Creating a really good UTF-8 validity checker is somewhat more
complicated, since you have to reject illegal overlong sequences as
well as invalid Unicode code points (the UTF-16 surrogates
U+D800..U+DFFF and anything above U+10FFFF).
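
To make those extra checks concrete, here is a minimal sketch in C of
what such a validator might look like, assuming the rules of RFC 3629.
The function name is_valid_utf8 is hypothetical and not taken from
John the Ripper, and this is untuned illustration code, not the
well-tested implementation we would actually want to use:

```c
#include <stddef.h>

/*
 * Hypothetical helper (not part of John the Ripper).
 * Returns 1 if buf[0..len) is well-formed UTF-8 per RFC 3629, else 0.
 */
int is_valid_utf8(const unsigned char *buf, size_t len)
{
    size_t i = 0;

    while (i < len) {
        unsigned char c = buf[i];
        unsigned long cp;   /* decoded code point */
        unsigned long min;  /* smallest code point allowed for this length */
        size_t n, k;        /* n = number of continuation bytes expected */

        if (c < 0x80) {                     /* plain ASCII */
            i++;
            continue;
        } else if ((c & 0xE0) == 0xC0) {    /* 2-byte sequence */
            n = 1; cp = c & 0x1F; min = 0x80;
        } else if ((c & 0xF0) == 0xE0) {    /* 3-byte sequence */
            n = 2; cp = c & 0x0F; min = 0x800;
        } else if ((c & 0xF8) == 0xF0) {    /* 4-byte sequence */
            n = 3; cp = c & 0x07; min = 0x10000;
        } else {
            return 0;   /* stray continuation byte, or 0xF8..0xFF */
        }

        if (len - i <= n)
            return 0;   /* sequence is truncated */

        for (k = 1; k <= n; k++) {
            if ((buf[i + k] & 0xC0) != 0x80)
                return 0;   /* not a continuation byte */
            cp = (cp << 6) | (buf[i + k] & 0x3F);
        }

        if (cp < min)
            return 0;   /* overlong encoding */
        if (cp >= 0xD800 && cp <= 0xDFFF)
            return 0;   /* UTF-16 surrogate, not a valid scalar value */
        if (cp > 0x10FFFF)
            return 0;   /* beyond the Unicode range */

        i += n + 1;
    }
    return 1;
}
```

With this sketch, the overlong encoding "\xC0\xAF" of '/' and the
encoded surrogate "\xED\xA0\x80" are both rejected, while "\xE2\x82\xAC"
(the euro sign) is accepted.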

See the discussion here (just one example):

BTW: Here's a perl expression which checks for valid UTF-8, just in case
we'll need one:

Maybe we should google for a well-tested free C implementation which we
can use.

