Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Date: Sun, 6 Jan 2013 13:10:02 +0100
From: magnum <>
Subject: Re: Markov UTF-8 magic

On 6 Jan, 2013, at 11:32 , Frank Dittrich <> wrote:
> Creating a really good UTF-8 validity checker is even somewhat more
> complicated, since you have to exclude illegal overlong sequences as
> well as invalid Unicode code points.
> See the discussion here (just one example):
> BTW: Here's a perl expression which checks for valid UTF-8, just in case
> we'll need one:
> May be we should google for a well-tested free C implementation which we
> can use.

I'm pretty sure the original lib I got our Unicode support from had a validity checker, I'll have a look at that. It's pretty trivial but if we try to invent the wheel we'll probably end up overlooking something.


Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.