Openwall GNU/*/Linux - a small security-enhanced Linux distro for servers
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Date: Sun, 6 Jan 2013 13:10:02 +0100
From: magnum <>
Subject: Re: Markov UTF-8 magic

On 6 Jan, 2013, at 11:32 , Frank Dittrich <> wrote:
> Creating a really good UTF-8 validity checker is even somewhat more
> complicated, since you have to exclude illegal overlong sequences as
> well as invalid Unicode code points.
> See the discussion here (just one example):
> BTW: Here's a perl expression which checks for valid UTF-8, just in case
> we'll need one:
> May be we should google for a well-tested free C implementation which we
> can use.

I'm pretty sure the original lib I got our Unicode support from had a validity checker, I'll have a look at that. It's pretty trivial but if we try to invent the wheel we'll probably end up overlooking something.


Powered by blists - more mailing lists

Your e-mail address:

Powered by Openwall GNU/*/Linux - Powered by OpenVZ