Dave Burt <dave / burt.id.au> wrote:

> ry the code below, translated from 
> http://stuffofinterest.com/misc/utf8-about.html
> 
> There may be a potential problem matching over character boundaries, but I
> think UTF-8's unique starting bytes avoid the issue. So this should work.
> For long strings, it could be slow. If I wanted speed, I'd probably do the
> same thing in C and make it an extension.

thanks a lot this works great even with ligatures, i don't need speed
because i'll use that only for file names...
-- 
une bue