Subject: Re: Unicode in Regexp followup
From: Sean Russell <ser germane-software.com>
Date: Mon, 25 Mar 2002 16:38:40 +0900
References: 36608366753669436695
Yukihiro Matsumoto wrote:
> By the definition of UTF-8, 0x80-0xEF at the first byte of multibyte
> sequence are invalid, so Ruby treats them as if they are single byte
> characters.
Makes sense. Thanks!
--- SER