----- Original Message -----
From: "Aristarkh A Zagorodnikov" <xm / bolotov-team.ru>
To: "ruby-talk ML" <ruby-talk / ruby-lang.org>
Sent: Thursday, October 04, 2001 12:15 PM
Subject: [ruby-talk:22069] Re: writing UTF-8 strings


>
> > -----Original Message-----
> > From: matz / ev.netlab.jp [mailto:matz / ev.netlab.jp]On Behalf Of
> > Yukihiro Matsumoto
> > Sent: Thursday, October 04, 2001 2:05 PM
> > To: ruby-talk ML
> > Subject: [ruby-talk:22068] Re: writing UTF-8 strings
> >
> >
> > Hi,
> >
> > In message "[ruby-talk:22067] Re: writing UTF-8  strings"
> >     on 01/10/04, Tobias Reif <tobiasreif / pinkjuice.com> writes:
> >
> > |So what do I declare?:
> > |
> > |<?xml version="1.0" encoding="UTF-8" standalone="no" ?>
> >
> > I'm not good at XML, but I know you can call your ASCII file as UTF-8.
> >
> > |Is there a difference between an ASCII string and a UTF-8 string with
> > |only ascii region characters?
> >
> > No.
> >
> > |With Ruby, would use UTF-8 characters, serve ASCII, and declare
> > it's UTF-8?
> >
> > I'm not sure what you meant by "serve ASCII", but there's nothing
> > prevent you from using UTF-8 in your Ruby scripts, if you can edit
> > UTF-8 files.
> Tobias, I recommend you to visist http://www.czyborra.com/utf/#UTF-8
> This link describes UTF-8 very good, including pointing out that UTF-8 is
> 100% ASCII compatible if only ASCII chars are used.
> Also this page covers different Unicode-based formats too in a vary simple
> and understandable manner.
>
> P.S. Unicode rocks - everyone statrt using it immediately! ;))
> But the size of translation tables make shivers scatter all over m back
:))

In the unicode book ch 5.1 they recommend 'multistage tables' for reducing
size:

http://www.unicode.org/unicode/uni2book/ch05.pdf

Don't know if that helps.

:-)!

/He