--oLBj+sq0vYjzfsbl
Content-Type: text/plain; charset=utf-8
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

Andreas S.:

> Apparently in 1.9 \p{Word} can be used instead of \w to match unicode
> characters; however I did not find any documentation for this ("word"
> it's not a unicode character category).

You can also use the (documneted) \p{L} property:

chastell@devielle:~$ ruby -ve "p '=E2=84=89=C3=BC=C3=BC=C3=BC' =3D~ /\p{L}/"
ruby 1.9.2p0 (2010-08-18 revision 29036) [x86_64-linux]
1

BTW: I find http://www.regular-expressions.info/unicode.html#prop
most useful.

=E2=80=94 Piotr Szotkowski
--=20
I should like to find the person who decided that since
=E2=80=98bookmarks=E2=80=99 and =E2=80=98history=E2=80=99 were both lists o=
f URLs they
ought to be integrated in a single database. I should like
to shake him warmly by the throat until his head comes off.
             [Roger Burton West on Firefox, hates-software]




--oLBj+sq0vYjzfsbl
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: Digital signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)

iEYEARECAAYFAkynOdcACgkQi/mCfdEo8UqcggCffw4ucGoC7EBwGv/GpdrKOmlp
ZCsAoLfZVJIOWQ7+62IzSZzrM7WAzmpl
=bqze
-----END PGP SIGNATURE-----

--oLBj+sq0vYjzfsbl--