Ben bulabilirim sadece yararlı bilgiler hangi devletler, this page of the manual dan:
A "word" character is any letter or
digit or the underscore character,
that is, any character which can be
part of a Perl "word". The definition
of letters and digits is controlled by
PCRE's character tables, and may vary
if locale-specific matching is taking
place. For example, in the "fr"
(French) locale, some character codes
greater than 128 are used for accented
letters, and these are matched by \w.
Yine de, istediğiniz gibi çalışıyor bahse olmaz ...
Ama, emin olmak için:
- belki kullanan unicode matching daha iyi olurdu
- Muhtemelen emin olmak için denemek gerekecek ...
Unicode hakkında, manuel bu diyor:
Matching characters by Unicode
property is not fast, because PCRE has
to search a structure that contains
data for over fifteen thousand
characters. That is why the
traditional escape sequences such as
\d and \w do not use Unicode
properties in PCRE.
Yani, daha güvenli bir çözüm olabilir ... bu konuda meraklı, ben ^ ^ eklemek gerekir