BUG #3525: Lithuanian characters doesn't work in queries with regular expressions
The following bug has been logged online:
Bug reference: 3525
Logged by: Rolandas Rudomanskis
Email address: rolandasr@gmail.com
PostgreSQL version: 8.2.3
Operating system: Debian Etch
Description: Lithuanian characters doesn't work in queries with
regular expressions
Details:
The database is set to unicode. Debian locale is utf-8.
All lithuanian characters characters, except ė, ų, namely,
ą, č, ę, į, š, ū, ž in non-case sensitive queries specifying word
beginning, are ignored. SQL example:
select * from table1 where title ~* '\\m" + mytext + "';"
"Rolandas Rudomanskis" <rolandasr@gmail.com> writes:
Description: Lithuanian characters doesn't work in queries with
regular expressions
The database is set to unicode. Debian locale is utf-8.
Yeah, the regex code doesn't handle localization issues (such as
case-insensitive matching) correctly for multibyte characters.
Can you use a single-byte encoding? Alternatively, I believe
case-insensitive LIKE matching works for this.
regards, tom lane