BUG #3525: Lithuanian characters doesn't work in queries with regular expressions

Started by Rolandas Rudomanskisover 18 years ago2 messagesbugs
Jump to latest
#1Rolandas Rudomanskis
rolandasr@gmail.com

The following bug has been logged online:

Bug reference: 3525
Logged by: Rolandas Rudomanskis
Email address: rolandasr@gmail.com
PostgreSQL version: 8.2.3
Operating system: Debian Etch
Description: Lithuanian characters doesn't work in queries with
regular expressions
Details:

The database is set to unicode. Debian locale is utf-8.

All lithuanian characters characters, except ė, ų, namely,
ą, č, ę, į, š, ū, ž in non-case sensitive queries specifying word
beginning, are ignored. SQL example:

select * from table1 where title ~* '\\m" + mytext + "';"

#2Tom Lane
tgl@sss.pgh.pa.us
In reply to: Rolandas Rudomanskis (#1)
Re: BUG #3525: Lithuanian characters doesn't work in queries with regular expressions

"Rolandas Rudomanskis" <rolandasr@gmail.com> writes:

Description: Lithuanian characters doesn't work in queries with
regular expressions

The database is set to unicode. Debian locale is utf-8.

Yeah, the regex code doesn't handle localization issues (such as
case-insensitive matching) correctly for multibyte characters.
Can you use a single-byte encoding? Alternatively, I believe
case-insensitive LIKE matching works for this.

regards, tom lane