tsearch2 & unicode

Started by Michal Hlavac almost 21 years ago | 3 messages | general
#1 Michal Hlavac
hlavki@hlavki.sk

Hello,

I have SUSE 9.3 with PostgreSQL 8.0.3 installed from RPM.
All my databases are Unicode, with locale sk_SK.UTF-8.
I installed the Slovak ispell dictionary, and tsearch2 works very well.

m13=# SELECT lexize('sk_ispell','Ivana');
lexize
--------------
{ivana,ivan}
(1 row)

On the other machine I have FreeBSD, also with PostgreSQL 8.0.3 and a
Unicode database. I installed the Slovak ispell dictionary there too, and
tsearch2 otherwise works very well, but the same query fails:
hlk=# SELECT lexize('sk_ispell','Ivana');
ERROR: Regex error in 'ia[t�ina$': brackets [] not balanced

Where is the problem? Thanks, miso

#2 Oleg Bartunov
oleg@sai.msu.su
In reply to: Michal Hlavac (#1)
Re: tsearch2 & unicode

Michal,

tsearch2 doesn't support UTF-8. It might work, though.
Full support for UTF-8 and other features are planned; see
http://www.sai.msu.su/~megera/oddmuse/index.cgi/todo

Oleg
On Tue, 6 Sep 2005, Michal Hlavac wrote:

> Hello,
>
> I have SUSE 9.3 with PostgreSQL 8.0.3 installed from RPM.
> All my databases are Unicode, with locale sk_SK.UTF-8.
> I installed the Slovak ispell dictionary, and tsearch2 works very well.
>
> m13=# SELECT lexize('sk_ispell','Ivana');
> lexize
> --------------
> {ivana,ivan}
> (1 row)
>
> On the other machine I have FreeBSD, also with PostgreSQL 8.0.3 and a
> Unicode database. I installed the Slovak ispell dictionary there too, and
> tsearch2 otherwise works very well, but the same query fails:
> hlk=# SELECT lexize('sk_ispell','Ivana');
> ERROR: Regex error in 'ia[t�ina$': brackets [] not balanced
>
> Where is the problem? Thanks, miso

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83

#3 Michal Hlavac
hlavki@hlavki.sk
In reply to: Oleg Bartunov (#2)
Re: tsearch2 & unicode

Oleg Bartunov wrote:

> Michal,
>
> tsearch2 doesn't support UTF-8. It might work, though.
> Full support for UTF-8 and other features are planned; see
> http://www.sai.msu.su/~megera/oddmuse/index.cgi/todo

Hello Oleg,

Thanks for the reply. Is there a timeline for the new release of tsearch2?

miso