tsearch2 and parsing host strings

Started by Laimonas Simutis over 18 years ago, 2 messages, general
#1 Laimonas Simutis
laimis@gmail.com

A question related to tsearch2 functionality in postgres:

When I run the following query:

select to_tsvector('default', 'website.com')

I get "'website.com':1".

What I need to get back is 'website':1 instead. I can see that the parser
correctly identifies the term website.com as a host token, which is then routed
(in my configuration, and I believe in the default one) to the 'default'
dictionary (en_stem for me). Has anyone written a special dictionary for cases
just like the above, so that I could change pg_ts_cfgmap to map host tokens to
that special dictionary?

Is there any way I can accomplish this with tsearch2?
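
For reference, the token type and dictionary mapping described above can be inspected roughly as follows. This is only a sketch: it assumes the tsearch2 contrib module is installed and that the configuration is named 'default', as in the query above.

```sql
-- Show how the parser tokenizes the input and which dictionaries
-- handle each token (ts_debug uses the current default configuration).
select * from ts_debug('website.com');

-- List the dictionaries currently mapped to host tokens.
-- The configuration name 'default' is an assumption taken from the query above.
select ts_name, tok_alias, dict_name
from pg_ts_cfgmap
where ts_name = 'default'
  and tok_alias = 'host';
```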

Thanks,

Laimis

#2 Oleg Bartunov
oleg@sai.msu.su
In reply to: Laimonas Simutis (#1)
Re: tsearch2 and parsing host strings

On Tue, 11 Sep 2007, Laimonas Simutis wrote:

> A question related to tsearch2 functionality in postgres:
>
> When I run the following query:
>
> select to_tsvector('default', 'website.com')
>
> I get "'website.com':1".
>
> What I need to get back is 'website':1 instead. I can see that the parser
> correctly identifies the term website.com as a host token, which is then routed
> (in my configuration, and I believe in the default one) to the 'default'
> dictionary (en_stem for me). Has anyone written a special dictionary for cases
> just like the above, so that I could change pg_ts_cfgmap to map host tokens to
> that special dictionary?
>
> Is there any way I can accomplish this with tsearch2?

Check my reply about pg_regex dictionary. Simple regex will save you.
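
A rough sketch of the idea, not the actual pg_regex setup (which this thread does not show): the first statement illustrates the kind of rewrite a regex would perform, and the second routes host tokens to a regex-based dictionary. The name 'host_regex' is hypothetical, and such a dictionary would first have to be registered in pg_ts_dict.

```sql
-- The rewrite itself: strip the final ".tld" from a host name.
-- [.] is used instead of \. to avoid string-escaping differences.
-- For 'website.com' this yields 'website'.
select regexp_replace('website.com', '[.][a-z]+$', '');

-- Route host tokens to the regex-based dictionary.
-- 'host_regex' is a hypothetical dictionary name, and 'default' is the
-- configuration name assumed from the original query.
update pg_ts_cfgmap
set dict_name = '{host_regex}'
where ts_name = 'default'
  and tok_alias = 'host';
```

Note that a pattern this simple removes only the last label, so 'www.website.com' would become 'www.website'.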

> Thanks,
>
> Laimis

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83