tsearch2 parser configuration

Started by Worky Workerson over 19 years ago, 3 messages, general

worky.workerson@gmail.com

over 19 years ago

Is it possible to configure the tsearch2 parser? I'd like a very dumb
parser that splits on everything that is not [a-zA-Z0-9.]. The
default parser seems to work well on my dataset except for the '/'
character ... it doesn't split mike/john into two lexemes. Any ideas?

Thanks!

Oleg Bartunov

oleg@sai.msu.su

over 19 years ago

In reply to: Worky Workerson (#1)

Re: tsearch2 parser configuration

http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/docs/HOWTO-parser-tsearch2.html

On Tue, 6 Feb 2007, Worky Workerson wrote:

Is it possible to configure the tsearch2 parser? I'd like a very dumb
parser that splits on everything that is not [a-zA-Z0-9.]. The
default parser seems to work well on my dataset except for the '/'
character ... it doesn't split mike/john into two lexemes. Any ideas?

Thanks!

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

Hannes Dorbath

light@theendofthetunnel.de

over 19 years ago

In reply to: Worky Workerson (#1)

Re: tsearch2 parser configuration

On 06.02.2007 21:39, Worky Workerson wrote:

Is it possible to configure the tsearch2 parser? I'd like a very dumb
parser that splits on everything that is not [a-zA-Z0-9.]. The
default parser seems to work well on my dataset except for the '/'
character ... it doesn't split mike/john into two lexemes. Any ideas?

Sure, that's ugly, but I'd just use:

SELECT to_tsvector(REPLACE('mike/john', '/', ' '));

and the same with to_tsquery();
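
To extend this workaround to the rule from the original question (split on everything outside [a-zA-Z0-9.]), one option might be regexp_replace() instead of REPLACE(). This is only a sketch: it assumes a PostgreSQL version that provides regexp_replace() with the 'g' flag, and the sample string is just the one from the question.

```sql
-- Index side: turn every run of characters outside [a-zA-Z0-9.] into a
-- single space before the text reaches the parser.
SELECT to_tsvector(regexp_replace('mike/john', '[^a-zA-Z0-9.]+', ' ', 'g'));

-- Query side: to_tsquery() expects an operator between terms, so each
-- separator run becomes ' & ' here rather than a plain space.
SELECT to_tsquery(regexp_replace('mike/john', '[^a-zA-Z0-9.]+', ' & ', 'g'));
```

Input that begins or ends with a separator would leave a dangling '&' in the query string, so it probably needs trimming first. The default parser still decides how to tokenize whatever remains, so this only controls where the extra splits happen.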

--
Regards,
Hannes Dorbath