spelling errors in query terms

Started by Joel Rodriguesover 22 years ago6 messagesgeneral
Jump to latest
#1Joel Rodrigues
joelrodrigues@ifrance.com

Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.

Anyone know what I'm talking about ?

Cheers,
Joel

_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? T���l���charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1���re messagerie instantan���e de France

#2Craig O'Shannessy
craig@ucw.com.au
In reply to: Joel Rodrigues (#1)
Re: spelling errors in query terms

Hi Joel,

If your using java, and full text, fuzzy searching is an important part of
your app, you might want to check out Lucene. I evaluated it for a
project, and it was very good (the project was a fuzzy search of huge
pages of text).

http://jakarta.apache.org/lucene/docs/index.html

On Mon, 8 Dec 2003, Joel Rodrigues wrote:

Show quoted text

Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.

Anyone know what I'm talking about ?

Cheers,
Joel

_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? T�l�charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1�re messagerie instantan�e de France

---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

#3Oleg Bartunov
oleg@sai.msu.su
In reply to: Joel Rodrigues (#1)
Re: spelling errors in query terms

On Mon, 8 Dec 2003, Joel Rodrigues wrote:

Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.

You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
which uses trigram.

Anyone know what I'm talking about ?

Cheers,
Joel

_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? TО©╫lО©╫charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1О©╫re messagerie instantanО©╫e de France

---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83

#4Joel Rodrigues
joelrodrigues@ifrance.com
In reply to: Oleg Bartunov (#3)
Re: spelling errors in query terms

Thanks Oleg, I'll take a look at it. Too bad there is no documentation.

Also, there's something wrong with 'trgm.tgz'. This is what happens if I
try to extract it's contents :

gzip: stdin is encrypted -- get newer version of gzip
tar: End of archive volume 1 reached
tar: Sorry, unable to determine archive format.

Cheers ! - Joel

On Monday, December 8, 2003, at 05:39 , Oleg Bartunov wrote:

On Mon, 8 Dec 2003, Joel Rodrigues wrote:

Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.

You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
which uses trigram.

_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? T���l���charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1���re messagerie instantan���e de France

#5Joel Rodrigues
joelrodrigues@ifrance.com
In reply to: Oleg Bartunov (#3)
Re: spelling errors in query terms

Thanks Oleg, I'll take a look at it. Too bad there is no documentation.

Also, there's something wrong with 'trgm.tgz'. This is what happens if I
try to extract it's contents :

gzip: stdin is encrypted -- get newer version of gzip
tar: End of archive volume 1 reached
tar: Sorry, unable to determine archive format.

Cheers ! - Joel

On Monday, December 8, 2003, at 05:39 , Oleg Bartunov wrote:

On Mon, 8 Dec 2003, Joel Rodrigues wrote:

Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.

You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
which uses trigram.

_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? T���l���charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1���re messagerie instantan���e de France

#6Joel Rodrigues
joelrodrigues@ifrance.com
In reply to: Craig O'Shannessy (#2)
Re: spelling errors in query terms

Thanks for the tip. However I use Python with psycopg to interface with
PostgreSQL, so unfortunately it's not an option.

It's really strange, I could swear I used a function within PostgreSQL
that could handle small spelling errors in the query term.

Cheers,
Joel

On Monday, December 8, 2003, at 11:34 , Craig O'Shannessy wrote:

Hi Joel,

If your using java, and full text, fuzzy searching is an important part
of
your app, you might want to check out Lucene. I evaluated it for a
project, and it was very good (the project was a fuzzy search of huge
pages of text).

http://jakarta.apache.org/lucene/docs/index.html

_____________________________________________________________________
Envie de discuter en "live" avec vos amis ? T���l���charger MSN Messenger
http://www.ifrance.com/_reloc/m la 1���re messagerie instantan���e de France