Full Text Search

Started by PG Bug reporting formover 2 years ago3 messagesdocs
Jump to latest
#1PG Bug reporting form
noreply@postgresql.org

The following documentation comment has been logged on the website:

Page: https://www.postgresql.org/docs/14/textsearch-limitations.html
Description:

Hello,

In the FTS/Limitations part of the documentation, it says :

"Another example — the PostgreSQL mailing list archives contained 910,989
unique words with 57,491,343 lexemes in 461,020 messages."

How could the number of lexemes be greater than unique words ?

Thank you
Julien

#2David G. Johnston
david.g.johnston@gmail.com
In reply to: PG Bug reporting form (#1)
Re: Full Text Search

On Sun, Oct 29, 2023, 13:58 PG Doc comments form <noreply@postgresql.org>
wrote:

The following documentation comment has been logged on the website:

Page: https://www.postgresql.org/docs/14/textsearch-limitations.html
Description:

Hello,

In the FTS/Limitations part of the documentation, it says :

"Another example — the PostgreSQL mailing list archives contained 910,989
unique words with 57,491,343 lexemes in 461,020 messages."

How could the number of lexemes be greater than unique words ?

https://www.postgresql.org/docs/14/textsearch-parsers.html

Note the part with the hyphenated word example.

David J.

Show quoted text
#3Julien Champalbert
julien.champalbert@gmail.com
In reply to: David G. Johnston (#2)
Re: Full Text Search

Thank you David for your response, it makes sense now.

Julien

Show quoted text

Le 29 oct. 2023 à 22:02, David G. Johnston <david.g.johnston@gmail.com> a écrit :

On Sun, Oct 29, 2023, 13:58 PG Doc comments form <noreply@postgresql.org <mailto:noreply@postgresql.org>> wrote:
The following documentation comment has been logged on the website:

Page: https://www.postgresql.org/docs/14/textsearch-limitations.html <https://www.postgresql.org/docs/14/textsearch-limitations.html&gt;
Description:

Hello,

In the FTS/Limitations part of the documentation, it says :

"Another example — the PostgreSQL mailing list archives contained 910,989
unique words with 57,491,343 lexemes in 461,020 messages."

How could the number of lexemes be greater than unique words ?

https://www.postgresql.org/docs/14/textsearch-parsers.html <https://www.postgresql.org/docs/14/textsearch-parsers.html&gt;

Note the part with the hyphenated word example.

David J.