Substring function incorrect when searching for '@.'
To reproduce:
create a table with a data type of varchar (50) and name it email
insert into this table the following values:
test1@anyemail.com
Execute the following statement:
SELECT CASE WHEN count(substring(email FROM '@.')) > 0 THEN count(substring(email FROM '@.')) ELSE 0 END, email FROM your_schema.your_table GROUP BY email;
Result with be equal to 1 / True. It should be 0 / False.
If you execute the above but replace '@.' with '@a' it will also return 1 / True is correct.
The only time it fails for me is if the @ is immediately followed by a period.
7.3.4 using psql via pgadminIII under cygwin.
Mike
"Mike G." <mike@thegodshalls.com> writes:
create a table with a data type of varchar (50) and name it email
insert into this table the following values:
test1@anyemail.com
Execute the following statement:
SELECT CASE WHEN count(substring(email FROM '@.')) > 0 THEN count(substring(email FROM '@.')) ELSE 0 END, email FROM your_schema.your_table GROUP BY email;
Result with be equal to 1 / True. It should be 0 / False.
This is not a bug; it's a POSIX regular expression match, and it's
behaving exactly as it should ('.' matches any character).
The particular syntax substring(char-expression FROM char-expression)
is not defined by SQL99 --- their regular-expression construct requires
a third parameter (ESCAPE something). We have chosen to interpret it
as a POSIX regular-expression match. See
http://www.postgresql.org/docs/7.3/static/functions-matching.html
regards, tom lane