Can't ri_KeysEqual() consider two nulls as equal?

Started by Tom Laneover 18 years ago7 messages
#1Tom Lane
tgl@sss.pgh.pa.us

A recent discussion led me to the idea that FK triggers are fired
unnecessarily during an UPDATE if the foreign-key column(s) contain
any NULLs, because ri_KeysEqual() treats two nulls as unequal,
and therefore we conclude the row has changed when it has not.
I claim that both ri_KeysEqual() and ri_OneKeyEqual() could consider
two nulls to be equal. Furthermore it seems like ri_AllKeysUnequal()
should do so too; the case can't arise at the moment because the sole
caller already knows that one of the key sets contains no nulls, but
if this weren't so, the optimization would be actively wrong if we
concluded that two nulls were unequal.

Comments?

Also, I am wondering to what extent the ri_KeysEqual() calls in
ri_triggers.c are redundant, given that commands/trigger.c now has
the smarts to not even queue the trigger when those cases apply.

regards, tom lane

#2Simon Riggs
simon@2ndquadrant.com
In reply to: Tom Lane (#1)
Re: Can't ri_KeysEqual() consider two nulls as equal?

On Tue, 2007-04-17 at 17:16 -0400, Tom Lane wrote:

A recent discussion led me to the idea that FK triggers are fired
unnecessarily during an UPDATE if the foreign-key column(s) contain
any NULLs, because ri_KeysEqual() treats two nulls as unequal,
and therefore we conclude the row has changed when it has not.

FK trigger *can be optimised away* is true. No need to have a discussion
about whether NULL == NULL, but the critical test is: if I overwrote it,
would you be able to tell? The answer is No, so away it goes.

I claim that both ri_KeysEqual() and ri_OneKeyEqual() could consider
two nulls to be equal. Furthermore it seems like ri_AllKeysUnequal()
should do so too; the case can't arise at the moment because the sole
caller already knows that one of the key sets contains no nulls, but
if this weren't so, the optimization would be actively wrong if we
concluded that two nulls were unequal.

Comments?

Also, I am wondering to what extent the ri_KeysEqual() calls in
ri_triggers.c are redundant, given that commands/trigger.c now has
the smarts to not even queue the trigger when those cases apply.

Would be good to document that behaviour in ri_triggers.c - I was
completely misled when I looked at the code a while back. Probably more
than one thing can go.

--
Simon Riggs
EnterpriseDB http://www.enterprisedb.com

#3Stephan Szabo
sszabo@megazone.bigpanda.com
In reply to: Tom Lane (#1)
Re: Can't ri_KeysEqual() consider two nulls as equal?

On Tue, 17 Apr 2007, Tom Lane wrote:

A recent discussion led me to the idea that FK triggers are fired
unnecessarily during an UPDATE if the foreign-key column(s) contain
any NULLs, because ri_KeysEqual() treats two nulls as unequal,
and therefore we conclude the row has changed when it has not.
I claim that both ri_KeysEqual() and ri_OneKeyEqual() could consider
two nulls to be equal.

For ri_KeysEqual, I think so, since we actually aren't testing equality as
much as difference between the rows that might invalidate the constraint.
And, it does seem like with the code in trigger.c that the other checks in
the _upd functions in ri_triggers.c are redundant, but I'm vaguely afraid
I've forgotten something.

For ri_OneKeyEqual, I think like ri_AllKeysUnequal we know that the old
row doesn't have NULLs in the places it's currently called (although I
don't think this is commented). It seems like it should stay consistent
with ri_KeysEqual and that not putting the foo = NULL or foo = DEFAULT
seems better for the current calling cases besides.

Furthermore it seems like ri_AllKeysUnequal() should do so too; the case
can't arise at the moment because the sole caller already knows that one
of the key sets contains no nulls, but if this weren't so, the
optimization would be actively wrong if we concluded that two nulls were
unequal.

Hmm, probably so, although at least this does appear to be commented at
the calling site to mention that it's depending on the fact that there are
no NULLs.

#4Richard Huxton
dev@archonet.com
In reply to: Simon Riggs (#2)
Re: Can't ri_KeysEqual() consider two nulls as equal?

Simon Riggs wrote:

On Tue, 2007-04-17 at 17:16 -0400, Tom Lane wrote:

A recent discussion led me to the idea that FK triggers are fired
unnecessarily during an UPDATE if the foreign-key column(s) contain
any NULLs, because ri_KeysEqual() treats two nulls as unequal,
and therefore we conclude the row has changed when it has not.

FK trigger *can be optimised away* is true. No need to have a discussion
about whether NULL == NULL, but the critical test is: if I overwrote it,
would you be able to tell? The answer is No, so away it goes.

The test should perhaps be named "unchanged" rather than "equal".

--
Richard Huxton
Archonet Ltd

#5Stephan Szabo
sszabo@megazone.bigpanda.com
In reply to: Tom Lane (#1)
Re: Can't ri_KeysEqual() consider two nulls as equal?

On Tue, 17 Apr 2007, Tom Lane wrote:

A recent discussion led me to the idea that FK triggers are fired
unnecessarily during an UPDATE if the foreign-key column(s) contain
any NULLs, because ri_KeysEqual() treats two nulls as unequal,
and therefore we conclude the row has changed when it has not.
I claim that both ri_KeysEqual() and ri_OneKeyEqual() could consider
two nulls to be equal. Furthermore it seems like ri_AllKeysUnequal()
should do so too; the case can't arise at the moment because the sole
caller already knows that one of the key sets contains no nulls, but
if this weren't so, the optimization would be actively wrong if we
concluded that two nulls were unequal.

Do you have any suggestions for alternate names? Keeping them using Equal
seems to be dangerous since people would likely expect it to act like
normal equality (with nulls being different).

#6David Fetter
david@fetter.org
In reply to: Stephan Szabo (#5)
Re: Can't ri_KeysEqual() consider two nulls as equal?

On Wed, Apr 18, 2007 at 07:51:48AM -0700, Stephan Szabo wrote:

On Tue, 17 Apr 2007, Tom Lane wrote:

A recent discussion led me to the idea that FK triggers are fired
unnecessarily during an UPDATE if the foreign-key column(s)
contain any NULLs, because ri_KeysEqual() treats two nulls as
unequal, and therefore we conclude the row has changed when it has
not. I claim that both ri_KeysEqual() and ri_OneKeyEqual() could
consider two nulls to be equal. Furthermore it seems like
ri_AllKeysUnequal() should do so too; the case can't arise at the
moment because the sole caller already knows that one of the key
sets contains no nulls, but if this weren't so, the optimization
would be actively wrong if we concluded that two nulls were
unequal.

Do you have any suggestions for alternate names? Keeping them using
Equal seems to be dangerous since people would likely expect it to
act like normal equality (with nulls being different).

How about NotDistinct as in SQL's IS NOT DISTINCT FROM ?

Cheers,
D
--
David Fetter <david@fetter.org> http://fetter.org/
phone: +1 415 235 3778 AIM: dfetter666
Skype: davidfetter

Remember to vote!
Consider donating to PostgreSQL: http://www.postgresql.org/about/donate

#7Tom Lane
tgl@sss.pgh.pa.us
In reply to: Stephan Szabo (#5)
Re: Can't ri_KeysEqual() consider two nulls as equal?

Stephan Szabo <sszabo@megazone.bigpanda.com> writes:

On Tue, 17 Apr 2007, Tom Lane wrote:

I claim that both ri_KeysEqual() and ri_OneKeyEqual() could consider
two nulls to be equal.

Do you have any suggestions for alternate names? Keeping them using Equal
seems to be dangerous since people would likely expect it to act like
normal equality (with nulls being different).

I think Richard's suggestion of KeysUnchanged would work fine.

regards, tom lane