MERGE vs REPLACE

petr@2ndquadrant.com

over 20 years ago

In reply to: Peter Eisentraut (#1)

Re: MERGE vs REPLACE

Peter Eisentraut wrote:

It seems to me that it has always been implicitly assumed around here
that the MERGE command would be a substitute for a MySQL-like REPLACE
functionality. After rereading the spec it seems that this is not the
case. MERGE always operates on two different tables, which REPLACE
doesn't do.

That said, what kind of support for insert-or-update-this-row do we want
to provide, if any? Should it be a REPLACE command, an extension of
the INSERT command, a modication of the MERGE syntax, or something
else?

MERGE of course, it's standard, REPLACE is mysql extension

--
Regards
Petr Jelinek (PJMODOS)
www.parba.cz

jcasanov@systemguards.com.ec

over 20 years ago

In reply to: Peter Eisentraut (#1)

Re: MERGE vs REPLACE

On 11/11/05, Peter Eisentraut <peter_e@gmx.net> wrote:

It seems to me that it has always been implicitly assumed around here
that the MERGE command would be a substitute for a MySQL-like REPLACE
functionality. After rereading the spec it seems that this is not the
case. MERGE always operates on two different tables, which REPLACE
doesn't do.

That said, what kind of support for insert-or-update-this-row do we want
to provide, if any? Should it be a REPLACE command, an extension of
the INSERT command, a modication of the MERGE syntax, or something
else?

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

MERGE seems to me the better option... not just because is standard
but at least i can see some use cases for it...

--
regards,
Jaime Casanova
(DBA: DataBase Aniquilator ;)

Peter Eisentraut

peter_e@gmx.net

over 20 years ago

In reply to: Jaime Casanova (#3)

Re: MERGE vs REPLACE

Jaime Casanova wrote:

MERGE seems to me the better option... not just because is standard
but at least i can see some use cases for it...

I don't think you understand my message: MERGE does not do what REPLACE
does.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: Peter Eisentraut (#1)

Re: MERGE vs REPLACE

Peter Eisentraut <peter_e@gmx.net> writes:

It seems to me that it has always been implicitly assumed around here
that the MERGE command would be a substitute for a MySQL-like REPLACE
functionality. After rereading the spec it seems that this is not the
case. MERGE always operates on two different tables, which REPLACE
doesn't do.

Normally I'd plump for following the standard ... but AFAIR, we have had
bucketloads of requests for REPLACE functionality, and not one request
for spec-compatible MERGE. If, as it appears, full-spec MERGE is also a
whole lot harder and slower than REPLACE, it seems that we could do
worse than to concentrate on doing REPLACE for now. (We can always come
back to MERGE some other day.)

regards, tom lane

jcasanov@systemguards.com.ec

over 20 years ago

In reply to: Peter Eisentraut (#4)

Re: MERGE vs REPLACE

On 11/11/05, Peter Eisentraut <peter_e@gmx.net> wrote:

Jaime Casanova wrote:

MERGE seems to me the better option... not just because is standard
but at least i can see some use cases for it...

I don't think you understand my message: MERGE does not do what REPLACE
does.

--
Peter Eisentraut
http://developer.postgresql.org/~petere/

I understand you well... what i was trying to say is that i prefer
MERGE (standard SQL command) to be done because the functionally it
has (basically a merge of two tables) seems to me to be more usefull
than REPLACE (MySql Command)...

--
regards,
Jaime Casanova
(DBA: DataBase Aniquilator ;)

Josh Berkus

josh@agliodbs.com

over 20 years ago

In reply to: Jaime Casanova (#6)

Re: MERGE vs REPLACE

Guys,

I understand you well... what i was trying to say is that i prefer
MERGE (standard SQL command) to be done because the functionally it
has (basically a merge of two tables) seems to me to be more usefull
than REPLACE (MySql Command)...

But even REPLACE requires predicate locking. There's no real way to get
around it.

--Josh

--
--Josh

Josh Berkus
Aglio Database Solutions
San Francisco

jcasanov@systemguards.com.ec

over 20 years ago

In reply to: Josh Berkus (#7)

Re: MERGE vs REPLACE

On 11/11/05, Josh Berkus <josh@agliodbs.com> wrote:

Guys,

I understand you well... what i was trying to say is that i prefer
MERGE (standard SQL command) to be done because the functionally it
has (basically a merge of two tables) seems to me to be more usefull
than REPLACE (MySql Command)...

But even REPLACE requires predicate locking. There's no real way to get
around it.

--Josh

why? seems that REPLACE only work if there are at least one row matching...

--
Atentamente,
Jaime Casanova
(DBA: DataBase Aniquilator ;)

Josh Berkus

josh@agliodbs.com

over 20 years ago

In reply to: Jaime Casanova (#8)

Re: MERGE vs REPLACE

Jaime,

why? seems that REPLACE only work if there are at least one row
matching...

Scenario:

session1: REPLACE .... 1
session2: REPLACE ..... 1
session1: check to see that "1" exists .... no
session2: check to see that "1" exists .... no
session1: INSERT 1
session2: INSERT 1 .... ERROR

Get the picture? The only way to avoid a race condition is to be able to
do "predicate locking", that is to lock the table against any data write
matching that predicate.

--
--Josh

Josh Berkus
Aglio Database Solutions
San Francisco

#10

Rod Taylor

rbt@rbt.ca

over 20 years ago

In reply to: Josh Berkus (#9)

Re: MERGE vs REPLACE

On Fri, 2005-11-11 at 14:40 -0800, Josh Berkus wrote:

Jaime,

why? seems that REPLACE only work if there are at least one row
matching...

Get the picture? The only way to avoid a race condition is to be able to
do "predicate locking", that is to lock the table against any data write
matching that predicate.

So? That is what save points are for. You can even skip the select for
update if you don't mind dead tuples from the attempted insert.

SELECT ... FOR UPDATE;
IF not exists THEN
SAVEPOINT;
INSERT ;
IF UNIQUE VIOLATION THEN
/* Someone else inserted between the SELECT and our INSERT */
ROLLBACK TO SAVEPOINT;
UPDATE;
ELSE
RELEASE SAVEPOINT;
FI
ELSE
UPDATE;
FI
--

#11

http://www.treehou.se/~swm/peter_merge.jpg

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: Josh Berkus (#7)

Re: MERGE vs REPLACE

Josh Berkus <josh@agliodbs.com> writes:

But even REPLACE requires predicate locking. There's no real way to get
around it.

The point though is that REPLACE is restricted to a type of predicate
narrow enough to be enforced through a unique-index mechanism, and so
it's implementable without solving the general case of predicate
locking.

Predicate locking for narrow cases isn't very hard; it's the general
case of arbitrary predicates that's hard.

regards, tom lane

#12

Gavin Sherry

swm@linuxworld.com.au

over 20 years ago

In reply to: Josh Berkus (#9)

Re: MERGE vs REPLACE

On Fri, 11 Nov 2005, Josh Berkus wrote:

Jaime,

why? seems that REPLACE only work if there are at least one row
matching...

Scenario:

session1: REPLACE .... 1
session2: REPLACE ..... 1
session1: check to see that "1" exists .... no
session2: check to see that "1" exists .... no
session1: INSERT 1
session2: INSERT 1 .... ERROR

Get the picture? The only way to avoid a race condition is to be able to
do "predicate locking", that is to lock the table against any data write
matching that predicate.

When it comes to predicate locking, I think we should defer to Peter's
comment at Open DB Con:

Gavin

#13

Mark Mielke

mark@mark.mielke.cc

over 20 years ago

In reply to: Rod Taylor (#10)

Re: MERGE vs REPLACE

On Fri, Nov 11, 2005 at 06:00:32PM -0500, Rod Taylor wrote:

So? That is what save points are for. You can even skip the select for
update if you don't mind dead tuples from the attempted insert.
SELECT ... FOR UPDATE;
IF not exists THEN
SAVEPOINT;
INSERT ;
IF UNIQUE VIOLATION THEN
/* Someone else inserted between the SELECT and our INSERT */
ROLLBACK TO SAVEPOINT;
UPDATE;
ELSE
RELEASE SAVEPOINT;
FI
ELSE
UPDATE;
FI

Isn't there still a race between INSERT and UPDATE?

Low probability, for sure, as it would have had to not exist, then
exist, then not exist, but still possible.

I'd like a REPLACE that could be safe, or at least cause a COMMIT to
fail, for this reason.

Cheers,
mark

--
mark@mielke.cc / markm@ncf.ca / markm@nortel.com __________________________
. . _ ._ . . .__ . . ._. .__ . . . .__ | Neighbourhood Coder
|\/| |_| |_| |/ |_ |\/| | |_ | |/ |_ |
| | | | | \ | \ |__ . | | .|. |__ |__ | \ |__ | Ottawa, Ontario, Canada

One ring to rule them all, one ring to find them, one ring to bring them all
and in the darkness bind them...

http://mark.mielke.cc/

#14

Rod Taylor

rbt@rbt.ca

over 20 years ago

In reply to: Mark Mielke (#13)

Re: MERGE vs REPLACE

On Fri, 2005-11-11 at 18:36 -0500, mark@mark.mielke.cc wrote:

On Fri, Nov 11, 2005 at 06:00:32PM -0500, Rod Taylor wrote:

So? That is what save points are for. You can even skip the select for
update if you don't mind dead tuples from the attempted insert.
SELECT ... FOR UPDATE;
IF not exists THEN
SAVEPOINT;
INSERT ;
IF UNIQUE VIOLATION THEN
/* Someone else inserted between the SELECT and our INSERT */
ROLLBACK TO SAVEPOINT;
UPDATE;
ELSE
RELEASE SAVEPOINT;
FI
ELSE
UPDATE;
FI

Isn't there still a race between INSERT and UPDATE?

I suppose there is although I hadn't noticed before. I've never run into
it and always check to ensure the expected number of tuples were touched
by the update or delete.

Within the PostgreSQL backend you might get away with having your insert
hold a lock on the index page and follow it up with a FOR UPDATE lock on
the offending tuple thus ensuring that your update will succeed. If you
hack index mechanisms for the support you don't need the SAVEPOINT
either -- just don't throw an error when you run across the existing
entry.

For client side code one possibility is to repeat until successful.

WHILE
SELECT FOR UPDATE;
IF NOT EXISTS THEN
SAVEPOINT
INSERT;
IF UNIQUE VIOLATION THEN
ROLLBACK TO SAVEPOINT;
ELSE
RELEASE SAVEPOINT
EXIT;
FI
ELSE
UPDATE;
EXIT;
END

-- Check for infinite loop
END

#15

Matteo Beccati

php@beccati.com

over 20 years ago

In reply to: Tom Lane (#5)

Re: MERGE vs REPLACE

Tom Lane wrote:

Peter Eisentraut <peter_e@gmx.net> writes:

It seems to me that it has always been implicitly assumed around here
that the MERGE command would be a substitute for a MySQL-like REPLACE
functionality. After rereading the spec it seems that this is not the
case. MERGE always operates on two different tables, which REPLACE
doesn't do.

Normally I'd plump for following the standard ... but AFAIR, we have had
bucketloads of requests for REPLACE functionality, and not one request
for spec-compatible MERGE. If, as it appears, full-spec MERGE is also a
whole lot harder and slower than REPLACE, it seems that we could do
worse than to concentrate on doing REPLACE for now. (We can always come
back to MERGE some other day.)

I would also like to add that MySQL's REPLACE is not exactly an INSERT
OR UPDATE, rather and INSERT OR (DELETE then INSERT): I mean that the
fields not specified in the query are set to their defaults:

i.e.

CREATE TABLE t (a int PRIMARY KEY, b int, c int);

INSERT INTO t (a, b, c) VALUES (1, 1, 2);

SELECT * FROM t;
+---+------+------+
| a | b | c |
+---+------+------+
| 1 | 1 | 2 |
+---+------+------+

REPLACE INTO t (a, b) VALUES (1, 1);

SELECT * FROM t;
+---+------+------+
| a | b | c |
+---+------+------+
| 1 | 1 | NULL |
+---+------+------+

I wanted to point it out this because people are commonly mistaking this.

Best regards
--
Matteo Beccati
http://phpadsnew.com
http://phppgads.com

#16

Robert Treat

xzilla@users.sourceforge.net

over 20 years ago

In reply to: Matteo Beccati (#15)

Re: MERGE vs REPLACE

On Saturday 12 November 2005 04:06, Matteo Beccati wrote:

Tom Lane wrote:

Peter Eisentraut <peter_e@gmx.net> writes:

It seems to me that it has always been implicitly assumed around here
that the MERGE command would be a substitute for a MySQL-like REPLACE
functionality. After rereading the spec it seems that this is not the
case. MERGE always operates on two different tables, which REPLACE
doesn't do.

Normally I'd plump for following the standard ... but AFAIR, we have had
bucketloads of requests for REPLACE functionality, and not one request
for spec-compatible MERGE. If, as it appears, full-spec MERGE is also a
whole lot harder and slower than REPLACE, it seems that we could do
worse than to concentrate on doing REPLACE for now. (We can always come
back to MERGE some other day.)

I would also like to add that MySQL's REPLACE is not exactly an INSERT
OR UPDATE, rather and INSERT OR (DELETE then INSERT): I mean that the
fields not specified in the query are set to their defaults:

i.e.

CREATE TABLE t (a int PRIMARY KEY, b int, c int);

INSERT INTO t (a, b, c) VALUES (1, 1, 2);

SELECT * FROM t;
+---+------+------+

| a | b | c |

+---+------+------+

| 1 | 1 | 2 |

+---+------+------+

REPLACE INTO t (a, b) VALUES (1, 1);

SELECT * FROM t;
+---+------+------+

| a | b | c |

+---+------+------+

| 1 | 1 | NULL |

+---+------+------+

I wanted to point it out this because people are commonly mistaking this.

Wow, that seems ugly.... maybe there's a reason for it, but I'm not sure we
could deviate from my$ql's behavior on this even if we wanted... they are the
"standard" here.

--
Robert Treat
Build A Brighter Lamp :: Linux Apache {middleware} PostgreSQL

#17

Gregory Maxwell

gmaxwell@gmail.com

over 20 years ago

In reply to: Robert Treat (#16)

Re: MERGE vs REPLACE

On 11/13/05, Robert Treat <xzilla@users.sourceforge.net> wrote:

On Saturday 12 November 2005 04:06, Matteo Beccati wrote:

| 1 | 1 | NULL |

Wow, that seems ugly.... maybe there's a reason for it, but I'm not sure we
could deviate from my$ql's behavior on this even if we wanted... they are the
"standard" here.

I don't think that's ugly, I think that's exactly working as
advertised. Replace behaves exactly like deleting the record with the
matching primary key and inserting the provided input. ... not merging
together old data with new.

#18

Robert Treat

xzilla@users.sourceforge.net

over 20 years ago

In reply to: Gregory Maxwell (#17)

Re: MERGE vs REPLACE

On Sunday 13 November 2005 10:01, Gregory Maxwell wrote:

On 11/13/05, Robert Treat <xzilla@users.sourceforge.net> wrote:

On Saturday 12 November 2005 04:06, Matteo Beccati wrote:

| 1 | 1 | NULL |

Wow, that seems ugly.... maybe there's a reason for it, but I'm not sure
we could deviate from my$ql's behavior on this even if we wanted... they
are the "standard" here.

I don't think that's ugly, I think that's exactly working as
advertised. Replace behaves exactly like deleting the record with the
matching primary key and inserting the provided input. ... not merging
together old data with new.

I disagree in that REPLACE is advertised as a solution for the INSERT else
UPDATE problem, but has a different behavior than a true INSERT else UPDATE
would produce. Maybe that's a problem with the implementation, or maybe
it's a problem in the advertisment, but there is certainly a discrepency
there.

--
Robert Treat
Build A Brighter Lamp :: Linux Apache {middleware} PostgreSQL

#19

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: Robert Treat (#18)

Re: MERGE vs REPLACE

Robert Treat <xzilla@users.sourceforge.net> writes:

I disagree in that REPLACE is advertised as a solution for the INSERT else
UPDATE problem, but has a different behavior than a true INSERT else UPDATE
would produce. Maybe that's a problem with the implementation, or maybe
it's a problem in the advertisment, but there is certainly a discrepency
there.

Yeah. REPLACE fails to solve common examples like a web hit counter
("if key doesn't exist, insert row with count 1; if it does exist,
add 1 to the current count").

IIRC, SQL's MERGE deals with this by offering two quite separate
specifications of what to do when there is or isn't already a matching
row.

I don't necessarily feel that we have to slavishly duplicate what MySQL
offers. I do think that it's reasonable to restrict the functionality
to updating/replacing a row with matching primary key --- that gets us
out of the problem of needing a full predicate-locking mechanism, while
still covering most all of the practical use-cases that I can see.

It'd be useful to look at what comparable functionality is offered by
other DBs besides MySQL. Anyone know what DB2 or Oracle have in this
area?

regards, tom lane

#20

petr@2ndquadrant.com

over 20 years ago

In reply to: Tom Lane (#19)

Re: MERGE vs REPLACE

Tom Lane wrote:

It'd be useful to look at what comparable functionality is offered by
other DBs besides MySQL. Anyone know what DB2 or Oracle have in this
area?

IIRC they both have MERGE.

--
Regards
Petr Jelinek (PJMODOS)

#21

Joshua D. Drake

jd@commandprompt.com

over 20 years ago

In reply to: Petr Jelinek (#20)

#22

Peter Eisentraut

peter_e@gmx.net

over 20 years ago

In reply to: Tom Lane (#19)

#23

petr@2ndquadrant.com

over 20 years ago

In reply to: Peter Eisentraut (#22)

#24

kleptog@svana.org

over 20 years ago

In reply to: Petr Jelinek (#23)

#25

Jochem van Dieten

jochemd@gmail.com

over 20 years ago

In reply to: Petr Jelinek (#23)

#26

Paolo Magnoli

pmagnoli@systemevolution.it

over 20 years ago

In reply to: Jochem van Dieten (#25)

#27

Jim.Nasby@BlueTreble.com

over 20 years ago

In reply to: Rod Taylor (#14)

#28

Jim.Nasby@BlueTreble.com

over 20 years ago

In reply to: Tom Lane (#5)

#29

Simon Riggs

simon@2ndQuadrant.com

over 20 years ago

In reply to: Martijn van Oosterhout (#24)

#30

Stephen Frost

sfrost@snowman.net

over 20 years ago

In reply to: Tom Lane (#5)

#31

Josh Berkus

josh@agliodbs.com

over 20 years ago

In reply to: Simon Riggs (#29)

#32

Mark Mielke

mark@mark.mielke.cc

over 20 years ago

In reply to: Josh Berkus (#31)

#33

jcasanov@systemguards.com.ec

over 20 years ago

In reply to: Josh Berkus (#31)

#34

Simon Riggs

simon@2ndQuadrant.com

over 20 years ago

In reply to: Josh Berkus (#31)

#35

bruce@momjian.us

over 20 years ago

In reply to: Josh Berkus (#9)

#36

bruce@momjian.us

over 20 years ago

In reply to: Tom Lane (#11)

#37

bruce@momjian.us

over 20 years ago

In reply to: Simon Riggs (#29)

#38

bruce@momjian.us

over 20 years ago

In reply to: Josh Berkus (#31)

#39

bruce@momjian.us

over 20 years ago

In reply to: Paolo Magnoli (#26)

#40

Jim.Nasby@BlueTreble.com

over 20 years ago

In reply to: Bruce Momjian (#36)

#41

Christopher Kings-Lynne

chriskl@familyhealth.com.au

over 20 years ago

In reply to: Jim Nasby (#40)

#42

Jim.Nasby@BlueTreble.com

over 20 years ago

In reply to: Christopher Kings-Lynne (#41)

#43

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: Christopher Kings-Lynne (#41)

#44

Rick Gigger

rick@alpinenetworking.com

over 20 years ago

In reply to: Tom Lane (#43)

#45

jcasanov@systemguards.com.ec

over 20 years ago

In reply to: Rick Gigger (#44)

#46

bruce@momjian.us

over 20 years ago

In reply to: Rick Gigger (#44)

#47

jcasanov@systemguards.com.ec

over 20 years ago

In reply to: Bruce Momjian (#46)

#48

bruce@momjian.us

over 20 years ago

In reply to: Jaime Casanova (#47)

#49

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: Jaime Casanova (#47)

#50

kleptog@svana.org

over 20 years ago

In reply to: Bruce Momjian (#46)

#51

bruce@momjian.us

over 20 years ago

In reply to: Martijn van Oosterhout (#50)

#52

Simon Riggs

simon@2ndQuadrant.com

over 20 years ago

In reply to: Martijn van Oosterhout (#50)

#53

kleptog@svana.org

over 20 years ago

In reply to: Bruce Momjian (#51)

#54

Dann Corbit

DCorbit@connx.com

over 20 years ago

In reply to: Martijn van Oosterhout (#53)

#55

kleptog@svana.org

over 20 years ago

In reply to: Dann Corbit (#54)

#56

Rick Gigger

rick@alpinenetworking.com

over 20 years ago

In reply to: Simon Riggs (#52)

#57

daveg

daveg@sonic.net

over 20 years ago

In reply to: Tom Lane (#43)

#58

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: daveg (#57)

#59

Zeugswetter Andreas SB SD

kleptog@svana.org

over 20 years ago

In reply to: Tom Lane (#58)

#60

Csaba Nagy

nagy@ecircle-ag.com

over 20 years ago

In reply to: Martijn van Oosterhout (#55)

#61

ZeugswetterA@spardat.at

over 20 years ago

In reply to: Csaba Nagy (#60)

#62

Csaba Nagy

nagy@ecircle-ag.com

over 20 years ago

In reply to: Zeugswetter Andreas SB SD (#61)

#63

kleptog@svana.org

over 20 years ago

In reply to: Csaba Nagy (#62)

#64

Csaba Nagy

nagy@ecircle-ag.com

over 20 years ago

In reply to: Martijn van Oosterhout (#63)

#65

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: Csaba Nagy (#64)

#66

Stephen Frost

sfrost@snowman.net

over 20 years ago

In reply to: Tom Lane (#65)

#67

Mark Mielke

mark@mark.mielke.cc

over 20 years ago

In reply to: Stephen Frost (#66)

#68

bruce@momjian.us

over 20 years ago

In reply to: Zeugswetter Andreas SB SD (#61)

#69

bruce@momjian.us

over 20 years ago

In reply to: Mark Mielke (#67)

#70

Zeugswetter Andreas SB SD

bruce@momjian.us

over 20 years ago

In reply to: Tom Lane (#65)

#71

Dennis Bjorklund

db@zigo.dhs.org

over 20 years ago

In reply to: Bruce Momjian (#68)

#72

ZeugswetterA@spardat.at

over 20 years ago

In reply to: Dennis Bjorklund (#71)

#73

bruce@momjian.us

over 20 years ago

In reply to: Dennis Bjorklund (#71)

#74

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: Bruce Momjian (#73)

#75

bruce@momjian.us

over 20 years ago

In reply to: Tom Lane (#74)

#76

Jim.Nasby@BlueTreble.com

over 20 years ago

In reply to: Bruce Momjian (#69)

#77

Jim.Nasby@BlueTreble.com

over 20 years ago

In reply to: Stephen Frost (#66)

#78

petr@2ndquadrant.com

over 20 years ago

In reply to: Jim Nasby (#77)

#79

jcasanov@systemguards.com.ec

over 20 years ago

In reply to: Petr Jelinek (#78)

#80

kleptog@svana.org

over 20 years ago

In reply to: Petr Jelinek (#78)

#81

bruce@momjian.us

over 20 years ago

In reply to: Jaime Casanova (#79)

#82

jcasanov@systemguards.com.ec

over 20 years ago

In reply to: Bruce Momjian (#81)

#83

Jim.Nasby@BlueTreble.com

over 20 years ago

In reply to: Martijn van Oosterhout (#80)

#84

petr@2ndquadrant.com

over 20 years ago

In reply to: Jaime Casanova (#79)

#85