BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

Started by Nonamealmost 17 years ago13 messages
#1Noname
xuan--2009.03--submitbug--support--postgresql.org@baldauf.org

The following bug has been logged online:

Bug reference: 4689
Logged by:
Email address:
xuan--2009.03--submitbug--support--postgresql.org@baldauf.org
PostgreSQL version: 8.3.5
Operating system: Linux 2.6.18-6-amd64
Description: Expanding the length of a VARCHAR column should not
induce a table rewrite
Details:

Suppose there is a table "sometable" with a column "somecolumn" of type
"VARCHAR(5)".

This table as many rows.

When executing
"ALTER TABLE sometable ALTER COLUMN somecolumn TYPE VARCHAR(7)", the whole
table is re-written, and this rewrite takes many hours. During these hours,
all writers on this table stall, making the database effectively
unavailable.

However, in almost all cases, there is no need for such relaxing of limits
to require a table rewrite.

So the expected run time needed for this statement is about one second, the
actual run time needed for this statement is many hours.

#2Heikki Linnakangas
heikki.linnakangas@enterprisedb.com
In reply to: Noname (#1)
Re: BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

xuan--2009.03--submitbug--support--postgresql.org@baldauf.org wrote:

When executing
"ALTER TABLE sometable ALTER COLUMN somecolumn TYPE VARCHAR(7)", the whole
table is re-written, and this rewrite takes many hours. During these hours,
all writers on this table stall, making the database effectively
unavailable.

However, in almost all cases, there is no need for such relaxing of limits
to require a table rewrite.

While this isn't a bug, it's a reasonable feature request. I've added
this to the TODO page: http://wiki.postgresql.org/wiki/Todo#ALTER

Patches are welcome.

--
Heikki Linnakangas
EnterpriseDB http://www.enterprisedb.com

#3Guillaume Smet
guillaume.smet@gmail.com
In reply to: Heikki Linnakangas (#2)
Re: [BUGS] BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

On Wed, Mar 4, 2009 at 11:06 AM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:

While this isn't a bug, it's a reasonable feature request. I've added this
to the TODO page: http://wiki.postgresql.org/wiki/Todo#ALTER

Patches are welcome.

I remember someone proposed a patch for that a long time ago:
http://archives.postgresql.org/pgsql-patches/2006-10/msg00154.php

AFAICS there were 2 possible follow-ups to this patch:
- only manage the case of char/varchar (which is the most common case anyway);
- develop a complete infrastructure to handle all the cases as
suggested by Gregory.

--
Guillaume

#4Peter Eisentraut
peter_e@gmx.net
In reply to: Heikki Linnakangas (#2)
Re: BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

Heikki Linnakangas wrote:

xuan--2009.03--submitbug--support--postgresql.org@baldauf.org wrote:

When executing
"ALTER TABLE sometable ALTER COLUMN somecolumn TYPE VARCHAR(7)", the
whole
table is re-written, and this rewrite takes many hours. During these
hours,
all writers on this table stall, making the database effectively
unavailable.

However, in almost all cases, there is no need for such relaxing of
limits
to require a table rewrite.

While this isn't a bug, it's a reasonable feature request. I've added
this to the TODO page: http://wiki.postgresql.org/wiki/Todo#ALTER

Patches are welcome.

The question is how you want to implement this in a data type
independent fashion. You can't assume that increasing the typmod is a
noop for all data types.

#5Guillaume Smet
guillaume.smet@gmail.com
In reply to: Peter Eisentraut (#4)
Re: BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

On Wed, Mar 4, 2009 at 11:50 AM, Peter Eisentraut <peter_e@gmx.net> wrote:

The question is how you want to implement this in a data type independent
fashion.  You can't assume that increasing the typmod is a noop for all data
types.

Sure. See my previous answer on -hackers (I don't think this
discussion belong to -bugs) and especially the discussion in the
archives about Jonas' patch.

--
Guillaume

#6Matteo Beccati
php@beccati.com
In reply to: Guillaume Smet (#5)
Re: [HACKERS] Re: BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

Guillaume Smet ha scritto:

On Wed, Mar 4, 2009 at 11:50 AM, Peter Eisentraut <peter_e@gmx.net> wrote:

The question is how you want to implement this in a data type independent
fashion. You can't assume that increasing the typmod is a noop for all data
types.

Sure. See my previous answer on -hackers (I don't think this
discussion belong to -bugs) and especially the discussion in the
archives about Jonas' patch.

I recently had a similar problem when I added some domains to the
application. ALTER TABLE ... TYPE varchar_dom was leading to a full
table rewrite even though the underlying type definition were exactly
the same (i.e. varchar(64)). I can live with it, but I suppose this fix
might be related to the varlen one.

Cheers

--
Matteo Beccati

OpenX - http://www.openx.org

#7Kevin Grittner
Kevin.Grittner@wicourts.gov
In reply to: Matteo Beccati (#6)
Re: Expanding the length of a VARCHAR column should not induce a table rewrite

Took bugs list off, as this is clearly not a bug.

Matteo Beccati <php@beccati.com> wrote:

I recently had a similar problem when I added some domains to the
application. ALTER TABLE ... TYPE varchar_dom was leading to a full
table rewrite even though the underlying type definition were

exactly

the same (i.e. varchar(64)). I can live with it, but I suppose this

fix

might be related to the varlen one.

Been there. We also occasionally increase the length of a
verchar-based domain. The process could be made faster and more
convenient by avoiding rewrites when possible. On particularly large
tables I've sometimes ventured into direct updates to the system
tables for these.

-Kevin

#8Jaime Casanova
jcasanov@systemguards.com.ec
In reply to: Matteo Beccati (#6)
Re: [HACKERS] Re: BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

On Thu, Mar 5, 2009 at 10:47 AM, Matteo Beccati <php@beccati.com> wrote:

Guillaume Smet ha scritto:

On Wed, Mar 4, 2009 at 11:50 AM, Peter Eisentraut <peter_e@gmx.net> wrote:

The question is how you want to implement this in a data type independent
fashion.  You can't assume that increasing the typmod is a noop for all data
types.

Sure. See my previous answer on -hackers (I don't think this
discussion belong to -bugs) and especially the discussion in the
archives about Jonas' patch.

I recently had a similar problem when I added some domains to the
application. ALTER TABLE ... TYPE varchar_dom was leading to a full
table rewrite even though the underlying type definition were exactly
the same (i.e. varchar(64)). I can live with it, but I suppose this fix
might be related to the varlen one.

ALTER TABLE ... TYPE does cause a table rewrite even if new_type =
old_type, and that is actually useful...
for example when you add a fillfactor to an existing table that
fillfactor will not affect the existing data until you rewrite the
table and a convenient way is exactly using ALTER TABLE ... TYPE.

now, back to the problem... is not easier to define a column as TEXT
and to put a check to constraint the length? if you wanna change the
constraint that will be almost free

--
Atentamente,
Jaime Casanova
Soporte y capacitación de PostgreSQL
Asesoría y desarrollo de sistemas
Guayaquil - Ecuador
Cel. +59387171157

#9Kevin Grittner
Kevin.Grittner@wicourts.gov
In reply to: Jaime Casanova (#8)
Re: Re: [BUGS] BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

Jaime Casanova <jcasanov@systemguards.com.ec> wrote:

ALTER TABLE ... TYPE does cause a table rewrite even if new_type =
old_type, and that is actually useful...
for example when you add a fillfactor to an existing table that
fillfactor will not affect the existing data until you rewrite the
table and a convenient way is exactly using ALTER TABLE ... TYPE.

I find that to be exactly as useful as it would be to have a table
rewrite if I added a new null-capable column, and somewhat less useful
than it would be have a table rewrite on dropping a column.
Maintaining the function of this clever trick should not be the basis
of imposing a burden on relatively common maintenance operations.

now, back to the problem... is not easier to define a column as TEXT
and to put a check to constraint the length? if you wanna change the
constraint that will be almost free

Thanks for the interesting suggestion. I'm not sure I'd want to go
there for various reasons; but even if I wanted to go that route, how
would I modify that constraint without causing the whole table to be
scanned for compliance?

-Kevin

#10Jaime Casanova
jcasanov@systemguards.com.ec
In reply to: Kevin Grittner (#9)
Re: Re: [BUGS] BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

On Thu, Mar 5, 2009 at 2:46 PM, Kevin Grittner
<Kevin.Grittner@wicourts.gov> wrote:

now, back to the problem... is not easier to define a column as TEXT
and to put a check to constraint the length? if you wanna change the
constraint that will be almost free

Thanks for the interesting suggestion.  I'm not sure I'd want to go
there for various reasons; but even if I wanted to go that route, how
would I modify that constraint without causing the whole table to be
scanned for compliance?

the table will be scanned but not rewritten

--
Atentamente,
Jaime Casanova
Soporte y capacitación de PostgreSQL
Asesoría y desarrollo de sistemas
Guayaquil - Ecuador
Cel. +59387171157

#11Kevin Grittner
Kevin.Grittner@wicourts.gov
In reply to: Jaime Casanova (#10)
Re: Re: [BUGS] BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

Jaime Casanova <jcasanov@systemguards.com.ec> wrote:

the table will be scanned but not rewritten

That can still be a very long time on some tables.

And there would still be the issue of dodging all the brickbats thrown
at me by developers whose tools use the system tables to limit the
number of characters a user is allowed to type into an application.

-Kevin

#12Xuân Baldauf
xuan--2009.03--submitbug--support--postgresql.org@baldauf.org
In reply to: Jaime Casanova (#8)
Re: Re: [BUGS] BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

Jaime Casanova wrote:

On Thu, Mar 5, 2009 at 10:47 AM, Matteo Beccati <php@beccati.com> wrote:

Guillaume Smet ha scritto:

On Wed, Mar 4, 2009 at 11:50 AM, Peter Eisentraut <peter_e@gmx.net> wrote:

The question is how you want to implement this in a data type independent
fashion. You can't assume that increasing the typmod is a noop for all data
types.

Sure. See my previous answer on -hackers (I don't think this
discussion belong to -bugs) and especially the discussion in the
archives about Jonas' patch.

I recently had a similar problem when I added some domains to the
application. ALTER TABLE ... TYPE varchar_dom was leading to a full
table rewrite even though the underlying type definition were exactly
the same (i.e. varchar(64)). I can live with it, but I suppose this fix
might be related to the varlen one.

ALTER TABLE ... TYPE does cause a table rewrite even if new_type =
old_type, and that is actually useful...
for example when you add a fillfactor to an existing table that
fillfactor will not affect the existing data until you rewrite the
table and a convenient way is exactly using ALTER TABLE ... TYPE.

Well, while this behaviour is well-known for PostgreSQL, this is
actually an abuse of syntax. If there are legitimate requirements for
rewriting a table, then there should be explicit syntax for such a
feature, like "ALTER TABLE ... REWRITE". Rewriting a table in case of
"ALTER TABLE ... TYPE" is, by the semantics of that statement, just a
side-effect, which may or may not happen, depending on how optimized the
DBMS is. It is bad design to avoid optimization just because an
unnecessary side-effect would be optimized away.

now, back to the problem... is not easier to define a column as TEXT
and to put a check to constraint the length? if you wanna change the
constraint that will be almost free

No. Is it possible to change the column type from VARCHAR(5) to TEXT
without a table-rewrite penalty?

ciao,
Xuân.

#13Jaime Casanova
jcasanov@systemguards.com.ec
In reply to: Xuân Baldauf (#12)
Re: [HACKERS] Re: BUG #4689: Expanding the length of a VARCHAR column should not induce a table rewrite

On Thu, Mar 5, 2009 at 3:27 PM, Xuân Baldauf
<xuan--2009.03--submitbug--support--postgresql.org@baldauf.org> wrote:

Well, while this behaviour is well-known for PostgreSQL, this is actually an
abuse of syntax. If there are legitimate requirements for rewriting a table,
then there should be explicit syntax for such a feature, like "ALTER TABLE
... REWRITE". Rewriting a table in case of "ALTER TABLE ... TYPE" is, by the
semantics of that statement, just a side-effect, which may or may not
happen, depending on how optimized the DBMS is. It is bad design to avoid
optimization just because an unnecessary side-effect would be optimized
away.

note that this is my opinion and not represent the PGDG (Postgresql
Global Development Group) opinion

now, back to the problem... is not easier to define a column as TEXT
and to put a check to constraint the length? if you wanna change the
constraint that will be almost free

No. Is it possible to change the column type from VARCHAR(5) to TEXT without
a table-rewrite penalty?

the idea is to make that change once (and to create new tables just with TEXT)

and then you can make ALTER TABLE ... ADD CHECK (length(column) =
a_value) as many times as you want without the need for a table
rewrite

--
Atentamente,
Jaime Casanova
Soporte y capacitación de PostgreSQL
Asesoría y desarrollo de sistemas
Guayaquil - Ecuador
Cel. +59387171157