vacuum process size

Started by Tatsuo Ishii over 26 years ago · 27 messages
#1 Tatsuo Ishii
t-ishii@sra.co.jp

Just for testing, I made a huge table (>2GB, with 10000000
tuples). Copying the 10000000 tuples took 23 minutes; not so
bad. Vacuum analyze took 11 minutes; not too bad. After this I created
an index on an int4 column, which took 9 minutes. Next I deleted 5000000
tuples to see how long the delete would take: 6 minutes. Good. Then I
ran into a problem. The vacuum analyze I ran after that seemed to take
forever (it actually took 47 minutes). The biggest problem was the
postgres process size: 478MB! This is not acceptable for me. Any idea?

This is PostgreSQL 6.5.1 running on RH 6.0.
--
Tatsuo Ishii

#2 Tom Lane
tgl@sss.pgh.pa.us
In reply to: Tatsuo Ishii (#1)
Re: [HACKERS] vacuum process size

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

Just for testing, I made a huge table (>2GB, with 10000000
tuples). Copying the 10000000 tuples took 23 minutes; not so
bad. Vacuum analyze took 11 minutes; not too bad. After this I created
an index on an int4 column, which took 9 minutes. Next I deleted 5000000
tuples to see how long the delete would take: 6 minutes. Good. Then I
ran into a problem. The vacuum analyze I ran after that seemed to take
forever (it actually took 47 minutes). The biggest problem was the
postgres process size: 478MB! This is not acceptable for me. Any idea?

Yeah, I've complained about that before --- it seems that vacuum takes
a really unreasonable amount of time to remove dead tuples from an index.
It's been like that at least since 6.3.2, probably longer.

regards, tom lane

#3 Tatsuo Ishii
t-ishii@sra.co.jp
In reply to: Tom Lane (#2)
Re: [HACKERS] vacuum process size

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

Just for testing, I made a huge table (>2GB, with 10000000
tuples). Copying the 10000000 tuples took 23 minutes; not so
bad. Vacuum analyze took 11 minutes; not too bad. After this I created
an index on an int4 column, which took 9 minutes. Next I deleted 5000000
tuples to see how long the delete would take: 6 minutes. Good. Then I
ran into a problem. The vacuum analyze I ran after that seemed to take
forever (it actually took 47 minutes). The biggest problem was the
postgres process size: 478MB! This is not acceptable for me. Any idea?

Yeah, I've complained about that before --- it seems that vacuum takes
a really unreasonable amount of time to remove dead tuples from an index.
It's been like that at least since 6.3.2, probably longer.

Hiroshi came up with a workaround for this (see the included
patch). After applying it, the process size shrank from 478MB to
86MB! (The processing time did not decrease, however.) According to
him, repalloc seems not very effective with a large number of calls. The
patch probably decreases the number of calls to 1/10.
--
Tatsuo Ishii

-------------------------------------------------------------------------
*** vacuum.c.orig	Sat Jul  3 09:32:40 1999
--- vacuum.c	Thu Aug 19 17:34:18 1999
***************
*** 2519,2530 ****
  static void
  vc_vpinsert(VPageList vpl, VPageDescr vpnew)
  {
  
  	/* allocate a VPageDescr entry if needed */
  	if (vpl->vpl_num_pages == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) palloc(100 * sizeof(VPageDescr));
! 	else if (vpl->vpl_num_pages % 100 == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) repalloc(vpl->vpl_pagedesc, (vpl->vpl_num_pages + 100) * sizeof(VPageDescr));
  	vpl->vpl_pagedesc[vpl->vpl_num_pages] = vpnew;
  	(vpl->vpl_num_pages)++;
  
--- 2519,2531 ----
  static void
  vc_vpinsert(VPageList vpl, VPageDescr vpnew)
  {
+ #define PG_NPAGEDESC 1000
  
  	/* allocate a VPageDescr entry if needed */
  	if (vpl->vpl_num_pages == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) palloc(PG_NPAGEDESC * sizeof(VPageDescr));
! 	else if (vpl->vpl_num_pages % PG_NPAGEDESC == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) repalloc(vpl->vpl_pagedesc, (vpl->vpl_num_pages + PG_NPAGEDESC) * sizeof(VPageDescr));
  	vpl->vpl_pagedesc[vpl->vpl_num_pages] = vpnew;
  	(vpl->vpl_num_pages)++;

#4 Hiroshi Inoue
Inoue@tpf.co.jp
In reply to: Tatsuo Ishii (#3)
RE: [HACKERS] vacuum process size

Hi all,

I found the following comment in utils/mmgr/aset.c.
The high memory usage of a big vacuum is probably caused by this
change.
Calling repalloc() many times with an ever-increasing size parameter
would need a large amount of memory.

Should vacuum call realloc() directly?
Or should AllocSet...() be changed?

Comments?

* NOTE:
* This is a new (Feb. 05, 1999) implementation of the allocation set
* routines. AllocSet...() does not use OrderedSet...() any more.
* Instead it manages allocations in a block pool by itself, combining
* many small allocations in a few bigger blocks. AllocSetFree() does
* never free() memory really. It just add's the free'd area to some
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
* list for later reuse by AllocSetAlloc(). All memory blocks are free()'d

Regards.

Hiroshi Inoue
Inoue@tpf.co.jp

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

Just for testing, I made a huge table (>2GB, with 10000000
tuples). Copying the 10000000 tuples took 23 minutes; not so
bad. Vacuum analyze took 11 minutes; not too bad. After this I created
an index on an int4 column, which took 9 minutes. Next I deleted 5000000
tuples to see how long the delete would take: 6 minutes. Good. Then I
ran into a problem. The vacuum analyze I ran after that seemed to take
forever (it actually took 47 minutes). The biggest problem was the
postgres process size: 478MB! This is not acceptable for me. Any idea?

Yeah, I've complained about that before --- it seems that vacuum takes
a really unreasonable amount of time to remove dead tuples from an index.
It's been like that at least since 6.3.2, probably longer.

Hiroshi came up with a workaround for this (see the included
patch). After applying it, the process size shrank from 478MB to
86MB! (The processing time did not decrease, however.) According to
him, repalloc seems not very effective with a large number of calls. The
patch probably decreases the number of calls to 1/10.
--
Tatsuo Ishii

-------------------------------------------------------------------------
*** vacuum.c.orig	Sat Jul  3 09:32:40 1999
--- vacuum.c	Thu Aug 19 17:34:18 1999
***************
*** 2519,2530 ****
  static void
  vc_vpinsert(VPageList vpl, VPageDescr vpnew)
  {
  
  	/* allocate a VPageDescr entry if needed */
  	if (vpl->vpl_num_pages == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) palloc(100 * sizeof(VPageDescr));
! 	else if (vpl->vpl_num_pages % 100 == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) repalloc(vpl->vpl_pagedesc, (vpl->vpl_num_pages + 100) * sizeof(VPageDescr));
  	vpl->vpl_pagedesc[vpl->vpl_num_pages] = vpnew;
  	(vpl->vpl_num_pages)++;
  
--- 2519,2531 ----
  static void
  vc_vpinsert(VPageList vpl, VPageDescr vpnew)
  {
+ #define PG_NPAGEDESC 1000
  
  	/* allocate a VPageDescr entry if needed */
  	if (vpl->vpl_num_pages == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) palloc(PG_NPAGEDESC * sizeof(VPageDescr));
! 	else if (vpl->vpl_num_pages % PG_NPAGEDESC == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) repalloc(vpl->vpl_pagedesc, (vpl->vpl_num_pages + PG_NPAGEDESC) * sizeof(VPageDescr));
  	vpl->vpl_pagedesc[vpl->vpl_num_pages] = vpnew;
  	(vpl->vpl_num_pages)++;

#5 Mike Mascari
mascarim@yahoo.com
In reply to: Hiroshi Inoue (#4)
RE: [HACKERS] vacuum process size

At the very least, couldn't vc_vpinsert() double the
allocation whenever vpl->vpl_pagedesc needs to be
expanded, instead of expanding it linearly
by PG_NPAGEDESC, or by the original 100?
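
A minimal sketch of the amortized-doubling idea (names here are hypothetical, and plain malloc()/realloc() stand in for the backend's palloc()/repalloc()): doubling the capacity on each expansion means n inserts cost only O(log n) reallocations, instead of n/100 with the fixed-step scheme.

```c
#include <stdlib.h>

/* Toy growable array: capacity doubles whenever it runs out. */
typedef struct
{
	void	  **items;
	size_t		count;		/* slots in use */
	size_t		capacity;	/* slots allocated */
} GrowArray;

static void
grow_array_push(GrowArray *a, void *item)
{
	if (a->count == a->capacity)
	{
		/* double (or start at 1024), so reallocations stay O(log n) */
		a->capacity = a->capacity ? a->capacity * 2 : 1024;
		void	  **tmp = realloc(a->items, a->capacity * sizeof(void *));

		if (tmp == NULL)
			abort();
		a->items = tmp;
	}
	a->items[a->count++] = item;
}
```

With a 1024-entry start, a million inserts trigger only eleven realloc() calls, ending at a capacity of 1048576.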

Mike Mascari
(mascarim@yahoo.com)

--- Hiroshi Inoue <Inoue@tpf.co.jp> wrote:

Hi all,

I found the following comment in utils/mmgr/aset.c.
The high memory usage of a big vacuum is probably caused by this
change.
Calling repalloc() many times with an ever-increasing size
parameter would need a large amount of memory.

Should vacuum call realloc() directly?
Or should AllocSet...() be changed?

Comments?

* NOTE:
* This is a new (Feb. 05, 1999) implementation of the allocation set
* routines. AllocSet...() does not use OrderedSet...() any more.
* Instead it manages allocations in a block pool by itself, combining
* many small allocations in a few bigger blocks. AllocSetFree() does
* never free() memory really. It just add's the free'd area to some
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
* list for later reuse by AllocSetAlloc(). All memory blocks are free()'d

Regards.

Hiroshi Inoue
Inoue@tpf.co.jp

*** vacuum.c.orig	Sat Jul  3 09:32:40 1999
--- vacuum.c	Thu Aug 19 17:34:18 1999
***************
*** 2519,2530 ****
  static void
  vc_vpinsert(VPageList vpl, VPageDescr vpnew)
  {
  
  	/* allocate a VPageDescr entry if needed */
  	if (vpl->vpl_num_pages == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) palloc(100 * sizeof(VPageDescr));
! 	else if (vpl->vpl_num_pages % 100 == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) repalloc(vpl->vpl_pagedesc, (vpl->vpl_num_pages + 100) * sizeof(VPageDescr));
  	vpl->vpl_pagedesc[vpl->vpl_num_pages] = vpnew;
  	(vpl->vpl_num_pages)++;
  
--- 2519,2531 ----
  static void
  vc_vpinsert(VPageList vpl, VPageDescr vpnew)
  {
+ #define PG_NPAGEDESC 1000
  
  	/* allocate a VPageDescr entry if needed */
  	if (vpl->vpl_num_pages == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) palloc(PG_NPAGEDESC * sizeof(VPageDescr));
! 	else if (vpl->vpl_num_pages % PG_NPAGEDESC == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) repalloc(vpl->vpl_pagedesc, (vpl->vpl_num_pages + PG_NPAGEDESC) * sizeof(VPageDescr));
  	vpl->vpl_pagedesc[vpl->vpl_num_pages] = vpnew;
  	(vpl->vpl_num_pages)++;


#6 Tom Lane
tgl@sss.pgh.pa.us
In reply to: Mike Mascari (#5)
Re: [HACKERS] vacuum process size

"Hiroshi Inoue" <Inoue@tpf.co.jp> writes:

I found the following comment in utils/mmgr/aset.c.
The high memory usage of a big vacuum is probably caused by this
change.

AFAIK, there is no "change" there. free() doesn't give memory
back to the kernel either.

Calling repalloc() many times with an ever-increasing size parameter
would need a large amount of memory.

Good point, because aset.c doesn't coalesce adjacent free chunks.
And of course, reallocating the block bigger and bigger is exactly
the usual behavior with realloc-using code :-(

I don't think it would be a good idea to add coalescing logic to aset.c
--- that'd defeat the purpose of building a small/simple/fast allocator.

Perhaps for large standalone chunks (those that AllocSetAlloc made an
entire separate block for), AllocSetFree should free() the block instead
of putting the chunk on its own freelist. Assuming that malloc/free are
smart enough to coalesce adjacent blocks, that would prevent the bad
behavior from recurring once the request size gets past
ALLOC_SMALLCHUNK_LIMIT, and for small requests we don't care.

But it doesn't look like there is any cheap way to detect that a chunk
being freed takes up all of its block. We'd have to mark it specially
somehow. A kluge that comes to mind is to set the chunk->size to zero
when it is a standalone allocation.
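
A toy illustration of the marking kluge described above, with hypothetical names and layout (the real aset.c chunk header is different): a zero size field flags a chunk that owns its entire block, so freeing it can return the memory to libc immediately.

```c
#include <stdlib.h>

/* Hypothetical chunk header; not the real aset.c layout. */
typedef struct ToyChunk
{
	size_t		size;		/* requested size; 0 marks a standalone block */
} ToyChunk;

#define TOY_BIGCHUNK_LIMIT 65536	/* cf. ALLOC_BIGCHUNK_LIMIT */

static void *
toy_alloc(size_t size)
{
	ToyChunk   *c = malloc(sizeof(ToyChunk) + size);

	if (c == NULL)
		abort();
	/* Big requests get a dedicated block; flag them with size == 0 so
	 * toy_free() knows the chunk fills its block and can free() it. */
	c->size = (size >= TOY_BIGCHUNK_LIMIT) ? 0 : size;
	return c + 1;
}

static void
toy_free(void *p)
{
	ToyChunk   *c = (ToyChunk *) p - 1;

	if (c->size == 0)
		free(c);			/* standalone block: really release it */
	/* else: a real allocator would put the chunk on a freelist
	 * for reuse (omitted in this toy, so small chunks just leak) */
}
```

The point is only the dispatch: chunks at or above the limit are given back to malloc's arena, where adjacent free blocks can be coalesced, while small chunks keep the cheap freelist path.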

I believe Jan designed the current aset.c logic. Jan, any comments?

Should vacuum call realloc() directly?

Not unless you like *permanent* memory leaks instead of transient ones.
Consider what will happen at elog().

However, another possible solution is to redesign the data structure
in vacuum() so that it can be made up of multiple allocation blocks,
rather than insisting that all the array entries always be consecutive.
Then it wouldn't depend on repalloc at all. On the whole I like that
idea better --- even if repalloc can be fixed not to waste memory, it
still implies copying large amounts of data around for no purpose.

regards, tom lane

#7 Tatsuo Ishii
t-ishii@sra.co.jp
In reply to: Tom Lane (#6)
Re: [HACKERS] vacuum process size

Mike,

At the very least, couldn't vc_vpinsert() double the
allocation whenever vpl->vpl_pagedesc needs to be
expanded, instead of expanding it linearly
by PG_NPAGEDESC, or by the original 100?

I have tested your idea and found memory usage improved even further
(from 86MB down to 43MB). Standard vacuum consumes as much as 478MB of
memory when deleting 5000000 tuples, which would not be acceptable for
most configurations. I think we should fix this as soon as possible. If
there's no objection, I will commit the included patches to the stable
tree (it seems Tom has a more aggressive idea, so I'll leave the current
tree as it is).
---
Tatsuo Ishii
-------------------------------------------------------------------
*** vacuum.c.orig	Sat Jul  3 09:32:40 1999
--- vacuum.c	Tue Aug 24 10:08:43 1999
***************
*** 2519,2530 ****
  static void
  vc_vpinsert(VPageList vpl, VPageDescr vpnew)
  {
  
  	/* allocate a VPageDescr entry if needed */
  	if (vpl->vpl_num_pages == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) palloc(100 * sizeof(VPageDescr));
! 	else if (vpl->vpl_num_pages % 100 == 0)
! 		vpl->vpl_pagedesc = (VPageDescr *) repalloc(vpl->vpl_pagedesc, (vpl->vpl_num_pages + 100) * sizeof(VPageDescr));
  	vpl->vpl_pagedesc[vpl->vpl_num_pages] = vpnew;
  	(vpl->vpl_num_pages)++;
  
--- 2519,2538 ----
  static void
  vc_vpinsert(VPageList vpl, VPageDescr vpnew)
  {
+ #define PG_NPAGEDESC 1024
+ 	static uint num_pages;
  
  	/* allocate a VPageDescr entry if needed */
  	if (vpl->vpl_num_pages == 0)
! 	{
! 		vpl->vpl_pagedesc = (VPageDescr *) palloc(PG_NPAGEDESC * sizeof(VPageDescr));
! 		num_pages = PG_NPAGEDESC;
! 	}
! 	else if (vpl->vpl_num_pages >= num_pages)
! 	{
! 		num_pages *= 2;
! 		vpl->vpl_pagedesc = (VPageDescr *) repalloc(vpl->vpl_pagedesc, num_pages * sizeof(VPageDescr));
! 	}
  	vpl->vpl_pagedesc[vpl->vpl_num_pages] = vpnew;
  	(vpl->vpl_num_pages)++;

#8 Bruce Momjian
maillist@candle.pha.pa.us
In reply to: Mike Mascari (#5)
Re: [HACKERS] vacuum process size

At the very least, couldn't vc_vpinsert() double the
allocation whenever vpl->vpl_pagedesc needs to be
expanded, instead of expanding it linearly
by PG_NPAGEDESC, or by the original 100?

This seems like a good idea.

-- 
  Bruce Momjian                        |  http://www.op.net/~candle
  maillist@candle.pha.pa.us            |  (610) 853-3000
  +  If your life is a hard drive,     |  830 Blythe Avenue
  +  Christ can be your backup.        |  Drexel Hill, Pennsylvania 19026
#9 Tom Lane
tgl@sss.pgh.pa.us
In reply to: Bruce Momjian (#8)
Re: [HACKERS] vacuum process size

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

I have tested your idea and found memory usage improved even further
(from 86MB down to 43MB). Standard vacuum consumes as much as 478MB of
memory when deleting 5000000 tuples, which would not be acceptable for
most configurations. I think we should fix this as soon as possible. If
there's no objection, I will commit the included patches to the stable
tree (it seems Tom has a more aggressive idea, so I'll leave the current
tree as it is).

No, please make the change in current as well. I was thinking about
tweaking aset.c to be smarter about releasing large chunks, but in any
case having the doubling behavior at the request point will be a big
improvement.

I do not like your patch as given, however. By using a static variable
you are assuming that there is only one active VPageList at a time.
It looks to me like there are at least two --- and there is no reason
to think they'd be the same size.

You need to add a num_pages field to the VPageList struct, not use
a static.
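
A sketch of the fix Tom asks for, with the capacity carried in the struct itself (the field name vpl_max_pages is hypothetical, and malloc()/realloc() stand in for palloc()/repalloc()). Each list now tracks its own capacity, so two lists growing at different rates no longer trample a shared static:

```c
#include <stdlib.h>

typedef struct VPageDescrData *VPageDescr;	/* opaque for this sketch */

typedef struct VPageListData
{
	VPageDescr *vpl_pagedesc;	/* array of page descriptors */
	int			vpl_num_pages;	/* entries in use */
	int			vpl_max_pages;	/* allocated capacity -- the new field */
} VPageListData, *VPageList;

#define PG_NPAGEDESC 1024

static void
vc_vpinsert_sketch(VPageList vpl, VPageDescr vpnew)
{
	if (vpl->vpl_num_pages == 0)
	{
		vpl->vpl_max_pages = PG_NPAGEDESC;
		vpl->vpl_pagedesc = malloc(vpl->vpl_max_pages * sizeof(VPageDescr));
	}
	else if (vpl->vpl_num_pages >= vpl->vpl_max_pages)
	{
		/* double this list's own capacity */
		vpl->vpl_max_pages *= 2;
		vpl->vpl_pagedesc = realloc(vpl->vpl_pagedesc,
									vpl->vpl_max_pages * sizeof(VPageDescr));
	}
	if (vpl->vpl_pagedesc == NULL)
		abort();
	vpl->vpl_pagedesc[vpl->vpl_num_pages] = vpnew;
	(vpl->vpl_num_pages)++;
}
```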

regards, tom lane

#10 Tom Lane
tgl@sss.pgh.pa.us
In reply to: Tom Lane (#9)
Re: [HACKERS] vacuum process size

I have been looking some more at the vacuum-process-size issue, and
I am having a hard time understanding why the VPageList data structure
is the critical one. As far as I can see, there should be at most one
pointer in it for each disk page of the relation. OK, you were
vacuuming a table with something like a quarter million pages, so
the end size of the VPageList would have been something like a megabyte,
and given the inefficient usage of repalloc() in the original code,
a lot more space than that would have been wasted as the list grew.
So doubling the array size at each step is a good change.

But there are a lot more tuples than pages in most relations.

I see two lists with per-tuple data in vacuum.c, "vtlinks" in
vc_scanheap and "vtmove" in vc_rpfheap, that are both being grown with
essentially the same technique of repalloc() after every N entries.
I'm not entirely clear on how many tuples get put into each of these
lists, but it sure seems like in ordinary circumstances they'd be much
bigger space hogs than any of the three VPageList lists.

I recommend going to a doubling approach for each of these lists as
well as for VPageList.

There is a fourth usage of repalloc with the same method, for "ioid"
in vc_getindices. This only gets one entry per index on the current
relation, so it's unlikely to be worth changing on its own merit.
But it might be worth building a single subroutine that expands a
growable list of entries (taking the sizeof() of each entry as a
parameter) and applying it in all four places.
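
The single helper could look something like this sketch (names are hypothetical; a backend version would use repalloc() and elog() on failure). The caller passes the entry size, so one routine serves all four lists:

```c
#include <stdlib.h>

/* Ensure a growable list has room for one more entry, doubling its
 * capacity when full.  *ncap tracks allocated entries; returns the
 * (possibly moved) list pointer. */
static void *
grow_list(void *list, int nused, int *ncap, size_t entry_size)
{
	if (list != NULL && nused < *ncap)
		return list;			/* room left, nothing to do */
	*ncap = (*ncap == 0) ? 1024 : *ncap * 2;
	list = realloc(list, (size_t) *ncap * entry_size);
	if (list == NULL)
		abort();				/* a backend version would elog() */
	return list;
}
```

Usage at each site would then be a one-liner, e.g. (with hypothetical variable names) `vtlinks = grow_list(vtlinks, num_vtlinks, &max_vtlinks, sizeof(VTupleLinkData));`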

regards, tom lane

#11 Brian E Gallew
geek+@cmu.edu
In reply to: Tom Lane (#10)
Re: [HACKERS] vacuum process size

Then <tgl@sss.pgh.pa.us> spoke up and said:

So doubling the array size at each step is a good change.

But there are a lot more tuples than pages in most relations.

I see two lists with per-tuple data in vacuum.c, "vtlinks" in
vc_scanheap and "vtmove" in vc_rpfheap, that are both being grown with
essentially the same technique of repalloc() after every N entries.
I'm not entirely clear on how many tuples get put into each of these
lists, but it sure seems like in ordinary circumstances they'd be much
bigger space hogs than any of the three VPageList lists.

I recommend going to a doubling approach for each of these lists as
well as for VPageList.

Question: is there reliable information in pg_statistics (or other
system tables) which can be used to make a reasonable estimate of the
sizes of these structures before the initial allocation? Certainly the
file size can be gotten from a stat() (modulo some portability and
sparse-file issues).

--
=====================================================================
| JAVA must have been developed in the wilds of West Virginia. |
| After all, why else would it support only single inheritance?? |
=====================================================================
| Finger geek@cmu.edu for my public key. |
=====================================================================

#12 Tom Lane
tgl@sss.pgh.pa.us
In reply to: Brian E Gallew (#11)
Re: [HACKERS] vacuum process size

If there's no objection, I will commit the included patches to the stable
tree (it seems Tom has a more aggressive idea, so I'll leave the current
tree as it is).

No, please make the change in current as well. I was thinking about
tweaking aset.c to be smarter about releasing large chunks, but in any
case having the doubling behavior at the request point will be a big
improvement.

I have just committed changes into current (but not REL6_5) to make
aset.c smarter about giving back memory from large requests. Basically,
for chunk sizes >= ALLOC_BIGCHUNK_LIMIT, pfree() does an actual free()
and repalloc() does an actual realloc(). There is no change in behavior
for smaller chunk sizes. This should cap the amount of space that can
be wasted by aset.c while repalloc'ing a chunk larger and larger.

For lack of a better idea I set ALLOC_BIGCHUNK_LIMIT to 64K. I don't
think it'd pay to make it very small, but I don't really know whether
this is a good choice or not.

It would still be a good idea to fix vacuum.c to double its repalloc
requests at each step, but Tatsuo was already working on that part
so I won't joggle his elbow...

regards, tom lane

#13 Tom Lane
tgl@sss.pgh.pa.us
In reply to: Tom Lane (#12)
Re: [HACKERS] vacuum process size

Brian E Gallew <geek+@cmu.edu> writes:

Question: is there reliable information in pg_statistics (or other
system tables) which can be used to make a reasonable estimate for the
sizes of these structures before initial allocation? Certainly the
file size can be gotten from a stat (some portability issues, sparse
file issues).

pg_statistics would tell you what was found out by the last vacuum on
the table, if there ever was one. Dunno how reliable you want to
consider that to be. stat() would provide up-to-date info, but the
problem with it is that the total file size might be a drastic
overestimate of the number of pages that vacuum needs to put in these
lists. There's not really much chance of getting a useful estimate from
the last vacuum run, either. AFAICT what we are interested in is the
number of pages containing dead tuples, and by definition all of those
tuples will have died since the last vacuum...

On the whole, just fixing the memory management seems like the best bet.
We know how to do that, and it may benefit other things besides vacuum.

regards, tom lane

#14 Tatsuo Ishii
t-ishii@sra.co.jp
In reply to: Tom Lane (#13)
Re: [HACKERS] vacuum process size

I have just committed changes into current (but not REL6_5) to make

Just for confirmation: I see both a REL6_5_PATCHES and a REL6_5 tag in
the CVS repository. I thought that REL6_5_PATCHES is the tag for the 6.5
stable tree and would eventually become 6.5.2. If so, what is the
REL6_5 tag? Or am I totally missing the point?
--
Tatsuo Ishii

#15 Bruce Momjian
maillist@candle.pha.pa.us
In reply to: Tatsuo Ishii (#14)
Re: [HACKERS] vacuum process size

I have just committed changes into current (but not REL6_5) to make

Just for confirmation: I see both a REL6_5_PATCHES and a REL6_5 tag in
the CVS repository. I thought that REL6_5_PATCHES is the tag for the 6.5
stable tree and would eventually become 6.5.2. If so, what is the
REL6_5 tag? Or am I totally missing the point?

REL6_5 was a mistake.
-- 
  Bruce Momjian                        |  http://www.op.net/~candle
  maillist@candle.pha.pa.us            |  (610) 853-3000
  +  If your life is a hard drive,     |  830 Blythe Avenue
  +  Christ can be your backup.        |  Drexel Hill, Pennsylvania 19026
#16 Hiroshi Inoue
Inoue@tpf.co.jp
In reply to: Tom Lane (#10)
RE: [HACKERS] vacuum process size

-----Original Message-----
From: Tom Lane [mailto:tgl@sss.pgh.pa.us]
Sent: Wednesday, August 25, 1999 1:20 AM
To: t-ishii@sra.co.jp
Cc: Mike Mascari; Hiroshi Inoue; pgsql-hackers@postgreSQL.org
Subject: Re: [HACKERS] vacuum process size

I have been looking some more at the vacuum-process-size issue, and
I am having a hard time understanding why the VPageList data structure
is the critical one. As far as I can see, there should be at most one
pointer in it for each disk page of the relation. OK, you were
vacuuming a table with something like a quarter million pages, so
the end size of the VPageList would have been something like a megabyte,
and given the inefficient usage of repalloc() in the original code,
a lot more space than that would have been wasted as the list grew.
So doubling the array size at each step is a good change.

But there are a lot more tuples than pages in most relations.

I see two lists with per-tuple data in vacuum.c, "vtlinks" in
vc_scanheap and "vtmove" in vc_rpfheap, that are both being grown with
essentially the same technique of repalloc() after every N entries.
I'm not entirely clear on how many tuples get put into each of these
lists, but it sure seems like in ordinary circumstances they'd be much
bigger space hogs than any of the three VPageList lists.

AFAIK, both vtlinks and vtmove are NULL if vacuum is executed
without concurrent transactions.
They won't get very big unless loooong concurrent transactions exist.

Regards.

Hiroshi Inoue
Inoue@tpf.co.jp

#17 Tom Lane
tgl@sss.pgh.pa.us
In reply to: Hiroshi Inoue (#16)
Re: [HACKERS] vacuum process size

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

Just for confirmation: I see both a REL6_5_PATCHES and a REL6_5 tag in
the CVS repository. I thought that REL6_5_PATCHES is the tag for the 6.5
stable tree and would eventually become 6.5.2. If so, what is the
REL6_5 tag? Or am I totally missing the point?

Right, REL6_5_PATCHES is the 6.5.* branch. REL6_5 is just a tag ---
that is, it's effectively a frozen snapshot of the 6.5 release,
not an evolvable branch.

I am not sure if Marc intends to continue this naming convention
in future, or if it was just a mistake to create REL6_5 as a tag
not a branch. I don't see a whole lot of use for the frozen tag
myself...

regards, tom lane

#18 Tatsuo Ishii
t-ishii@sra.co.jp
In reply to: Tom Lane (#17)
Re: [HACKERS] vacuum process size

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

I have tested your idea and found memory usage improved even further
(from 86MB down to 43MB). Standard vacuum consumes as much as 478MB of
memory when deleting 5000000 tuples, which would not be acceptable for
most configurations. I think we should fix this as soon as possible. If
there's no objection, I will commit the included patches to the stable
tree (it seems Tom has a more aggressive idea, so I'll leave the current
tree as it is).

No, please make the change in current as well. I was thinking about
tweaking aset.c to be smarter about releasing large chunks, but in any
case having the doubling behavior at the request point will be a big
improvement.

I do not like your patch as given, however. By using a static variable
you are assuming that there is only one active VPageList at a time.
It looks to me like there are at least two --- and there is no reason
to think they'd be the same size.

You need to add a num_pages field to the VPageList struct, not use
a static.

Good point. I have committed new patches, which no longer use static
variables, to both REL6_5_PATCHES and the current tree.

Modified files: backend/commands/vacuum.c and
include/commands/vacuum.h.
---
Tatsuo Ishii

#19 Ansley, Michael
Michael.Ansley@intec.co.za
In reply to: Tatsuo Ishii (#18)
RE: [HACKERS] vacuum process size

The reason for the tag was to be able to return to the 6.5 release source
code. It's production code, and should be accessible at least for the next
couple of months.

Was a tag created for 6.5.1? The object is to be able to check out any
particular release, bugs and all, whenever we feel like it.

MikeA

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

Just for confirmation: I see both a REL6_5_PATCHES and a REL6_5 tag in
the CVS repository. I thought that REL6_5_PATCHES is the tag for the 6.5
stable tree and would eventually become 6.5.2. If so, what is the
REL6_5 tag? Or am I totally missing the point?

Right, REL6_5_PATCHES is the 6.5.* branch. REL6_5 is just a tag ---
that is, it's effectively a frozen snapshot of the 6.5 release,
not an evolvable branch.

I am not sure if Marc intends to continue this naming convention
in future, or if it was just a mistake to create REL6_5 as a tag
not a branch. I don't see a whole lot of use for the frozen tag
myself...

regards, tom lane

#20 The Hermit Hacker
scrappy@hub.org
In reply to: Tom Lane (#17)
Re: [HACKERS] vacuum process size

On Wed, 25 Aug 1999, Tom Lane wrote:

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

Just for confirmation: I see both a REL6_5_PATCHES and a REL6_5 tag in
the CVS repository. I thought that REL6_5_PATCHES is the tag for the 6.5
stable tree and would eventually become 6.5.2. If so, what is the
REL6_5 tag? Or am I totally missing the point?

Right, REL6_5_PATCHES is the 6.5.* branch. REL6_5 is just a tag ---
that is, it's effectively a frozen snapshot of the 6.5 release,
not an evolvable branch.

I am not sure if Marc intends to continue this naming convention
in future, or if it was just a mistake to create REL6_5 as a tag
not a branch. I don't see a whole lot of use for the frozen tag
myself...

I like the frozen tag myself, since, in the future, if we need to create a
quick tar ball of what things looked like at that release (ie.
v6.5->v6.5.2 patch?), it's easy to generate...

Actually, come to think of it... am going to try that out now... report back
in a bit...

Marc G. Fournier ICQ#7615664 IRC Nick: Scrappy
Systems Administrator @ hub.org
primary: scrappy@hub.org secondary: scrappy@{freebsd|postgresql}.org

#21 Ansley, Michael
Michael.Ansley@intec.co.za
In reply to: The Hermit Hacker (#20)
RE: [HACKERS] vacuum process size

Yes, all that too ;-)

On Wed, 25 Aug 1999, Ansley, Michael wrote:

The reason for the tag was to be able to return to the 6.5 release source
code. It's production code, and should be accessible at least for the next
couple of months.

Was a tag created for 6.5.1? The object is to be able to check out any
particular release, bugs and all, whenever we feel like it.

Never did v6.5.1... but I have no problem with starting to do this on
minor releases too, since...

Could someone try out the following patch?

ftp://ftp.postgresql.org/pub/postgresql-6.5-6.5.x.patch.gz

It is a patch against v6.5 that will bring it up to the most stable
version *if* it worked right. Reading through the patch, everything looks
good, but...

If this actually works, we just might have a way of saving ppl downloading
5Meg files on minor releases, as the above is <100k :)

MikeA

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

Just for confirmation: I see both a REL6_5_PATCHES and a REL6_5 tag in
the CVS repository. I thought that REL6_5_PATCHES is the tag for the 6.5
stable tree and would eventually become 6.5.2. If so, what is the
REL6_5 tag? Or am I totally missing the point?

Right, REL6_5_PATCHES is the 6.5.* branch. REL6_5 is just a tag ---
that is, it's effectively a frozen snapshot of the 6.5 release,
not an evolvable branch.

I am not sure if Marc intends to continue this naming convention
in future, or if it was just a mistake to create REL6_5 as a tag
not a branch. I don't see a whole lot of use for the frozen tag
myself...

regards, tom lane


Marc G. Fournier ICQ#7615664 IRC Nick: Scrappy
Systems Administrator @ hub.org
primary: scrappy@hub.org secondary: scrappy@{freebsd|postgresql}.org

#22 The Hermit Hacker
scrappy@hub.org
In reply to: Ansley, Michael (#19)
RE: [HACKERS] vacuum process size

On Wed, 25 Aug 1999, Ansley, Michael wrote:

The reason for the tag was to be able to return to the 6.5 release source
code. It's production code, and should be accessible at least for the next
couple of months.

Was a tag created for 6.5.1? The object is to be able to check out any
particular release, bugs and all, whenever we feel like it.

Never did v6.5.1... but I have no problem with starting to do this on
minor releases too, since...

Could someone try out the following patch?

ftp://ftp.postgresql.org/pub/postgresql-6.5-6.5.x.patch.gz

It is a patch against v6.5 that will bring it up to the most stable
version *if* it worked right. Reading through the patch, everything looks
good, but...

If this actually works, we just might have a way of saving ppl downloading
5Meg files on minor releases, as the above is <100k :)

MikeA

Tatsuo Ishii <t-ishii@sra.co.jp> writes:

Just for confirmation: I see both a REL6_5_PATCHES and a REL6_5 tag in
the CVS repository. I thought that REL6_5_PATCHES is the tag for the 6.5
stable tree and would eventually become 6.5.2. If so, what is the
REL6_5 tag? Or am I totally missing the point?

Right, REL6_5_PATCHES is the 6.5.* branch. REL6_5 is just a tag ---
that is, it's effectively a frozen snapshot of the 6.5 release,
not an evolvable branch.

I am not sure if Marc intends to continue this naming convention
in future, or if it was just a mistake to create REL6_5 as a tag
not a branch. I don't see a whole lot of use for the frozen tag
myself...

regards, tom lane


Marc G. Fournier ICQ#7615664 IRC Nick: Scrappy
Systems Administrator @ hub.org
primary: scrappy@hub.org secondary: scrappy@{freebsd|postgresql}.org

#23 Tom Lane
tgl@sss.pgh.pa.us
In reply to: The Hermit Hacker (#22)
Re: [HACKERS] vacuum process size

"Ansley, Michael" <Michael.Ansley@intec.co.za> writes:

The reason for the tag was to be able to return to the 6.5 release source
code. It's production code, and should be accessible at least for the next
couple of months.
Was a tag created for 6.5.1? The object is to be able to check out any
particular release, bugs and all, whenever we feel like it.

You can always do a checkout by date if you need to capture the state of
the cvs tree at some particular past time. Frozen tags are just a (very
inefficient) way of remembering specific past times that you think are
likely to be of interest.

regards, tom lane

#24 Zeugswetter Andreas IZ5
Andreas.Zeugswetter@telecom.at
In reply to: Tom Lane (#23)
AW: [HACKERS] vacuum process size

As far as I remember, REL6_5 tags version 6.5.1, so probably the tags
should read REL6_5_0, REL6_5_1, ... to show that fact.
to show that fact.

I like the frozen tag myself, since, in the future, if we need to create a
quick tar ball of what things looked like at that release (ie.
v6.5->v6.5.2 patch?), its easy to generate...

Yes, very handy.

Andreas

#25 The Hermit Hacker
scrappy@hub.org
In reply to: Tom Lane (#23)
Re: [HACKERS] vacuum process size

On Wed, 25 Aug 1999, Tom Lane wrote:

"Ansley, Michael" <Michael.Ansley@intec.co.za> writes:

The reason for the tag was to be able to return to the 6.5 release source
code. It's production code, and should be accessible at least for the next
couple of months.
Was a tag created for 6.5.1? The object is to be able to check out any
particular release, bugs and all, whenever we feel like it.

You can always do a checkout by date if you need to capture the state of
the cvs tree at some particular past time. Frozen tags are just a (very
inefficient) way of remembering specific past times that you think are
likely to be of interest.

Okay, you lost me on this one...why is it inefficient to tag the tree on
the date of a release vs trying to remember that date? *raised eyebrow*
In fact, vs trying to remember the exact date *and* time of a release?

Marc G. Fournier ICQ#7615664 IRC Nick: Scrappy
Systems Administrator @ hub.org
primary: scrappy@hub.org secondary: scrappy@{freebsd|postgresql}.org

#26 Tom Lane
tgl@sss.pgh.pa.us
In reply to: The Hermit Hacker (#25)
Re: [HACKERS] vacuum process size

The Hermit Hacker <scrappy@hub.org> writes:

Okay, you lost me on this one...why is it inefficient to tag the tree on
the date of a release vs trying to remember that date? *raised eyebrow*
In fact, vs trying to remember the exact date *and* time of a release?

Because you make an entry "REL6_5 => something or other" in *every*
*single* *file* of the CVS tree. It'd be more logical to store
"REL6_5 => 25 Aug 1999 11:55:32 -0300 (ADT)", or some such, in one
place. Dunno why the CVS people didn't think of that.

Inefficient though it be, I agree it's better than trying to remember
the release timestamps manually.

I'd suggest, though, that from here on out we use the short strings
like "REL6_6" for the branches, since people have much more need to
refer to the branches than specific release points. Tags for releases
could maybe be called "REL6_6_0", "REL6_6_1", etc.

regards, tom lane

#27 Leon
leon@udmnet.ru
In reply to: The Hermit Hacker (#22)
1 attachment(s)
Re: [HACKERS] vacuum process size

The Hermit Hacker wrote:

Never did v6.5.1...but I have no problem with starting to do this on minor
releases to, since...

Could someone try out the following patch?

ftp://ftp.postgresql.org/pub/postgresql-6.5-6.5.x.patch.gz

It is a patch against v6.5 that will bring it up to the most stable
version *if* it worked right. Reading through the patch, everything looks
good, but...

Great idea! It would be good practice to have simple patches for
minor versions. But this is definitely not a patch against 6.5.0, but
some other version. Unfortunately I lost my 6.5.0 .tar.gz distribution,
but I am pretty sure that my sources were intact. A lot of
hunks failed, and the patched version failed to compile.
Isn't the right way to do a patch to take the old distribution
and simply make a diff against the new tree? It seems the current
patch wasn't done that way. Included here is the patch log file
for your reference.

--
Leon.

Attachments:

results.gz (application/octet-stream)