TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

Started by Dilip Kumar over 12 years ago · 125 messages · pgsql-hackers
#1Dilip kumar
dilip.kumar@huawei.com

This patch implements the following TODO item:

Allow parallel cores to be used by vacuumdb
/messages/by-id/4F10A728.7090403@agliodbs.com

Like parallel pg_dump, vacuumdb is provided with an option to run the vacuum of multiple tables in parallel. [ vacuumdb -j ]

1. One new option is provided with vacuumdb to give the number of workers.

2. All workers will be started at the beginning, and all will wait for vacuum instructions from the master.

3. Now, if a table list is provided in the vacuumdb command using -t, it will send the vacuum of one table to one IDLE worker, the next table to the next IDLE worker, and so on.

4. If vacuum is given for one DB, it will execute a select on pg_class to get the table list, fetch the table names one by one, and assign the vacuum work to IDLE workers.

Performance Data by parallel vacuumdb:

Machine Configuration:

Core : 8

RAM: 24GB

Test Scenario:

16 tables, all with 4M records. [Many records are deleted and inserted using some pattern; the file is attached in the mail.]

Test Result

{Base Code}

Time(s)  %CPU Usage  Avg Read(kB/s)  Avg Write(kB/s)
521      3%          12000           20000

{With Parallel Vacuum Patch}

Workers  Time(s)  %CPU Usage  Avg Read(kB/s)  Avg Write(kB/s)
1        518      3%          12000           20000    --> takes the same path as the base code
2        390      5%          14000           30000
8        235      7%          18000           40000
16       197      8%          20000           50000

Conclusion:

By running vacuumdb in parallel, CPU and I/O throughput increase, and it can give a >50% performance improvement.

Work to be Done:

1. Documentation of the new command.

2. Parallel support for vacuuming all databases.

Is it required to move the common code for parallel operation of pg_dump and vacuumdb to one place and reuse it?

A prototype patch is attached in the mail; please provide your feedback/suggestions.

Thanks & Regards,
Dilip Kumar

Attachments:

TestCase.sql (application/octet-stream)
vacuumdb_parallel_v1.patch (application/octet-stream, +1337 -14)
#2Euler Taveira de Oliveira
In reply to: Dilip kumar (#1)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 07-11-2013 09:42, Dilip kumar wrote:

Dilip, this is on my TODO for 9.4. I've already had a half-baked patch
for it. Let's see what I can come up with.

Is it required to move the common code for parallel operation of pg_dump and vacuumdb to one place and reuse it ?

I'm not sure about that because the pg_dump parallel code is tied to
TOC entries. Also, dependency matters for pg_dump, while in the scripts
case a chosen order will be used. However, vacuumdb can share the
parallel code with clusterdb and reindexdb (my patch does it).

Of course, a refactor to unify parallel code (pg_dump and scripts) can
be done in a separate patch.

Prototype patch is attached in the mail, please provide your feedback/Suggestions...

I'll try to merge your patch with the one I have here until the next CF.

--
Euler Taveira Timbira - http://www.timbira.com.br/
PostgreSQL: Consultoria, Desenvolvimento, Suporte 24x7 e Treinamento

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#3Dilip kumar
dilip.kumar@huawei.com
In reply to: Euler Taveira de Oliveira (#2)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 08 November 2013 03:22, Euler Taveira Wrote

On 07-11-2013 09:42, Dilip kumar wrote:

Dilip, this is on my TODO for 9.4. I've already had a half-baked patch
for it. Let's see what I can come up with.

OK, let me know if I can contribute to this.

Is it required to move the common code for parallel operation of

pg_dump and vacuumdb to one place and reuse it ?

I'm not sure about that because the pg_dump parallel code is tied to
TOC entries. Also, dependency matters for pg_dump, while in the scripts
case a chosen order will be used. However, vacuumdb can share
the parallel code with clusterdb and reindexdb (my patch does it).

+1

Of course, a refactor to unify parallel code (pg_dump and scripts) can
be done in a separate patch.

Prototype patch is attached in the mail, please provide your

feedback/Suggestions...

I'll try to merge your patch with the one I have here until the next CF.

Regards,
Dilip


#4Jan Lentfer
Jan.Lentfer@web.de
In reply to: Dilip kumar (#1)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 07.11.2013 12:42, Dilip kumar wrote:

This patch implementing the following TODO item

Allow parallel cores to be used by vacuumdb
/messages/by-id/4F10A728.7090403@agliodbs.com

Like Parallel pg_dump, vacuumdb is provided with the option to run the vacuum of multiple tables in parallel. [ vacuumdb -j ]

1. One new option is provided with vacuumdb to give the number of workers.

2. All worker will be started in beginning and all will be waiting for the vacuum instruction from the master.

3. Now, if table list is provided in vacuumdb command using -t then, it will send the vacuum of one table to one of the IDLE worker, next table to next IDLE worker and so on.

4. If vacuum is given for one DB then, it will execute select on pg_class to get the table list and fetch the table name one by one and also assign the vacuum responsibility to IDLE workers.

[...]

For this use case, would it make sense to queue work (tables) in
order of their size, starting with the largest one?

For the case where you have tables of varying size, this would lead to a
reduced overall processing time, as it prevents large (read: long
processing time) tables from being processed in the last step. Processing
large tables first, and filling up "processing slots/jobs" as they get
free with smaller tables one after the other, would save overall
execution time.

Regards

Jan

--
professional: http://www.oscar-consult.de


#5Dilip kumar
dilip.kumar@huawei.com
In reply to: Jan Lentfer (#4)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 08 November 2013 13:38, Jan Lentfer

For this use case, would it make sense to queue work (tables) in order of their size, starting on the largest one?

For the case where you have tables of varying size this would lead to a reduced overall processing time as it prevents large (read: long processing time) tables to be processed in the last step. While processing large tables at first and filling up "processing slots/jobs" when they get free with smaller tables one after the other would safe overall execution time.

Good point, I have made the change and attached the modified patch.

Regards,
Dilip

Attachments:

vacuumdb_parallel_v2.patch (application/octet-stream, +1338 -14)
#6Euler Taveira de Oliveira
In reply to: Jan Lentfer (#4)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 08-11-2013 05:07, Jan Lentfer wrote:

For the case where you have tables of varying size this would lead to
a reduced overall processing time as it prevents large (read: long
processing time) tables to be processed in the last step. While
processing large tables at first and filling up "processing
slots/jobs" when they get free with smaller tables one after the
other would safe overall execution time.

That is certainly a good strategy (not the optimal one [1] -- that is
hard to achieve). Also, the strategy must:

(i) consider the relation age before size (for vacuum);
(ii) consider that you can't pick indexes for the same relation (for
reindex).

[1]: /messages/by-id/CA+TgmobwxqsagXKtyQ1S8+gMpqxF_MLXv=4350tFZVqAwKEqgQ@mail.gmail.com

--
Euler Taveira Timbira - http://www.timbira.com.br/
PostgreSQL: Consultoria, Desenvolvimento, Suporte 24x7 e Treinamento


#7Michael Paquier
michael@paquier.xyz
In reply to: Dilip kumar (#1)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On Thu, Nov 7, 2013 at 8:42 PM, Dilip kumar <dilip.kumar@huawei.com> wrote:

This patch implementing the following TODO item

Allow parallel cores to be used by vacuumdb
/messages/by-id/4F10A728.7090403@agliodbs.com


--
Michael


#8Michael Paquier
michael@paquier.xyz
In reply to: Dilip kumar (#1)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On Thu, Nov 7, 2013 at 8:42 PM, Dilip kumar <dilip.kumar@huawei.com> wrote:

This patch implementing the following TODO item

Allow parallel cores to be used by vacuumdb
/messages/by-id/4F10A728.7090403@agliodbs.com

Cool. Could you add this patch to the next commit fest for 9.4? It
begins officially in a couple of days. Here is the URL to it:
https://commitfest.postgresql.org/action/commitfest_view?id=20

Regards,
--
Michael


#9Euler Taveira de Oliveira
In reply to: Dilip kumar (#5)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 08-11-2013 06:20, Dilip kumar wrote:

On 08 November 2013 13:38, Jan Lentfer

For this use case, would it make sense to queue work (tables) in order of their size, starting on the largest one?

For the case where you have tables of varying size this would lead to a reduced overall processing time as it prevents large (read: long processing time) tables to be processed in the last step. While processing large tables at first and filling up "processing slots/jobs" when they get free with smaller tables one after the other would safe overall execution time.

Good point, I have made the change and attached the modified patch.

You didn't submit it for a CF, did you? Is it too late for this CF?

--
Euler Taveira Timbira - http://www.timbira.com.br/
PostgreSQL: Consultoria, Desenvolvimento, Suporte 24x7 e Treinamento


#10Alvaro Herrera
alvherre@2ndquadrant.com
In reply to: Euler Taveira de Oliveira (#9)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

Euler Taveira wrote:


Don't you submit it for a CF, do you? Is it too late for this CF?

Not too late.

--
Álvaro Herrera http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services


#11Dilip kumar
dilip.kumar@huawei.com
In reply to: Euler Taveira de Oliveira (#9)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 16 January 2014 19:53, Euler Taveira Wrote,

For the case where you have tables of varying size this would lead

to a reduced overall processing time as it prevents large (read: long
processing time) tables to be processed in the last step. While
processing large tables at first and filling up "processing slots/jobs"
when they get free with smaller tables one after the other would safe
overall execution time.

Good point, I have made the change and attached the modified patch.

Don't you submit it for a CF, do you? Is it too late for this CF?

Attached is the latest updated patch:
1. Rebased the patch to the current GIT head.
2. Doc is updated.
3. Supported parallel execution for the all-db option also.

I will add the same to the current open commitfest.

Regards,
Dilip

Attachments:

vacuumdb_parallel_v3.patch (application/octet-stream, +1492 -75)
#12Jeff Janes
jeff.janes@gmail.com
In reply to: Dilip kumar (#11)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On Fri, Mar 21, 2014 at 12:48 AM, Dilip kumar <dilip.kumar@huawei.com> wrote:


Attached the latest updated patch
1. Rebased the patch to current GIT head.
2. Doc is updated.
3. Supported parallel execution for all db option also.

This patch needs to be rebased after the analyze-in-stages patch,
c92c3d50d7fbe7391b5fc864b44434.

Although that patch still needs some work itself, despite being
committed, as it still loops over the stages for each db, rather than
the dbs for each stage.

So I don't know if this patch is really reviewable at this point, as
it is not clear how those things are going to interact with each
other.

Cheers,

Jeff


#13Dilip kumar
dilip.kumar@huawei.com
In reply to: Jeff Janes (#12)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 24 June 2014 03:31, Jeff Wrote,

Attached the latest updated patch
1. Rebased the patch to current GIT head.
2. Doc is updated.
3. Supported parallel execution for all db option also.

This patch needs to be rebased after the analyze-in-stages patch,
c92c3d50d7fbe7391b5fc864b44434.

Thank you for giving your attention to this; I will rebase it.

Although that patch still needs to some work itself, despite being
committed, as still loops over the stages for each db, rather than the
dbs for each stage.

If I understood your comment properly, you mean that in
vacuum_all_databases, instead of running all DBs in parallel, we are
running them db by db in parallel?

I think we can fix this.

So I don't know if this patch is really reviewable at this point, as it
is not clear how those things are going to interact with each other.

Exactly which points do you want to mention here?

Regards,
Dilip Kumar


#14Jeff Janes
jeff.janes@gmail.com
In reply to: Dilip kumar (#13)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On Monday, June 23, 2014, Dilip kumar <dilip.kumar@huawei.com> wrote:

On 24 June 2014 03:31, Jeff Wrote,

Attached the latest updated patch
1. Rebased the patch to current GIT head.
2. Doc is updated.
3. Supported parallel execution for all db option also.

This patch needs to be rebased after the analyze-in-stages patch,
c92c3d50d7fbe7391b5fc864b44434.

Thank you for giving your attention to this, I will rebase this..

Although that patch still needs to some work itself, despite being
committed, as still loops over the stages for each db, rather than the
dbs for each stage.

If I understood your comment properly, Here you mean to say that
In vacuum_all_databases instead to running all DB's in parallel, we are
running db by db in parallel?

I mean that the other commit, the one conflicting with your patch, is still
not finished. It probably would not have been committed if we realized the
problem at the time. That other patch runs analyze in stages at different
settings of default_statistics_target, but it has the loops in the wrong
order, so it analyzes one database in all three stages, then moves to the
next database. I think that these two changes are going to interact with
each other. But I can't predict right now what that interaction will look
like. So it is hard for me to evaluate your patch, until the other one is
resolved.

Normally I would evaluate your patch in isolation, but since the
conflicting patch is already committed (and is in the 9.4 branch) that
would probably not be very useful in this case.

Cheers,

Jeff

#15Dilip kumar
dilip.kumar@huawei.com
In reply to: Jeff Janes (#14)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 24 June 2014 11:02 Jeff Wrote,

I mean that the other commit, the one conflicting with your patch, is still not finished. It probably would not have been committed if we realized the problem at the time. That other patch runs analyze in stages at
different settings of default_statistics_target, but it has the loops in the wrong order, so it analyzes one database in all three stages, then moves to the next database. I think that these two changes are going to
interact with each other. But I can't predict right now what that interaction will look like. So it is hard for me to evaluate your patch, until the other one is resolved.

Normally I would evaluate your patch in isolation, but since the conflicting patch is already committed (and is in the 9.4 branch) that would probably not be very useful in this case.

OK, got your point. I will also try to think about how these two patches can interact together.

Regards,
Dilip

#16Masahiko Sawada
sawada.mshk@gmail.com
In reply to: Dilip kumar (#11)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

Hi,

I got the following FAILED hunks when I applied v3 to HEAD.

$ patch -d. -p1 < ../patch/vacuumdb_parallel_v3.patch
patching file doc/src/sgml/ref/vacuumdb.sgml
Hunk #1 succeeded at 224 (offset 20 lines).
patching file src/bin/scripts/Makefile
Hunk #2 succeeded at 65 with fuzz 2 (offset -1 lines).
patching file src/bin/scripts/vac_parallel.c
patching file src/bin/scripts/vac_parallel.h
patching file src/bin/scripts/vacuumdb.c
Hunk #3 succeeded at 61 with fuzz 2.
Hunk #4 succeeded at 87 (offset 2 lines).
Hunk #5 succeeded at 143 (offset 2 lines).
Hunk #6 succeeded at 158 (offset 5 lines).
Hunk #7 succeeded at 214 with fuzz 2 (offset 5 lines).
Hunk #8 FAILED at 223.
Hunk #9 succeeded at 374 with fuzz 1 (offset 35 lines).
Hunk #10 FAILED at 360.
Hunk #11 FAILED at 387.
3 out of 11 hunks FAILED -- saving rejects to file
src/bin/scripts/vacuumdb.c.rej

---
Sawada Masahiko


#17Dilip kumar
dilip.kumar@huawei.com
In reply to: Masahiko Sawada (#16)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 25 June 2014 23:37 Sawada Masahiko Wrote

I got following FAILED when I patched v3 to HEAD.

$ patch -d. -p1 < ../patch/vacuumdb_parallel_v3.patch
patching file doc/src/sgml/ref/vacuumdb.sgml
Hunk #1 succeeded at 224 (offset 20 lines).
patching file src/bin/scripts/Makefile
Hunk #2 succeeded at 65 with fuzz 2 (offset -1 lines).
patching file src/bin/scripts/vac_parallel.c
patching file src/bin/scripts/vac_parallel.h
patching file src/bin/scripts/vacuumdb.c
Hunk #3 succeeded at 61 with fuzz 2.
Hunk #4 succeeded at 87 (offset 2 lines).
Hunk #5 succeeded at 143 (offset 2 lines).
Hunk #6 succeeded at 158 (offset 5 lines).
Hunk #7 succeeded at 214 with fuzz 2 (offset 5 lines).
Hunk #8 FAILED at 223.
Hunk #9 succeeded at 374 with fuzz 1 (offset 35 lines).
Hunk #10 FAILED at 360.
Hunk #11 FAILED at 387.
3 out of 11 hunks FAILED -- saving rejects to file src/bin/scripts/vacuumdb.c.rej

Thank you for giving your time. Please review the updated patch attached in the mail.

1. Rebased the patch

2. Implemented parallel execution for new option --analyze-in-stages

Regards,
Dilip Kumar

Attachments:

vacuumdb_parallel_v4.patch (application/octet-stream, +1568 -99)
#18Jeff Janes
jeff.janes@gmail.com
In reply to: Dilip kumar (#17)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On Thu, Jun 26, 2014 at 2:35 AM, Dilip kumar <dilip.kumar@huawei.com> wrote:

Thank you for giving your time, Please review the updated patch attached in
the mail.

1. Rebased the patch

2. Implemented parallel execution for new option --analyze-in-stages

Hi Dilip,

Thanks for rebasing.

I haven't done an architectural or code review on it, I just applied
it and used it a little on Linux.

Based on that, I find most importantly that it doesn't seem to
correctly vacuum tables which have upper case letters in the name,
because it does not quote the table names when they need quotes.

Of course that needs to be fixed, but taking it as it is, the
resulting error message to the console is just:

: Execute failed

Which is not very informative. I get the same error if I do a "pg_ctl
stop -mi" while running the parallel vacuumdb. Without the -j
option it produces a more descriptive error message "FATAL:
terminating connection due to administrator command", so something
about the new feature suppresses the informative error messages.

I get some compiler warnings with the new patch:

vac_parallel.c: In function 'parallel_msg_master':
vac_parallel.c:147: warning: function might be possible candidate for
'gnu_printf' format attribute
vac_parallel.c:147: warning: function might be possible candidate for
'gnu_printf' format attribute
vac_parallel.c: In function 'exit_horribly':
vac_parallel.c:1071: warning: 'noreturn' function does return

In the usage message, the string has a tab embedded within it
(immediately before "use") that should be converted to literal spaces,
otherwise the output of --help gets misaligned:

printf(_(" -j, --jobs=NUM use this many parallel
jobs to vacuum\n"));

Thanks for the work on this.

Cheers,

Jeff


#19Dilip kumar
dilip.kumar@huawei.com
In reply to: Jeff Janes (#18)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On 27 June 2014 02:57, Jeff Wrote,

Based on that, I find most importantly that it doesn't seem to
correctly vacuum tables which have upper case letters in the name,
because it does not quote the table names when they need quotes.

Thanks for your comments....

There are two problems.

First -> When doing the vacuum of the complete database, if any table had an upper-case letter in its name, it was giving an error.
--FIXED by adding quotes around the table name.

Second -> When the user passes the table using the -t option and it has an upper-case letter.
--This is an existing problem (without the parallel implementation).

One solution to this is to always add quotes to the relation name passed by the user, but this can break existing applications for some users.

Of course that needs to be fixed, but taking it as it is, the resulting
error message to the console is just:

FIXED

Which is not very informative. I get the same error if I do a "pg_ctl
shutdown -mi" while running the parallel vacuumdb. Without the -j
option it produces a more descriptive error message "FATAL:
terminating connection due to administrator command", so something
about the new feature suppresses the informative error messages.

I get some compiler warnings with the new patch:

vac_parallel.c: In function 'parallel_msg_master':
vac_parallel.c:147: warning: function might be possible candidate for
'gnu_printf' format attribute
vac_parallel.c:147: warning: function might be possible candidate for
'gnu_printf' format attribute
vac_parallel.c: In function 'exit_horribly':
vac_parallel.c:1071: warning: 'noreturn' function does return

FIXED

In the usage message, the string has a tab embedded within it
(immediately before "use") that should be converted to literal spaces,
otherwise the output of --help gets misaligned:

printf(_(" -j, --jobs=NUM use this many parallel
jobs to vacuum\n"));

FIXED

Updated patch is attached in the mail..

Thanks & Regards,
Dilip Kumar

Attachments:

vacuumdb_parallel_v5.patch (application/octet-stream, +1600 -99)
#20Jeff Janes
jeff.janes@gmail.com
In reply to: Dilip kumar (#19)
Re: TODO : Allow parallel cores to be used by vacuumdb [ WIP ]

On Fri, Jun 27, 2014 at 4:10 AM, Dilip kumar <dilip.kumar@huawei.com> wrote:
...

Updated patch is attached in the mail..

Thanks Dilip.

I get a compiler warning when building on Windows. When I started
looking into that, I see that two files have too much code duplication
between them:

src/bin/scripts/vac_parallel.c (new file)
src/bin/pg_dump/parallel.c (existing file)

In particular, pgpipe is almost an exact duplicate between them,
except the copy in vac_parallel.c has fallen behind changes made to
parallel.c. (Those changes would have fixed the Windows warnings). I
think that this function (and perhaps other parts as
well--"exit_horribly" for example) need to refactored into a common
file that both files can include. I don't know where the best place
for that would be, though. (I haven't done this type of refactoring
myself.)

Also, there are several places in the patch which use spaces for
indentation where tabs are called for by the coding style. It looks
like you may have copied the code from one terminal window and copied
it into another one, converting tabs to spaces in the process. This
makes it hard to evaluate the amount of code duplication.

In some places the code spins in a tight loop while waiting for a
worker process to become free. If I strace the process, I got a long
list of selects with 0 time outs:

select(13, [6 8 10 12], NULL, NULL, {0, 0}) = 0 (Timeout)

I have not tried to track down the code that causes it. I did notice
that vacuumdb spends an awful lot of time at the top of the Linux
"top" output, and this is probably why.

Cheers,

Jeff


#21Alvaro Herrera
alvherre@2ndquadrant.com
In reply to: Jeff Janes (#20)
#22Dilip kumar
dilip.kumar@huawei.com
In reply to: Jeff Janes (#20)
#23Dilip kumar
dilip.kumar@huawei.com
In reply to: Alvaro Herrera (#21)
#24Masahiko Sawada
sawada.mshk@gmail.com
In reply to: Dilip kumar (#23)
#25Dilip kumar
dilip.kumar@huawei.com
In reply to: Masahiko Sawada (#24)
#26Jeff Janes
jeff.janes@gmail.com
In reply to: Alvaro Herrera (#21)
#27 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #26 (Jeff Janes)
#28 Masahiko Sawada <sawada.mshk@gmail.com>, in reply to #25 (Dilip kumar)
#29 Dilip kumar <dilip.kumar@huawei.com>, in reply to #28 (Masahiko Sawada)
#30 Amit Kapila <amit.kapila16@gmail.com>, in reply to #27 (Alvaro Herrera)
#31 Dilip kumar <dilip.kumar@huawei.com>, in reply to #27 (Alvaro Herrera)
#32 Robert Haas <robertmhaas@gmail.com>, in reply to #31 (Dilip kumar)
#33 Dilip kumar <dilip.kumar@huawei.com>, in reply to #32 (Robert Haas)
#34 Jeff Janes <jeff.janes@gmail.com>, in reply to #19 (Dilip kumar)
#35 Magnus Hagander <magnus@hagander.net>, in reply to #23 (Dilip kumar)
#36 Dilip kumar <dilip.kumar@huawei.com>, in reply to #35 (Magnus Hagander)
#37 Tom Lane <tgl@sss.pgh.pa.us>, in reply to #36 (Dilip kumar)
#38 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #37 (Tom Lane)
#39 Magnus Hagander <magnus@hagander.net>, in reply to #38 (Alvaro Herrera)
#40 Dilip kumar <dilip.kumar@huawei.com>, in reply to #39 (Magnus Hagander)
#41 Dilip kumar <dilip.kumar@huawei.com>, in reply to #39 (Magnus Hagander)
#42 Jeff Janes <jeff.janes@gmail.com>, in reply to #40 (Dilip kumar)
#43 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #42 (Jeff Janes)
#44 Amit Kapila <amit.kapila16@gmail.com>, in reply to #41 (Dilip kumar)
#45 Dilip kumar <dilip.kumar@huawei.com>, in reply to #44 (Amit Kapila)
#46 Amit Kapila <amit.kapila16@gmail.com>, in reply to #45 (Dilip kumar)
#47 Dilip kumar <dilip.kumar@huawei.com>, in reply to #46 (Amit Kapila)
#48 Robert Haas <robertmhaas@gmail.com>, in reply to #46 (Amit Kapila)
#49 Amit Kapila <amit.kapila16@gmail.com>, in reply to #47 (Dilip kumar)
#50 Amit Kapila <amit.kapila16@gmail.com>, in reply to #48 (Robert Haas)
#51 Robert Haas <robertmhaas@gmail.com>, in reply to #50 (Amit Kapila)
#52 Amit Kapila <amit.kapila16@gmail.com>, in reply to #51 (Robert Haas)
#53 Dilip kumar <dilip.kumar@huawei.com>, in reply to #52 (Amit Kapila)
#54 Amit Kapila <amit.kapila16@gmail.com>, in reply to #49 (Amit Kapila)
#55 Dilip kumar <dilip.kumar@huawei.com>, in reply to #54 (Amit Kapila)
#56 Jeff Janes <jeff.janes@gmail.com>, in reply to #55 (Dilip kumar)
#57 Jeff Janes <jeff.janes@gmail.com>, in reply to #56 (Jeff Janes)
#58 Amit Kapila <amit.kapila16@gmail.com>, in reply to #55 (Dilip kumar)
#59 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #58 (Amit Kapila)
#60 Amit Kapila <amit.kapila16@gmail.com>, in reply to #59 (Alvaro Herrera)
#61 Gavin Flower <GavinFlower@archidevsys.co.nz>, in reply to #59 (Alvaro Herrera)
#62 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #61 (Gavin Flower)
#63 Gregory Smith <gregsmithpgsql@gmail.com>, in reply to #61 (Gavin Flower)
#64 Jeff Janes <jeff.janes@gmail.com>, in reply to #62 (Alvaro Herrera)
#65 Gavin Flower <GavinFlower@archidevsys.co.nz>, in reply to #63 (Gregory Smith)
#66 Dilip kumar <dilip.kumar@huawei.com>, in reply to #57 (Jeff Janes)
#67 Dilip kumar <dilip.kumar@huawei.com>, in reply to #58 (Amit Kapila)
#68 Simon Riggs <simon@2ndQuadrant.com>, in reply to #64 (Jeff Janes)
#69 Amit Kapila <amit.kapila16@gmail.com>, in reply to #68 (Simon Riggs)
#70 Simon Riggs <simon@2ndQuadrant.com>, in reply to #69 (Amit Kapila)
#71 Amit Kapila <amit.kapila16@gmail.com>, in reply to #70 (Simon Riggs)
#72 Simon Riggs <simon@2ndQuadrant.com>, in reply to #71 (Amit Kapila)
#73 Amit Kapila <amit.kapila16@gmail.com>, in reply to #72 (Simon Riggs)
#74 Simon Riggs <simon@2ndQuadrant.com>, in reply to #73 (Amit Kapila)
#75 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #73 (Amit Kapila)
#76 Simon Riggs <simon@2ndQuadrant.com>, in reply to #75 (Alvaro Herrera)
#77 Amit Kapila <amit.kapila16@gmail.com>, in reply to #67 (Dilip kumar)
#78 Amit Kapila <amit.kapila16@gmail.com>, in reply to #77 (Amit Kapila)
#79 Dilip kumar <dilip.kumar@huawei.com>, in reply to #77 (Amit Kapila)
#80 Amit Kapila <amit.kapila16@gmail.com>, in reply to #79 (Dilip kumar)
#81 Dilip kumar <dilip.kumar@huawei.com>, in reply to #80 (Amit Kapila)
#82 Amit Kapila <amit.kapila16@gmail.com>, in reply to #81 (Dilip kumar)
#83 Amit Kapila <amit.kapila16@gmail.com>, in reply to #78 (Amit Kapila)
#84 Dilip kumar <dilip.kumar@huawei.com>, in reply to #83 (Amit Kapila)
#85 Amit Kapila <amit.kapila16@gmail.com>, in reply to #84 (Dilip kumar)
#86 Dilip kumar <dilip.kumar@huawei.com>, in reply to #85 (Amit Kapila)
#87 Amit Kapila <amit.kapila16@gmail.com>, in reply to #86 (Dilip kumar)
#88 Dilip kumar <dilip.kumar@huawei.com>, in reply to #87 (Amit Kapila)
#89 Amit Kapila <amit.kapila16@gmail.com>, in reply to #88 (Dilip kumar)
#90 Michael Paquier <michael@paquier.xyz>, in reply to #89 (Amit Kapila)
#91 Dilip kumar <dilip.kumar@huawei.com>, in reply to #89 (Amit Kapila)
#92 Amit Kapila <amit.kapila16@gmail.com>, in reply to #91 (Dilip kumar)
#93 Dilip kumar <dilip.kumar@huawei.com>, in reply to #89 (Amit Kapila)
#94 Amit Kapila <amit.kapila16@gmail.com>, in reply to #93 (Dilip kumar)
#95 Dilip kumar <dilip.kumar@huawei.com>, in reply to #94 (Amit Kapila)
#96 Amit Kapila <amit.kapila16@gmail.com>, in reply to #95 (Dilip kumar)
#97 Dilip kumar <dilip.kumar@huawei.com>, in reply to #96 (Amit Kapila)
#98 Amit Kapila <amit.kapila16@gmail.com>, in reply to #97 (Dilip kumar)
#99 Dilip kumar <dilip.kumar@huawei.com>, in reply to #98 (Amit Kapila)
#100 Amit Kapila <amit.kapila16@gmail.com>, in reply to #99 (Dilip kumar)
#101 Kevin Grittner <Kevin.Grittner@wicourts.gov>, in reply to #100 (Amit Kapila)
#102 Amit Kapila <amit.kapila16@gmail.com>, in reply to #101 (Kevin Grittner)
#103 Andres Freund <andres@anarazel.de>, in reply to #98 (Amit Kapila)
#104 Michael Paquier <michael@paquier.xyz>, in reply to #103 (Andres Freund)
#105 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #104 (Michael Paquier)
#106 Michael Paquier <michael@paquier.xyz>, in reply to #105 (Alvaro Herrera)
#107 Dilip kumar <dilip.kumar@huawei.com>, in reply to #103 (Andres Freund)
#108 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #107 (Dilip kumar)
#109 Amit Kapila <amit.kapila16@gmail.com>, in reply to #108 (Alvaro Herrera)
#110 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #109 (Amit Kapila)
#111 Amit Kapila <amit.kapila16@gmail.com>, in reply to #110 (Alvaro Herrera)
#112 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #111 (Amit Kapila)
#113 Dilip kumar <dilip.kumar@huawei.com>, in reply to #112 (Alvaro Herrera)
#114 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #113 (Dilip kumar)
#115 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #103 (Andres Freund)
#116 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #114 (Alvaro Herrera)
#117 Dilip kumar <dilip.kumar@huawei.com>, in reply to #114 (Alvaro Herrera)
#118 Dilip kumar <dilip.kumar@huawei.com>, in reply to #116 (Alvaro Herrera)
#119 Pavel Stehule <pavel.stehule@gmail.com>, in reply to #117 (Dilip kumar)
#120 Fabrízio de Royes Mello <fabriziomello@gmail.com>, in reply to #119 (Pavel Stehule)
#121 Pavel Stehule <pavel.stehule@gmail.com>, in reply to #120 (Fabrízio de Royes Mello)
#122 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #121 (Pavel Stehule)
#123 Laurent Laborde <kerdezixe@gmail.com>, in reply to #100 (Amit Kapila)
#124 Alvaro Herrera <alvherre@2ndquadrant.com>, in reply to #123 (Laurent Laborde)
#125 Laurent Laborde <kerdezixe@gmail.com>, in reply to #124 (Alvaro Herrera)