Improve pg_dump dumping publication tables

Started by Hsu, John · over 5 years ago · 4 messages · pgsql-hackers
#1 Hsu, John
hsuchen@amazon.com

Hi hackers,

I was wondering if there's a good reason for pg_dump's getPublicationTables()
to iterate through all tables one by one, querying each to see whether it has
a corresponding publication, other than memory concerns?

Attached is a patch that queries for all publication tables at once and then
looks up the corresponding TableInfo entries in tblinfoindex.

Users are likely to have far more tables than publications, so this should be
considerably faster, especially when there are millions of tables.
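The consolidated approach can be sketched as a single catalog query (a rough sketch for illustration; the exact query in the patch may differ):

```sql
-- One round trip: fetch every published table across all publications,
-- instead of querying pg_publication_rel once per table.
SELECT p.pubname,
       pr.prrelid AS relid
FROM pg_catalog.pg_publication_rel pr
     JOIN pg_catalog.pg_publication p ON p.oid = pr.prpubid
ORDER BY p.pubname;
```

pg_dump can then match each returned relid against the tables it has already collected, rather than issuing one query per table.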

Setup:
time pg_dump --schema-only -d postgres -f dump.sql -v 2> dump.log
10k tables, each table having its own publication.

Before patch:
real 0m6.409s
user 0m0.234s
sys 0m0.584s

With patch:
real 0m5.483s
user 0m0.399s
sys 0m0.443s

10k tables, no publications.

Before patch:
real 0m6.193s
user 0m0.276s
sys 0m0.531s

With patch:
real 0m5.261s
user 0m0.254s
sys 0m0.377s

Thanks,

John H

Attachments:

0001-pg_dump-Iterate-through-all-publication-tables-in-pg.patch (+37, -62)
#2 Cary Huang
cary.huang@highgo.ca
In reply to: Hsu, John (#1)
Re: Improve pg_dump dumping publication tables

The following review has been posted through the commitfest application:
make installcheck-world: tested, passed
Implements feature: tested, passed
Spec compliant: tested, passed
Documentation: not tested

Hi

I applied the patch to the PG master branch successfully, and it appears to do as described. If there are many tables in Postgres and only a few of them have publications, getPublicationTables() loops through all the tables and checks each one to see whether it is the desired relation type and whether it has one or more publications, so potentially a lot of looping and checking happens. John's patch replaces all of that with a single query returning every table that has a publication.

One concern: if, as you say, the server has a million tables and all of them have publications, the query can return a million rows, which requires a lot of memory on the client side, and pg_dump may need to handle this extreme case with some form of pagination, which adds complexity to the function's logic. The existing logic does not have this problem, because it simply queries one table's publications at a time and does so a million times.
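If client-side result size ever did become a concern, one option (not part of the patch, just a sketch of the pagination Cary mentions) would be fetching the rows in batches with a server-side cursor:

```sql
-- Cursors require an explicit transaction unless declared WITH HOLD.
BEGIN;
DECLARE pubtab_cur CURSOR FOR
    SELECT p.pubname, pr.prrelid
    FROM pg_catalog.pg_publication_rel pr
         JOIN pg_catalog.pg_publication p ON p.oid = pr.prpubid;
-- Fetch 1000 rows per round trip; repeat until no rows are returned.
FETCH FORWARD 1000 FROM pubtab_cur;
CLOSE pubtab_cur;
COMMIT;
```

This keeps client memory bounded at the cost of extra round trips, which is the complexity trade-off Cary describes.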

thank you
Cary Huang
HighGo Software

#3 John Hsu
chenhao.john.hsu@gmail.com
In reply to: Cary Huang (#2)
Re: Improve pg_dump dumping publication tables

Hi Cary,

Thanks for taking a look. I agree there's a risk since there's more memory usage on the client side, and if you're dumping, say, millions of publication tables, that could be problematic.
I'd like to think this isn't any riskier than existing pg_dump code such as getTables(...), where presumably we would run into similar problems.

Cheers,
John H

#4 Tom Lane
tgl@sss.pgh.pa.us
In reply to: Hsu, John (#1)
Re: Improve pg_dump dumping publication tables

"Hsu, John" <hsuchen@amazon.com> writes:

I was wondering if there's a good reason for pg_dump's getPublicationTables()
to iterate through all tables one by one, querying each to see whether it has
a corresponding publication, other than memory concerns?

I just came across this entry in the CommitFest, and I see that it's
practically the same as something I did in passing in 8e396a773.
The main difference is that I got rid of the server-side join, too,
in favor of having getPublicationTables locate the PublicationInfo
that should have been created already by getPublications. (The
immediate rationale for that was to get the publication owner name
from the PublicationInfo; but we'd have had to do that eventually
anyway if we ever want to allow selective dumping of publications.)
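For reference, the publication owner Tom mentions is available directly from the catalog; a sketch of how one might fetch it when collecting publications (not necessarily the exact query used in 8e396a773):

```sql
-- Fetch each publication together with its owner, resolved to a role name
-- via the regrole cast, so the owner travels with the PublicationInfo.
SELECT p.oid,
       p.pubname,
       p.pubowner::pg_catalog.regrole AS owner
FROM pg_catalog.pg_publication p;
```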

Anyway, I apologize for treading on your toes. If I'd noticed this
thread earlier I would certainly have given you credit for being the
first to have the idea.

As far as the memory argument goes, I'm not too concerned about it
because both the PublicationRelInfo structs and the tuples of the
transient PGresult are pretty small. In principle if you had very
many entries in pg_publication_rel, but a selective dump was only
interested in a few of them, there might be an interesting amount of
space wasted here. But that argument fails because even a selective
dump collects data about all tables, for reasons that are hard to get
around. The incremental space usage for PublicationRelInfos seems
unlikely to be significant compared to the per-table data we'd have
anyway.

I'll mark this CF entry "withdrawn", since it wasn't rejected
exactly. Too bad we don't have a classification of "superseded
by events", or something like that.

regards, tom lane