replication slots replicated to standbys?
Someone reported that a replication slot that existed at the time a base
backup was done on the master was copied to the standby. Because they
didn't realize it, their WAL was not being recycled on the standby.
Is that possible? Is it a known behavior? I don't see it documented.
--
Bruce Momjian <bruce@momjian.us> http://momjian.us
EnterpriseDB http://enterprisedb.com
+ As you are, so once was I. As I am, so you will be. +
+ Ancient Roman grave inscription +
On Sat, Aug 20, 2016 at 1:39 PM, Bruce Momjian <bruce@momjian.us> wrote:
Someone reported that a replication slot that existed at the time a base
backup was done on the master was copied to the standby. Because they
didn't realize it, their WAL was not being recycled on the standby.
Is that possible? Is it a known behavior? I don't see it documented.
From backup.sgml:
<para>
It is often a good idea to also omit from the backup the files
within the cluster's <filename>pg_replslot/</> directory, so that
replication slots that exist on the master do not become part of the
backup. Otherwise, the subsequent use of the backup to create a standby
may result in indefinite retention of WAL files on the standby, and
possibly bloat on the master if hot standby feedback is enabled, because
the clients that are using those replication slots will still be connecting
to and updating the slots on the master, not the standby. Even if the
backup is only intended for use in creating a new master, copying the
replication slots isn't expected to be particularly useful, since the
contents of those slots will likely be badly out of date by the time
the new master comes on line.
</para>
Note as well that pg_basebackup omits the contents of pg_replslot/ and
creates an empty directory.
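(To make that advice concrete, here is a minimal sketch of a manual base
backup that skips pg_replslot, mimicking pg_basebackup's behavior. Host
names and paths are placeholders, and the two-argument pg_start_backup()
is the exclusive-mode form:)

    # Mark the start of the backup on the master.
    psql -h master.example.com -c "SELECT pg_start_backup('clone', true);"

    # Copy the data directory, keeping pg_replslot/ itself but not its
    # contents; pg_xlog contents and postmaster.pid are likewise unwanted.
    rsync -a \
        --exclude='pg_replslot/*' \
        --exclude='pg_xlog/*' \
        --exclude='postmaster.pid' \
        /var/lib/postgresql/data/ standby:/var/lib/postgresql/data/

    # Mark the end of the backup.
    psql -h master.example.com -c "SELECT pg_stop_backup();"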
--
Michael
On Sat, Aug 20, 2016 at 01:43:42PM +0900, Michael Paquier wrote:
On Sat, Aug 20, 2016 at 1:39 PM, Bruce Momjian <bruce@momjian.us> wrote:
Someone reported that a replication slot that existed at the time a base
backup was done on the master was copied to the standby. Because they
didn't realize it, their WAL was not being recycled on the standby.
Is that possible? Is it a known behavior? I don't see it documented.
From backup.sgml:
<para>
It is often a good idea to also omit from the backup the files
within the cluster's <filename>pg_replslot/</> directory, so that
replication slots that exist on the master do not become part of the
backup. Otherwise, the subsequent use of the backup to create a standby
may result in indefinite retention of WAL files on the standby, and
possibly bloat on the master if hot standby feedback is enabled, because
the clients that are using those replication slots will still be connecting
to and updating the slots on the master, not the standby. Even if the
backup is only intended for use in creating a new master, copying the
replication slots isn't expected to be particularly useful, since the
contents of those slots will likely be badly out of date by the time
the new master comes on line.
</para>
Note as well that pg_basebackup omits the contents of pg_replslot/ and
creates an empty directory.
Seems like another good idea to use pg_basebackup rather than manually
doing base backups; Magnus has been saying this for a while.
I suppose there is no way we could remove this error-prone behavior
because replication slots must survive server restarts. Is there no way
to know if we are starting a standby from a fresh base backup vs.
restarting a standby? In that case we could clear the replication
slots. Are there any other error-prone things copied from the master?
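(As an illustration, this is how a leftover slot shows up on the clone;
the host name is a placeholder. A slot that nothing connects to, with a
stale restart_lsn, is what pins the WAL:)

    # List slots on the standby. A slot copied in the base backup will
    # show active = f, yet still hold back WAL recycling from its
    # restart_lsn onward.
    psql -h standby.example.com -c "
        SELECT slot_name, slot_type, active, restart_lsn
        FROM pg_replication_slots;"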
--
Bruce Momjian <bruce@momjian.us> http://momjian.us
EnterpriseDB http://enterprisedb.com
+ As you are, so once was I. As I am, so you will be. +
+ Ancient Roman grave inscription +
On 21 Aug 2016 12:36 AM, "Bruce Momjian" <bruce@momjian.us> wrote:
On Sat, Aug 20, 2016 at 01:43:42PM +0900, Michael Paquier wrote:
On Sat, Aug 20, 2016 at 1:39 PM, Bruce Momjian <bruce@momjian.us> wrote:
Someone reported that a replication slot that existed at the time a base
backup was done on the master was copied to the standby. Because they
didn't realize it, their WAL was not being recycled on the standby.
Is that possible? Is it a known behavior? I don't see it documented.
From backup.sgml:
<para>
It is often a good idea to also omit from the backup the files
within the cluster's <filename>pg_replslot/</> directory, so that
replication slots that exist on the master do not become part of the
backup. Otherwise, the subsequent use of the backup to create a standby
may result in indefinite retention of WAL files on the standby, and
possibly bloat on the master if hot standby feedback is enabled, because
the clients that are using those replication slots will still be connecting
to and updating the slots on the master, not the standby. Even if the
backup is only intended for use in creating a new master, copying the
replication slots isn't expected to be particularly useful, since the
contents of those slots will likely be badly out of date by the time
the new master comes on line.
</para>
Note as well that pg_basebackup omits the contents of pg_replslot/ and
creates an empty directory.
Seems like another good idea to use pg_basebackup rather than manually
doing base backups; Magnus has been saying this for a while.
The main time that's an issue is when you're rsync'ing to save bandwidth,
using CoW volume snapshots, etc. pg_basebackup becomes totally impractical
on big systems.
I suppose there is no way we could remove this error-prone behavior
because replication slots must survive server restarts. Is there no way
to know if we are starting a standby from a fresh base backup vs.
restarting a standby? In that case we could clear the replication
slots. Are there any other error-prone things copied from the master?
We could remove slots when we enter archive recovery. But I've recently
implemented support for logical decoding from standbys, which needs slots.
Physical slot use on standby is also handy. We cannot tell whether a slot
was created on the replica or created on the master and copied in the base
backup, and we don't want to drop slots created on the replica.
I also have use cases for slots being retained in restore from snapshot,
for re-integrating restored nodes into an MM mesh.
I think a recovery.conf option to remove all slots during archive recovery
could be handy. But mostly it comes down to tools not copying them.
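(A sketch of that tool-side cleanup, assuming every slot in the clone is
an unwanted copy; the paths and host name are placeholders:)

    # Simplest: clear the copied slot state before first startup of the
    # cloned data directory.
    rm -rf /var/lib/postgresql/data/pg_replslot/*

    # Or, if the standby is already running, drop the inactive slots by
    # name (assuming the server permits dropping them during recovery).
    psql -h standby.example.com -Atc \
        "SELECT slot_name FROM pg_replication_slots WHERE NOT active;" |
    while read -r slot; do
        psql -h standby.example.com -c "SELECT pg_drop_replication_slot('$slot');"
    done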
On Sun, Aug 21, 2016 at 1:24 PM, Craig Ringer
<craig.ringer@2ndquadrant.com> wrote:
On 21 Aug 2016 12:36 AM, "Bruce Momjian" <bruce@momjian.us> wrote:
Seems like another good idea to use pg_basebackup rather than manually
doing base backups; Magnus has been saying this for a while.
The main time that's an issue is when you're rsync'ing to save bandwidth,
using CoW volume snapshots, etc. pg_basebackup becomes totally impractical
on big systems.
Yes, and that's not fun. Particularly when the backup takes so long
that WAL has already been recycled... Replication slots help here but
the partitions dedicated to pg_xlog have their limit as well.
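(A sketch of that workaround: reserve WAL on the master with a physical
slot before starting the long copy; the slot name is made up:)

    psql -h master.example.com -c \
        "SELECT pg_create_physical_replication_slot('clone_slot');"

    # ... long base backup runs here ...

    # Afterwards, point the standby at the slot via recovery.conf:
    #     primary_slot_name = 'clone_slot'
    # If the clone is abandoned instead, drop the slot, or the master's
    # pg_xlog will grow without bound.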
I suppose there is no way we could remove this error-prone behavior
because replication slots must survive server restarts. Is there no way
to know if we are starting a standby from a fresh base backup vs.
restarting a standby? In that case we could clear the replication
slots. Are there any other error-prone things copied from the master?
We could remove slots when we enter archive recovery. But I've recently
implemented support for logical decoding from standbys, which needs slots.
Physical slot use on standby is also handy. We cannot tell whether a slot
was created on the replica or created on the master and copied in the base
backup, and we don't want to drop slots created on the replica.
I also have use cases for slots being retained in restore from snapshot,
for re-integrating restored nodes into an MM mesh.
I think a recovery.conf option to remove all slots during archive recovery
could be handy. But mostly it comes down to tools not copying them.
Yes, I'd personally leave recovery.conf out of that, and also skip
removing replication slot data when archive recovery begins, to keep
the configuration simple. The decision about what data is included in
any backup will be made by the tool itself anyway.
--
Michael
On Sun, Aug 21, 2016 at 1:35 AM, Bruce Momjian <bruce@momjian.us> wrote:
On Sat, Aug 20, 2016 at 01:43:42PM +0900, Michael Paquier wrote:
Note as well that pg_basebackup omits the contents of pg_replslot/ and
creates an empty directory.
Are there any other error-prone things copied from the master?
The contents of pg_snapshots get copied by pg_basebackup. Those are
useless in a backup, but harmless.
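(So a copy tool could clear those as well; a one-line sketch, the path
being a placeholder:)

    # Exported-snapshot files are meaningless outside the server that
    # created them; safe to remove from a backup copy.
    rm -f /backups/data/pg_snapshots/*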
--
Michael
On 22 August 2016 at 10:31, Michael Paquier <michael.paquier@gmail.com>
wrote:
On Sun, Aug 21, 2016 at 1:24 PM, Craig Ringer
<craig.ringer@2ndquadrant.com> wrote:
On 21 Aug 2016 12:36 AM, "Bruce Momjian" <bruce@momjian.us> wrote:
Seems like another good idea to use pg_basebackup rather than manually
doing base backups; Magnus has been saying this for a while.
The main time that's an issue is when you're rsync'ing to save bandwidth,
using CoW volume snapshots, etc. pg_basebackup becomes totally impractical
on big systems.
Yes, and that's not fun. Particularly when the backup takes so long
that WAL has already been recycled... Replication slots help here but
the partitions dedicated to pg_xlog have their limit as well.
We can and probably should allow XLogReader to invoke restore_command to
fetch WAL, read it, and discard/recycle it again. This would greatly
alleviate the pain of indefinite xlog retention.
It's a pain to do so while recovery.conf is its own separate magic though,
not part of postgresql.conf.
I have no plans to work on this at this time.
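(For context, restore_command today is consulted only by the startup
process and lives only in recovery.conf; a minimal standby setup, with
placeholder paths, looks something like:)

    # $PGDATA/recovery.conf
    standby_mode = 'on'
    restore_command = 'cp /mnt/server/archivedir/%f "%p"'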
--
Craig Ringer http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services