"missing chunk number XX for toast value YY in pg_toast ..." after pg_basebackup.
Hi All,
We are running PostgreSQL 9.1.15 on Debian.
We are running a PostgreSQL cluster with two nodes, using synchronous streaming replication to make sure that the slave is always fully in sync (via a recovery.conf that points at the master). The slave is mainly there to provide high availability. We also run backups on the slave to reduce load on the master.
On both nodes, PostgreSQL's data is stored on a SAN.
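For readers less familiar with this kind of setup, a minimal 9.1-era recovery.conf of the sort described might look like the sketch below (hostname, user, and application_name are placeholders, not our actual values; since PostgreSQL 12 this file has been replaced by a standby.signal file plus postgresql.conf settings):

```
# recovery.conf on the slave (placeholders, not our real values)
standby_mode = 'on'
primary_conninfo = 'host=master.example.com port=5432 user=replicator application_name=slave1'
```

For the replication to be synchronous, the master additionally needs something like synchronous_standby_names = 'slave1' in its postgresql.conf.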
At some point, we restarted the slave. The first thing it does on startup is a pg_basebackup, after which it starts streaming changes based on that base backup. This seemed to work fine until, 16 hours later, a backup we ran on the slave using pg_dump failed with a TOAST error (ERROR: missing chunk number 0 for toast value 2753291 in pg_toast_22066).
The issue remained until we ran a full VACUUM ANALYZE on the cluster.
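In case it helps others who hit the same message: the pg_toast_22066 name in the error identifies the TOAST relation, and the ordinary table it belongs to can be found with a standard catalog query (sketch; the OID is the one from our error text):

```sql
-- Which table owns pg_toast_22066?
SELECT n.nspname, c.relname
FROM pg_class c
JOIN pg_namespace n ON n.oid = c.relnamespace
WHERE c.reltoastrelid = 'pg_toast.pg_toast_22066'::regclass;
```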
We have been running this way on several clusters for some years now, basically since 9.1 was released, without seeing this issue. Since we upgraded to PostgreSQL 9.1.15, we have seen it twice. That does not necessarily mean anything, but I thought I would mention it.
The issue is resolved now, but I would still like to understand what happened.
I have logfiles from the incident, but I cannot see anything out of the ordinary (despite having a fair amount of experience investigating postgresql logs).
I have read that this kind of issue is most frequently due to hardware problems or bugs in PostgreSQL.
Using <https://bucardo.org/postgres_all_versions.html#version_9.1.15>,
I have reviewed the release notes from the subsequent 9.1 releases (9.1.15-9.1.19),
but I have been unable to identify anything that hints at the issue that we have experienced.
The closest match would be:
(9.1.16,9.4.2,9.3.7,9.2.11,9.0.20) Fix possible failure during hash index bucket split, if other processes are modifying the index concurrently (Tom Lane)
but that seems like a long shot.
My questions are:
- does anyone know of bug reports (fixed or otherwise) that could shed light on our issue?
- does anyone have experience with this kind of behavior?
- our setup is fairly automated; would it be a good idea to always run VACUUM ANALYZE after a pg_restore (or before pg_dump)?
Hi All,
I would really appreciate any help I can get on this issue.
Basically, a pg_basebackup + streaming attach led to a database that we could not read from afterwards.
Best regards,
Fredrik
PS: please advise if this is better posted on another list.
On 03/06/2016 10:18 PM, fredrik@huitfeldt.com wrote:
Hi All,
I would really appreciate any help I can get on this issue.
Basically, a pg_basebackup + streaming attach led to a database that we could not read from afterwards.
From original post:
/messages/by-id/1456919678340.31300.116900@webmail2
"The issue remained until we ran a full vacuum analyze on the cluster."
Which cluster was that, the master or the slave?
"I have logfiles from the incident, but I cannot see anything out of the
ordinary (despite having a fair amount of experience investigating
postgresql logs)."
Can we see the section before and after ERROR?
Best regards,
Fredrik

PS: please advise if this is better posted on another list.
--
Adrian Klaver
adrian.klaver@aklaver.com
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general
Hi Adrian,
thank you very much for your response.
I ran the "VACUUM ANALYZE" command on the master node.
Regarding log messages.
Here is the contents of the log (excluding connections/disconnections):
2016-02-22 02:30:08 GMT 24616 LOG: recovery has paused
2016-02-22 02:30:08 GMT 24616 HINT: Execute pg_xlog_replay_resume() to continue.
2016-02-22 02:37:19 GMT 23859 DBNAME ERROR: missing chunk number 0 for toast value 2747579 in pg_toast_22066
2016-02-22 02:37:19 GMT 23859 DBNAME STATEMENT: COPY public.room_shape (room_uuid, data) TO stdout;
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: could not receive data from client: Connection reset by peer
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: unexpected EOF on client connection
Best regards,
Fredrik Huitfeldt
On 03/09/2016 04:56 AM, fredrik@huitfeldt.com wrote:
Hi Adrian,
thank you very much for your response.
I ran the "VACUUM ANALYZE" command on the master node.
Regarding log messages.
Here is the contents of the log (excluding connections/disconnections):
Assuming the below is from the replica database.
2016-02-22 02:30:08 GMT 24616 LOG: recovery has paused
So what happened to cause the above?
I am not seeing anything below that indicates the recovery started again.
2016-02-22 02:30:08 GMT 24616 HINT: Execute pg_xlog_replay_resume() to
continue.
2016-02-22 02:37:19 GMT 23859 DBNAME ERROR: missing chunk number 0 for
toast value 2747579 in pg_toast_22066
2016-02-22 02:37:19 GMT 23859 DBNAME STATEMENT: COPY public.room_shape
(room_uuid, data) TO stdout;
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: could not receive data from
client: Connection reset by peer
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: unexpected EOF on client
connection
What does the log from the master show?
Best regards,
Fredrik Huitfeldt
--
Adrian Klaver
adrian.klaver@aklaver.com
Hi Adrian,
thank you very much for your patience. I apologise for the missing information.
On 9 March 2016 16:13:00 +01:00, Adrian Klaver <adrian.klaver@aklaver.com> wrote:
On 03/09/2016 04:56 AM, <fredrik@huitfeldt.com> wrote:
Hi Adrian,
thank you very much for your response.
I ran the "VACUUM ANALYZE" command on the master node.
Regarding log messages.
Here is the contents of the log (excluding connections/disconnections):
Assuming the below is from the replica database.
the "LOG: recovery has paused" message was indeed from the replica.
2016-02-22 02:30:08 GMT 24616 LOG: recovery has paused
So what happened to cause the above?
we automatically pause recovery on the replica before running pg_dump. This is in order to make certain that we get a consistent dump of the database.
I am not seeing anything below that indicates the recovery started again.
The reason why we do not see a matching "resume" is that pg_dump failed and our error handling was insufficient.
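A more robust version of our pause/dump/resume cycle might look like the sketch below (hostnames, database and file names are placeholders, not our actual setup; pg_xlog_replay_pause()/pg_xlog_replay_resume() are the 9.1-era function names, renamed to pg_wal_replay_* in PostgreSQL 10). The point is the trap plus explicit resume: replay is resumed even if pg_dump fails, which our original scripts did not guarantee.

```shell
#!/bin/sh
# Sketch: pause WAL replay on the replica, dump, then always resume.
# All connection details below are placeholders.
REPLICA="-h replica.example.com -U postgres"
DBNAME="mydb"
DUMPFILE="/backup/mydb.dump"
CMDS=""

# With DRY_RUN=1 (the default here) commands are only recorded, not run;
# set DRY_RUN=0 to execute them for real.
run() {
    CMDS="$CMDS $*;"
    if [ "${DRY_RUN:-1}" = "0" ]; then "$@"; fi
}

resume() { run psql $REPLICA -Atc "SELECT pg_xlog_replay_resume();"; }
trap resume EXIT   # safety net: resume replay even on unexpected exit

run psql $REPLICA -Atc "SELECT pg_xlog_replay_pause();"
run pg_dump $REPLICA -Fc -f "$DUMPFILE" "$DBNAME" || echo "pg_dump failed" >&2

trap - EXIT        # normal path: clear the trap and resume explicitly
resume
```

Had something like this been in place, the replica would at least not have silently stopped replaying after the failed dump.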
2016-02-22 02:30:08 GMT 24616 HINT: Execute pg_xlog_replay_resume() to
continue.
2016-02-22 02:37:19 GMT 23859 DBNAME ERROR: missing chunk number 0 for
toast value 2747579 in pg_toast_22066
2016-02-22 02:37:19 GMT 23859 DBNAME STATEMENT: COPY public.room_shape
(room_uuid, data) TO stdout;
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: could not receive data from
client: Connection reset by peer
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: unexpected EOF on client connection

What does the log from the master show?
It doesn't seem to show much. It does have these repeated messages, however:
2016-02-22 02:12:18 GMT 30908 LOG: using stale statistics instead of current ones because stats collector is not responding
2016-02-22 02:13:01 GMT 30908 LOG: using stale statistics instead of current ones because stats collector is not responding
2016-02-22 02:13:52 GMT 30908 LOG: using stale statistics instead of current ones because stats collector is not responding
There are lots of these messages within the timeframe; a couple of them appear every 2-4 hours.
Best regards,
Fredrik Huitfeldt
On 03/09/2016 10:41 AM, fredrik@huitfeldt.com wrote:
Hi Adrian,
thank you very much for your patience. I apologise for the missing information.

On 9 March 2016 16:13:00 +01:00, Adrian Klaver <adrian.klaver@aklaver.com> wrote:

On 03/09/2016 04:56 AM, fredrik@huitfeldt.com wrote:

Hi Adrian,
thank you very much for your response.
I ran the "VACUUM ANALYZE" command on the master node.
Regarding log messages.
Here are the contents of the log (excluding connections/disconnections):

Assuming the below is from the replica database.

the "LOG: recovery has paused" message was indeed from the replica.
2016-02-22 02:30:08 GMT 24616 LOG: recovery has paused
So what happened to cause the above?
we automatically pause recovery on the replica before running pg_dump.
This is in order to make certain that we get a consistent dump of the
database.
Still muddling through this, but to recap and be clear in my mind:
1) This only started occurring with 9.1.15, but worked in previous
versions of 9.1.
So what was the last version of 9.1 that worked?
2) You seed a replica with pg_basebackup.
3) You set up synchronous streaming replication to the replica.
4) You pause the replication and use pg_dump to dump the replica.
5) At this point the error in the subject has occurred twice since you switched to 9.1.15.
6) Running full vacuum analyze on the master solves the problem.
How is it solved?
In other words do you resume replication after the vacuum, or before?
Then do you redo the pg_dump?
Or do you start over with a new pg_basebackup?
I am not seeing anything below that indicates the recovery started again.
the reason why we do not see a matching "resume" is that the pg_dump failed and our error handling was insufficient.

2016-02-22 02:30:08 GMT 24616 HINT: Execute pg_xlog_replay_resume() to continue.
2016-02-22 02:37:19 GMT 23859 DBNAME ERROR: missing chunk number 0 for
toast value 2747579 in pg_toast_22066
2016-02-22 02:37:19 GMT 23859 DBNAME STATEMENT: COPY public.room_shape
(room_uuid, data) TO stdout;
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: could not receive data from
client: Connection reset by peer
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: unexpected EOF on client connection

What does the log from the master show?

It doesn't seem to show much. It does have these repeated messages, however:
2016-02-22 02:12:18 GMT 30908 LOG: using stale statistics instead of
current ones because stats collector is not responding
2016-02-22 02:13:01 GMT 30908 LOG: using stale statistics instead of
current ones because stats collector is not responding
2016-02-22 02:13:52 GMT 30908 LOG: using stale statistics instead of current ones because stats collector is not responding

There are lots of these messages within the timeframe; a couple of them appear every 2-4 hours.
This is usually a sign of resource starvation. I see this on an old machine, at night, when I run some intensive file system backups; I figured it out by looking at my crontab. The problem, such as it is, is that the messages fill up the logs and your statistics become, as the message says, stale for however long the collector does not respond.
Best regards,
Fredrik Huitfeldt
--
Adrian Klaver
adrian.klaver@aklaver.com
Hi Adrian,
thank you again for your continued patience.
On 10 March 2016 00:20:13 +01:00, Adrian Klaver <adrian.klaver@aklaver.com> wrote:
On 03/09/2016 10:41 AM, <fredrik@huitfeldt.com> wrote:
Hi Adrian,
thank you very much for your patience. I apologise for the missing information.

On 9 March 2016 16:13:00 +01:00, Adrian Klaver <adrian.klaver@aklaver.com> wrote:

On 03/09/2016 04:56 AM, fredrik@huitfeldt.com wrote:

Hi Adrian,
thank you very much for your response.
I ran the "VACUUM ANALYZE" command on the master node.
Regarding log messages.
Here are the contents of the log (excluding connections/disconnections):

Assuming the below is from the replica database.

the "LOG: recovery has paused" message was indeed from the replica.

2016-02-22 02:30:08 GMT 24616 LOG: recovery has paused
So what happened to cause the above?
we automatically pause recovery on the replica before running pg_dump. This is in order to make certain that we get a consistent dump of the database.

Still muddling through this, but to recap and be clear in my mind:
1) This only started occurring with 9.1.15, but worked in previous versions of 9.1.
So what was the last version of 9.1 that worked?
The previous version for us was 9.1.13.
It *is* true that it has never happened before, and also that it has happened twice with 9.1.15.
It should be noted that we *are* executing this process in many automated environments. This means that we have definitely executed the full process you describe here more than a hundred times for 9.1.15, probably even more than 200. This means that the process has worked for us more than 99% of the time, and it even crept through an extended QA phase.
This means that to me, we are most likely not looking for something that "does not work", but more likely for some corner case.
2) You seed a replica with pg_basebackup.
yes.
3) You set up synchronous streaming replication to the replica.
yes
4) You pause the replication and use pg_dump to dump the replica.
yes
5) At this point the error in the subject has occurred twice since you switched to 9.1.15.
precisely
6) Running full vacuum analyze on the master solves the problem.
How is it solved?
In other words do you resume replication after the vacuum, or before?
Then do you redo the pg_dump?
Or do you start over with a new pg_basebackup?
I must admit that I am a bit confused here. My recollection is that the *only* thing I did was to run VACUUM ANALYZE on the master and immediately after I was able to successfully run pg_dump.
I realise that this does not correspond very well to PostgreSQL stating that replay had been paused.
I wonder if I might be mistaken and have run VACUUM ANALYZE on the slave node instead.
I am not seeing anything below that indicates the recovery started again.
the reason why we do not see a matching "resume" is that the pg_dump failed and our error handling was insufficient.
Looking at this thread again, I wonder: if our error handling had been better, and we had automatically resumed streaming even after the error in the pg_dump phase, would the problem have gone away?
Is it possible that we can be unlucky when pausing replication, in a way that would not permit us to run a pg_dump? (perhaps if we factor in the "stale collector" warning below?)
2016-02-22 02:30:08 GMT 24616 HINT: Execute pg_xlog_replay_resume() to
continue.
2016-02-22 02:37:19 GMT 23859 DBNAME ERROR: missing chunk number 0 for
toast value 2747579 in pg_toast_22066
2016-02-22 02:37:19 GMT 23859 DBNAME STATEMENT: COPY public.room_shape
(room_uuid, data) TO stdout;
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: could not receive data from
client: Connection reset by peer
2016-02-22 02:37:41 GMT 2648 DBNAME LOG: unexpected EOF on client connection

What does the log from the master show?

It doesn't seem to show much. It does have these repeated messages, however:

2016-02-22 02:12:18 GMT 30908 LOG: using stale statistics instead of
current ones because stats collector is not responding
2016-02-22 02:13:01 GMT 30908 LOG: using stale statistics instead of
current ones because stats collector is not responding
2016-02-22 02:13:52 GMT 30908 LOG: using stale statistics instead of current ones because stats collector is not responding

There are lots of these messages within the timeframe; a couple of them appear every 2-4 hours.

This is usually a sign of resource starvation. I see this on an old machine, at night, when I run some intensive file system backups; I figured it out by looking at my crontab. The problem, such as it is, is that the messages fill up the logs and your statistics become, as the message says, stale for however long the collector does not respond.
Thank you very much, this is great feedback. I will try to determine what put the master under such load.
Thank you for your help so far,
Fredrik