9.6 and fsync=off

Started by Craig Ringer, over 9 years ago, 21 messages

craig@2ndquadrant.com

over 9 years ago

Hi all

After helping clean up the mess from another user who turned fsync=off
because they read (bad) tuning advice that it was faster, I'd really like
to change the config file comment.

Really.

#fsync = on # turns forced synchronization on or off

Now, we can't rename fsync to disable_crash_safety=on or
corrupt_my_database=on. But the comment needs changing.

How about:

#fsync = on # force disk flushes required for crash safety

or, preferably something like:

"Enable forced disk flushes when they are required for crash safety.
Disabling fsync can lead to unrecoverable database corruption in a crash of
the host system."

--
Craig Ringer http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services

Abhijit Menon-Sen

ams@2ndQuadrant.com

over 9 years ago

In reply to: Craig Ringer (#1)

Re: 9.6 and fsync=off

At 2016-04-27 17:58:08 +0800, craig@2ndquadrant.com wrote:

#fsync = on # turns forced synchronization on or off

I suggest: # provide crash safety by flushing disk writes
# (Disabling this can lead to unrecoverable data
# loss if the system crashes.)

-- Abhijit

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Magnus Hagander

magnus@hagander.net

over 9 years ago

In reply to: Abhijit Menon-Sen (#2)

Re: 9.6 and fsync=off

On Wed, Apr 27, 2016 at 12:43 PM, Abhijit Menon-Sen <ams@2ndquadrant.com>
wrote:

At 2016-04-27 17:58:08 +0800, craig@2ndquadrant.com wrote:

#fsync = on # turns forced synchronization on or off

I suggest: # provide crash safety by flushing disk writes
# (Disabling this can lead to unrecoverable data
# loss if the system crashes.)

+1 for the change. I suggest shortening it to just "disabling this can lead
to unrecoverable data corruption" (I think corruption is better than loss,
mainly because too many people equate loss with "I may lose my last 10
updates, and I'm fine with that").

--
Magnus Hagander
Me: http://www.hagander.net/
Work: http://www.redpill-linpro.com/

Petr Jelinek

petr@2ndquadrant.com

over 9 years ago

In reply to: Magnus Hagander (#3)

Re: 9.6 and fsync=off

On 27/04/16 12:53, Magnus Hagander wrote:

On Wed, Apr 27, 2016 at 12:43 PM, Abhijit Menon-Sen <ams@2ndquadrant.com> wrote:

At 2016-04-27 17:58:08 +0800, craig@2ndquadrant.com wrote:

#fsync = on # turns forced synchronization on or off

I suggest: # provide crash safety by flushing disk writes
# (Disabling this can lead to unrecoverable data
# loss if the system crashes.)

+1 for the change. I suggest shortening it to just "disabling this can
lead to unrecoverable data corruption" (I think corruption is better
than loss, mainly because too many people equate loss with "I may lose
my last 10 updates, and I'm fine with that").

+1 (Abhijit's wording with data loss changed to data corruption)

--
Petr Jelinek http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services


Abhijit Menon-Sen

ams@2ndQuadrant.com

over 9 years ago

In reply to: Petr Jelinek (#4)

1 attachment(s)

Re: 9.6 and fsync=off

Here's a patch just to help things along.

-- Abhijit

Attachments:

fsync.diff (text/x-diff; charset=us-ascii)

diff --git a/src/backend/utils/misc/postgresql.conf.sample b/src/backend/utils/misc/postgresql.conf.sample
index f3e3de0..353de2e 100644
--- a/src/backend/utils/misc/postgresql.conf.sample
+++ b/src/backend/utils/misc/postgresql.conf.sample
@@ -182,7 +182,9 @@
 
 #wal_level = minimal			# minimal, replica, or logical
 					# (change requires restart)
-#fsync = on				# turns forced synchronization on or off
+#fsync = on				# provide crash safety by flushing disk writes
+					# (disabling this can lead to unrecoverable
+					# data corruption)
 #synchronous_commit = on		# synchronization level;
 					# off, local, remote_write, remote_apply, or on
 #wal_sync_method = fsync		# the default is the first option

Tom Lane

tgl@sss.pgh.pa.us

over 9 years ago

In reply to: Petr Jelinek (#4)

Re: 9.6 and fsync=off

Petr Jelinek <petr@2ndquadrant.com> writes:

+1 (Abhijit's wording with data loss changed to data corruption)

I'd suggest something like

#fsync = on # flush data to disk for crash safety
# (turning this off can cause
# unrecoverable data corruption!)

regards, tom lane


Craig Ringer

craig@2ndquadrant.com

over 9 years ago

In reply to: Tom Lane (#6)

Re: 9.6 and fsync=off

On 27 April 2016 at 21:44, Tom Lane <tgl@sss.pgh.pa.us> wrote:

Petr Jelinek <petr@2ndquadrant.com> writes:

+1 (Abhijit's wording with data loss changed to data corruption)

I'd suggest something like

#fsync = on # flush data to disk for crash safety
# (turning this off can cause
# unrecoverable data corruption!)

Looks good.

The docs on fsync are already good, it's just a matter of making people
think twice and actually look at them.

--
Craig Ringer http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services

Robert Haas

robertmhaas@gmail.com

over 9 years ago

In reply to: Craig Ringer (#7)

Re: 9.6 and fsync=off

On Wed, Apr 27, 2016 at 11:04 AM, Craig Ringer <craig@2ndquadrant.com> wrote:

I'd suggest something like

#fsync = on # flush data to disk for crash safety
# (turning this off can cause
# unrecoverable data corruption!)

Looks good.

The docs on fsync are already good, it's just a matter of making people
think twice and actually look at them.

Committed that way. Thanks for suggesting this, Craig.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company


Greg Stark

stark@mit.edu

over 9 years ago

In reply to: Craig Ringer (#1)

Re: 9.6 and fsync=off

On Wed, Apr 27, 2016 at 10:58 AM, Craig Ringer <craig@2ndquadrant.com> wrote:

Now, we can't rename fsync to disable_crash_safety=on or
corrupt_my_database=on. But the comment needs changing.

FWIW, we've done similar things in the past. We can provide
backwards-compatibility support for "fsync" but make the setting
appear as "crash_safety" or whatever in pg_settings and in the default
postgresql.conf. The only downside is that tools or scripts that
retrieve all the settings might break or miss that setting.
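
A minimal C sketch of the backwards-compatible rename described above, assuming a simple old-name-to-new-name table. The table, the function name `canonical_setting_name`, and the name "crash_safety" (taken from the message as a placeholder) are illustrative assumptions, not PostgreSQL's actual code; a real implementation would also need to compare names case-insensitively.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical alias table: pairs of (old name, new name), NULL-terminated.
 * "crash_safety" is only the placeholder name used in the message above.
 */
static const char *const old_name_map[] = {
    "fsync", "crash_safety",
    NULL
};

/*
 * Return the canonical name for a setting. Old names are still accepted,
 * but everything downstream (pg_settings, the sample config) would only
 * ever see the new name.
 */
static const char *
canonical_setting_name(const char *name)
{
    assert(name != NULL);

    for (size_t i = 0; old_name_map[i] != NULL; i += 2)
    {
        if (strcmp(name, old_name_map[i]) == 0)
            return old_name_map[i + 1];
    }
    return name;                /* not an alias; already canonical */
}
```

With such a table an existing config line using the old name keeps working, while a script that searches the list of settings for the literal old name would no longer find it, which is the downside noted above.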

--
greg


#10

Simon Riggs

simon@2ndQuadrant.com

over 9 years ago

In reply to: Craig Ringer (#7)

Re: 9.6 and fsync=off

On 27 April 2016 at 17:04, Craig Ringer <craig@2ndquadrant.com> wrote:

On 27 April 2016 at 21:44, Tom Lane <tgl@sss.pgh.pa.us> wrote:

Petr Jelinek <petr@2ndquadrant.com> writes:

+1 (Abhijit's wording with data loss changed to data corruption)

I'd suggest something like

#fsync = on # flush data to disk for crash safety
# (turning this off can cause
# unrecoverable data corruption!)

Looks good.

The docs on fsync are already good, it's just a matter of making people
think twice and actually look at them.

If fsync=off and you turn it on, does it fsync anything at that point?

Or does it mean only that future fsyncs will occur?

--
Simon Riggs http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

#11

David G. Johnston

david.g.johnston@gmail.com

over 9 years ago

In reply to: Simon Riggs (#10)

Re: 9.6 and fsync=off

On Thursday, April 28, 2016, Simon Riggs <simon@2ndquadrant.com> wrote:

On 27 April 2016 at 17:04, Craig Ringer <craig@2ndquadrant.com> wrote:

On 27 April 2016 at 21:44, Tom Lane <tgl@sss.pgh.pa.us> wrote:

Petr Jelinek <petr@2ndquadrant.com> writes:

+1 (Abhijit's wording with data loss changed to data corruption)

I'd suggest something like

#fsync = on # flush data to disk for crash safety
# (turning this off can cause
# unrecoverable data corruption!)

Looks good.

The docs on fsync are already good, it's just a matter of making people
think twice and actually look at them.

If fsync=off and you turn it on, does it fsync anything at that point?

Or does it mean only that future fsyncs will occur?

http://www.postgresql.org/docs/current/static/runtime-config-wal.html

4th paragraph in the fsync section.

David J.

#12

Simon Riggs

simon@2ndQuadrant.com

over 9 years ago

In reply to: David G. Johnston (#11)

Re: 9.6 and fsync=off

On 28 April 2016 at 22:30, David G. Johnston <david.g.johnston@gmail.com>
wrote:

On Thursday, April 28, 2016, Simon Riggs <simon@2ndquadrant.com> wrote:

On 27 April 2016 at 17:04, Craig Ringer <craig@2ndquadrant.com> wrote:

On 27 April 2016 at 21:44, Tom Lane <tgl@sss.pgh.pa.us> wrote:

Petr Jelinek <petr@2ndquadrant.com> writes:

+1 (Abhijit's wording with data loss changed to data corruption)

I'd suggest something like

#fsync = on # flush data to disk for crash safety
# (turning this off can cause
# unrecoverable data corruption!)

Looks good.

The docs on fsync are already good, it's just a matter of making people
think twice and actually look at them.

If fsync=off and you turn it on, does it fsync anything at that point?

Or does it mean only that future fsyncs will occur?

http://www.postgresql.org/docs/current/static/runtime-config-wal.html

4th paragraph in the fsync section.

Thanks. I've never touched that parameter! But I could have read the docs.

--
Simon Riggs http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

#13

Andres Freund

andres@anarazel.de

over 9 years ago

In reply to: Simon Riggs (#10)

Re: 9.6 and fsync=off

On 2016-04-28 21:32:37 +0200, Simon Riggs wrote:

On 27 April 2016 at 17:04, Craig Ringer <craig@2ndquadrant.com> wrote:

On 27 April 2016 at 21:44, Tom Lane <tgl@sss.pgh.pa.us> wrote:

Petr Jelinek <petr@2ndquadrant.com> writes:

+1 (Abhijit's wording with data loss changed to data corruption)

I'd suggest something like

#fsync = on # flush data to disk for crash safety
# (turning this off can cause
# unrecoverable data corruption!)

Looks good.

The docs on fsync are already good, it's just a matter of making people
think twice and actually look at them.

If fsync=off and you turn it on, does it fsync anything at that point?

Or does it mean only that future fsyncs will occur?

Abhijit had a patch implementing automatically running fsync whenever
reenabled IIRC. Abhijit?

Andres


#14

Abhijit Menon-Sen

ams@2ndQuadrant.com

over 9 years ago

In reply to: Andres Freund (#13)

1 attachment(s)

Re: 9.6 and fsync=off

At 2016-04-28 13:44:23 -0700, andres@anarazel.de wrote:

Abhijit had a patch implementing automatically running fsync whenever
reenabled IIRC. Abhijit?

The patch I had written is attached, and it's not quite the same thing.
Here's how I originally described it in response to a question from
Robert:

"In 20150115133245.GG5245@awork2.anarazel.de, Andres explained his
rationale as follows:

'What I am thinking of is that, currently, if you start the
server for initial loading with fsync=off, and then restart it,
you're open to data loss. So when the current config file
setting is changed from off to on, we should fsync the data
directory. Even if there was no crash restart.'

That's what I tried to implement."

I remember there was some subsequent discussion about it being better to
issue fsync during a checkpoint when we see that its value has changed,
but if I did any work on it (which I have a vague memory of), I can't
find it now. Sorry.

Do you want a patch along those lines now, or is it too late?

-- Abhijit

Attachments:

0002-Recursively-fsync-PGDATA-on-the-next-restart-after-f.patch (text/x-diff; charset=us-ascii)

From 1768680b672bcb037446230323cabcf9960f7f9a Mon Sep 17 00:00:00 2001
From: Abhijit Menon-Sen <ams@2ndQuadrant.com>
Date: Fri, 1 May 2015 17:59:51 +0530
Subject: Recursively fsync PGDATA on the next restart after fsync was disabled

Even if we didn't crash, we want to fsync PGDATA on startup if we know
the server ran with fsync=off at some point since the previous restart.
Otherwise, starting the server with fsync=off for initial data loading
and then restarting it opens you to data loss on a power failure after
the restart.
---
 src/backend/access/transam/xlog.c       | 9 +++++++--
 src/bin/pg_controldata/pg_controldata.c | 2 ++
 src/include/catalog/pg_control.h        | 8 +++++++-
 3 files changed, 16 insertions(+), 3 deletions(-)

diff --git a/src/backend/access/transam/xlog.c b/src/backend/access/transam/xlog.c
index 084174d..af12992 100644
--- a/src/backend/access/transam/xlog.c
+++ b/src/backend/access/transam/xlog.c
@@ -4823,6 +4823,7 @@ BootStrapXLOG(void)
 	ControlFile->checkPoint = checkPoint.redo;
 	ControlFile->checkPointCopy = checkPoint;
 	ControlFile->unloggedLSN = 1;
+	ControlFile->fsync_disabled = false;
 
 	/* Set important parameter values for use when replaying WAL */
 	ControlFile->MaxConnections = MaxConnections;
@@ -5893,10 +5894,12 @@ StartupXLOG(void)
 	 */
 
 	if (enableFsync &&
-		(ControlFile->state != DB_SHUTDOWNED &&
-		 ControlFile->state != DB_SHUTDOWNED_IN_RECOVERY))
+		(ControlFile->fsync_disabled ||
+		 (ControlFile->state != DB_SHUTDOWNED &&
+		  ControlFile->state != DB_SHUTDOWNED_IN_RECOVERY)))
 	{
 		fsync_pgdata(data_directory);
+		ControlFile->fsync_disabled = false;
 	}
 
 	if (ControlFile->state == DB_SHUTDOWNED)
@@ -8272,6 +8275,8 @@ CreateCheckPoint(int flags)
 	/* crash recovery should always recover to the end of WAL */
 	ControlFile->minRecoveryPoint = InvalidXLogRecPtr;
 	ControlFile->minRecoveryPointTLI = 0;
+	if (!enableFsync)
+		ControlFile->fsync_disabled = true;
 
 	/*
 	 * Persist unloggedLSN value. It's reset on crash recovery, so this goes
diff --git a/src/bin/pg_controldata/pg_controldata.c b/src/bin/pg_controldata/pg_controldata.c
index d8cfe5e..e99014f 100644
--- a/src/bin/pg_controldata/pg_controldata.c
+++ b/src/bin/pg_controldata/pg_controldata.c
@@ -290,6 +290,8 @@ main(int argc, char *argv[])
 		   (uint32) ControlFile.backupEndPoint);
 	printf(_("End-of-backup record required:        %s\n"),
 		   ControlFile.backupEndRequired ? _("yes") : _("no"));
+	printf(_("Fsync disabled at runtime:            %s\n"),
+		   ControlFile.fsync_disabled ? _("yes") : _("no"));
 	printf(_("Current wal_level setting:            %s\n"),
 		   wal_level_str(ControlFile.wal_level));
 	printf(_("Current wal_log_hints setting:        %s\n"),
diff --git a/src/include/catalog/pg_control.h b/src/include/catalog/pg_control.h
index 2e4c381..a71d73e 100644
--- a/src/include/catalog/pg_control.h
+++ b/src/include/catalog/pg_control.h
@@ -21,7 +21,7 @@
 
 
 /* Version identifier for this pg_control format */
-#define PG_CONTROL_VERSION	942
+#define PG_CONTROL_VERSION	943
 
 /*
  * Body of CheckPoint XLOG records.  This is declared here because we keep
@@ -182,6 +182,12 @@ typedef struct ControlFileData
 	bool		track_commit_timestamp;
 
 	/*
+	 * Indicates whether fsync was ever disabled since the last restart.
+	 * Tested and set at checkpoints, reset at startup.
+	 */
+	bool		fsync_disabled;
+
+	/*
 	 * This data is used to check for hardware-architecture compatibility of
 	 * the database and the backend executable.  We need not check endianness
 	 * explicitly, since the pg_control version will surely look wrong to a
-- 
1.9.1
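
The startup condition that the patch changes in StartupXLOG can be modelled in isolation. The following is only a sketch under stated assumptions: the enum and the function `startup_needs_pgdata_fsync` are hypothetical stand-ins, not the names used in xlog.c, and every state other than the two clean-shutdown states is collapsed into one value.

```c
#include <assert.h>
#include <stdbool.h>

/* Simplified, hypothetical stand-in for the control-file states tested. */
typedef enum SketchDBState
{
    SKETCH_SHUTDOWNED,
    SKETCH_SHUTDOWNED_IN_RECOVERY,
    SKETCH_IN_PRODUCTION        /* stands for "not shut down cleanly" */
} SketchDBState;

/*
 * Mirror of the patched condition: with fsync enabled, PGDATA is synced
 * on startup either after an unclean shutdown (the existing behaviour)
 * or when the control file says fsync was disabled at some point since
 * the previous restart (the new behaviour).
 */
static bool
startup_needs_pgdata_fsync(bool enable_fsync,
                           bool fsync_disabled,
                           SketchDBState state)
{
    bool        clean_shutdown;

    assert(state <= SKETCH_IN_PRODUCTION);

    clean_shutdown = (state == SKETCH_SHUTDOWNED ||
                      state == SKETCH_SHUTDOWNED_IN_RECOVERY);

    return enable_fsync && (fsync_disabled || !clean_shutdown);
}
```

The case the patch adds is the first one a truth table would show: a clean shutdown with the flag set, which previously skipped the sync.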

#15

Tom Lane

tgl@sss.pgh.pa.us

over 9 years ago

In reply to: Abhijit Menon-Sen (#14)

Re: 9.6 and fsync=off

Abhijit Menon-Sen <ams@2ndQuadrant.com> writes:

Do you want a patch along those lines now, or is it too late?

We're certainly not going to consider fooling with this in 9.6.
The situation for manual fsync-twiddling is no worse than it was in
any prior release, and we are long past feature freeze.

If you want to put it on your to-do queue for 9.7, feel free.

regards, tom lane


#16

Robert Haas

robertmhaas@gmail.com

over 9 years ago

In reply to: Tom Lane (#15)

Re: 9.6 and fsync=off

On Fri, Apr 29, 2016 at 9:49 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:

Abhijit Menon-Sen <ams@2ndQuadrant.com> writes:

Do you want a patch along those lines now, or is it too late?

We're certainly not going to consider fooling with this in 9.6.
The situation for manual fsync-twiddling is no worse than it was in
any prior release, and we are long past feature freeze.

If you want to put it on your to-do queue for 9.7, feel free.

Agreed.

I also think that it would be a swell idea to detect whether a system
has ever crashed with fsync=off, and do something about that, like
maybe bleat on every subsequent startup for the lifetime of the
cluster. I think Andres may have even proposed a patch for this sort
of thing before, although I don't remember for sure and I think he and
I disagreed on the details. Sketch:

- Keep a copy of the fsync status in pg_control.
- If we ever enter recovery while it's turned off, say:
WARNING: Entering recovery with fsync=off; this cluster may be
irretrievably corrupted.
...and also set a separate flag indicating that we've done at least
one recovery with fsync=off.
- If that flag is set on a subsequent startup, say:
WARNING: Recovery was previously performed with fsync=off; this
cluster may be irretrievably corrupted.
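
The three steps of this sketch could look roughly as follows in C. This is a minimal illustration, assuming a hypothetical struct in place of the real pg_control layout; the struct, its field names, and the function names are all assumptions introduced here. Only the two warning texts come from the message.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-in for the proposed pg_control fields. */
typedef struct ControlSketch
{
    bool        fsync_enabled;              /* copy of the fsync setting */
    bool        recovered_with_fsync_off;   /* sticky: set once, never cleared */
} ControlSketch;

/* Step 1: keep a copy of the fsync status in the control data. */
static void
record_fsync_setting(ControlSketch *ctl, bool enable_fsync)
{
    assert(ctl != NULL);
    ctl->fsync_enabled = enable_fsync;
}

/*
 * Step 2: on entering recovery, warn and set the sticky flag if the
 * recorded setting was off. Returns true if a warning was emitted.
 */
static bool
note_recovery_start(ControlSketch *ctl)
{
    assert(ctl != NULL);
    if (ctl->fsync_enabled)
        return false;

    fprintf(stderr, "WARNING: Entering recovery with fsync=off; "
            "this cluster may be irretrievably corrupted.\n");
    ctl->recovered_with_fsync_off = true;
    return true;
}

/* Step 3: on every later startup, warn again if the sticky flag is set. */
static bool
warn_on_startup(const ControlSketch *ctl)
{
    assert(ctl != NULL);
    if (!ctl->recovered_with_fsync_off)
        return false;

    fprintf(stderr, "WARNING: Recovery was previously performed with "
            "fsync=off; this cluster may be irretrievably corrupted.\n");
    return true;
}
```

Because the flag is never cleared, turning fsync back on later does not silence the startup warning.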

While I'm kvetching, it might also be a good idea to have a timestamp
in pg_control indicating the date and time at which pg_resetxlog was
last run (and maybe the cluster creation time, too). I run across way
too many clusters where the customer can't convincingly vouch for the
proposition that nothing evil has been done, and having some forensic
evidence available would make it easier to figure out where the blame
lies.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company


#17

Craig Ringer

craig@2ndquadrant.com

over 9 years ago

In reply to: Robert Haas (#16)

Re: 9.6 and fsync=off

On 2 May 2016 at 22:07, Robert Haas <robertmhaas@gmail.com> wrote:

I also think that it would be a swell idea to detect whether a system
has ever crashed with fsync=off, and do something about that, like
maybe bleat on every subsequent startup for the lifetime of the
cluster.

Yes. Very, very yes.

That would've made my life considerably easier on a few occasions now.

It shouldn't take much more than a new pg_control field and a test during
recovery.

Should TODO this, but since that's sometimes where ideas go to die, I'm
going to see if I can hack this out soon as well.

--
Craig Ringer http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services

#18

Tom Lane

tgl@sss.pgh.pa.us

over 9 years ago

In reply to: Craig Ringer (#17)

Re: 9.6 and fsync=off

Craig Ringer <craig@2ndquadrant.com> writes:

On 2 May 2016 at 22:07, Robert Haas <robertmhaas@gmail.com> wrote:

I also think that it would be a swell idea to detect whether a system
has ever crashed with fsync=off, and do something about that, like
maybe bleat on every subsequent startup for the lifetime of the
cluster.

Yes. Very, very yes.

+1 for tracking this in pg_control (maybe even with a counter, not
just a flag). I'm less convinced that we need to bleat on every
subsequent startup though --- that seems like just nagging.
Having the info available from pg_controldata seems sufficient for
forensics.
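
As a rough illustration of the counter variant, assuming a hypothetical struct and a made-up report label (neither exists in pg_control or pg_controldata), the bookkeeping could be as small as:

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical forensic field; not part of the real control file. */
typedef struct ForensicsSketch
{
    unsigned int recoveries_with_fsync_off;
} ForensicsSketch;

/* Called once per recovery; counts only those that ran with fsync off. */
static void
count_recovery(ForensicsSketch *f, bool fsync_was_off)
{
    assert(f != NULL);
    if (fsync_was_off)
        f->recoveries_with_fsync_off++;
}

/*
 * Format a report line in the style of a control-data dump. The label is
 * invented for this sketch. Returns the value snprintf returns.
 */
static int
format_forensics_line(const ForensicsSketch *f, char *buf, size_t buflen)
{
    assert(f != NULL && buf != NULL);
    return snprintf(buf, buflen, "Recoveries with fsync=off: %u",
                    f->recoveries_with_fsync_off);
}
```

Nothing is printed at startup in this variant; the count is only visible to someone who asks for the report.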

The timestamp ideas aren't bad either.

BTW, how would this work in a standby server?

regards, tom lane


#19

Andres Freund

andres@anarazel.de

over 9 years ago

In reply to: Robert Haas (#16)

Re: 9.6 and fsync=off

Hi,

On 2016-05-02 10:07:50 -0400, Robert Haas wrote:

I also think that it would be a swell idea to detect whether a system
has ever crashed with fsync=off, and do something about that, like
maybe bleat on every subsequent startup for the lifetime of the
cluster. I think Andres may have even proposed a patch for this sort
of thing before, although I don't remember for sure and I think he and
I disagreed on the details. Sketch:

Hm, I can't remember doing that.

- Keep a copy of the fsync status in pg_control.
- If we ever enter recovery while it's turned off, say:
WARNING: Entering recovery with fsync=off; this cluster may be
irretrievably corrupted.
...and also set a separate flag indicating that we've done at least
one recovery with fsync=off.
- If that flag is set on a subsequent startup, say:
WARNING: Recovery was previously performed with fsync=off; this
cluster may be irretrievably corrupted.

Well, the problem with that is that postgres crashes are actually
harmless with regard to fsync=on/off. It's just OS crashes that are a
problem. So it seems quite likely that the false-positive rate here
would be high enough to make people ignore it.

Andres


#20

Tom Lane

tgl@sss.pgh.pa.us

over 9 years ago

In reply to: Andres Freund (#19)

Re: 9.6 and fsync=off

Andres Freund <andres@anarazel.de> writes:

On 2016-05-02 10:07:50 -0400, Robert Haas wrote:

- If that flag is set on a subsequent startup, say:
WARNING: Recovery was previously performed with fsync=off; this
cluster may be irretrievably corrupted.

Well, the problem with that is that postgres crashes are actually
harmless with regard to fsync=on/off. It's just OS crashes that are a
problem. So it seems quite likely that the false-positive rate here
would be high enough to make people ignore it.

That's a pretty good point. Also, as sketched, I believe this would
start bleating after a crash recovery performed because a backend
died --- which is a case where we know for certain there was no OS
crash. So this idea needs some more thought.

regards, tom lane


#21

Robert Haas

robertmhaas@gmail.com

over 9 years ago

In reply to: Tom Lane (#20)

Re: 9.6 and fsync=off

On Mon, May 2, 2016 at 12:04 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:

Andres Freund <andres@anarazel.de> writes:

On 2016-05-02 10:07:50 -0400, Robert Haas wrote:

- If that flag is set on a subsequent startup, say:
WARNING: Recovery was previously performed with fsync=off; this
cluster may be irretrievably corrupted.

Well, the problem with that is that postgres crashes are actually
harmless with regard to fsync=on/off. It's just OS crashes that are a
problem. So it seems quite likely that the false-positive rate here
would be high enough to make people ignore it.

That's a pretty good point. Also, as sketched, I believe this would
start bleating after a crash recovery performed because a backend
died --- which is a case where we know for certain there was no OS
crash. So this idea needs some more thought.

That's true. I think that we could arrange to ignore postmaster-initiated
crash-and-restart cycles in deciding whether to set the
flag. Now, somebody could still do an immediate shutdown, or the
postmaster could go boom, but I don't think those are common enough
scenarios to justify not tracking this. If you are using fsync=off
and running an immediate shutdown and then setting fsync=on and
restarting the server ... yeah, that could hypothetically be safe.
But I think you are playing with fire. If you are using fsync=off for
the initial data load, it's not too much to ask that you shut the
cluster down cleanly when you are done.
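
A minimal sketch of that refinement in C, assuming the server can tell the two recovery causes apart. The enum and the function `should_flag_recovery` are hypothetical names introduced for illustration only.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical classification of why recovery is being run. */
typedef enum RecoveryTrigger
{
    /* the postmaster survived a backend crash and restarted things itself */
    RECOVERY_POSTMASTER_RESTART,
    /* the server was started and found no clean shutdown on disk */
    RECOVERY_UNCLEAN_STARTUP
} RecoveryTrigger;

/*
 * Decide whether this recovery should set the "recovered with fsync=off"
 * flag. Postmaster-initiated restart cycles are ignored because the
 * operating system did not go down. An unclean startup is flagged even
 * though it may follow a harmless immediate shutdown, since that case
 * cannot be told apart from an OS crash here.
 */
static bool
should_flag_recovery(bool fsync_was_off, RecoveryTrigger trigger)
{
    assert(trigger <= RECOVERY_UNCLEAN_STARTUP);

    if (!fsync_was_off)
        return false;
    return trigger == RECOVERY_UNCLEAN_STARTUP;
}
```

This still yields a false positive after an immediate shutdown with fsync=off, which is the trade-off accepted in the paragraph above.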

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
