Enable data checksums by default

greg@turnstep.com

over 1 year ago

In reply to: Michael Banck (#2)

Re: Enable data checksums by default

On Wed, Aug 7, 2024 at 4:43 AM Michael Banck <mbanck@gmx.net> wrote:

I think the last time we discussed this the consensus was that
computational overhead of computing the checksums is pretty small for
most systems (so the above change seems warranted regardless of whether
we switch the default), but turning on wal_compression also turns on
wal_log_hints, which can increase WAL by quite a lot. Maybe this is
covered elsewhere in the documentation (I just looked at the patch), but
if not, it probably should be added here as a word of caution.

Yeah, that seems something beyond this patch? Certainly we should mention
wal_compression in the release notes if the default changes. I mean, I feel
wal_log_hints should probably default to on as well, but I've honestly
never really given it much thought because my fingers are trained to type
"initdb -k". I've been using data checksums for roughly a decade now. I
think the only time I've NOT used checksums was when I was doing checksum
overhead measurements, or hacking on the pg_checksums program.

I think we usually do not mention when a feature was added/changed, do
we? So I'd just write "(default: enabled)" or whatever is the style of
the surrounding options.

+ {"no-data-checksums", no_argument, NULL, 20},

Does it make sense to add -K (capital k) as a short-cut for this? I
think this is how we distinguish on/off for pg_dump (-t/-T etc.) but
maybe that is not wider project policy.

I'd rather not. Better to keep it explicit rather than some other weird
letter that has no mnemonic value.

Cheers,
Greg

Michael Paquier

michael@paquier.xyz

over 1 year ago

In reply to: Greg Sabino Mullane (#3)

Re: Enable data checksums by default

peter_e@gmx.net

over 1 year ago

In reply to: Greg Sabino Mullane (#1)

Re: Enable data checksums by default

On 07.08.24 00:46, Greg Sabino Mullane wrote:

Currently, initdb only enables data checksums if passed the
--data-checksums or -k argument. There was some hesitation years ago
when this feature was first added, leading to the current situation
where the default is off. However, many years later, there is wide
consensus that this is an extraordinarily safe, desirable setting.
Indeed, most (if not all) of the major commercial and open source
Postgres systems currently turn this on by default. I posit you would be
hard-pressed to find many systems these days in which it has NOT been
turned on. So basically we have a de-facto standard, and I think it's
time we flipped the switch to make it on by default.

I'm sympathetic to this proposal, but I want to raise some concerns.

My understanding was that the reason for some hesitation about adopting
data checksums was the performance impact. Not the checksumming itself,
but the overhead from hint bit logging. The last time I looked into
that, you could get performance impacts on the order of 5% tps. Maybe
that's acceptable, and you of course can turn it off if you want the
extra performance. But I think this should be discussed in this thread.

About the claim that it's already the de-facto standard. Maybe that is
approximately true for "serious" installations. But AFAICT, the popular
packagings don't enable checksums by default, so there is likely a
significant middle tier between "just trying it out" and serious
production use that don't have it turned on.

For those uses, this change would render pg_upgrade useless for upgrades
from an old instance with default settings to a new instance with
default settings. And then users would either need to re-initdb with
checksums turned back off, or I suppose run pg_checksums on the old
instance before upgrading? This is significant additional complication.
And packagers who have built abstractions on top of pg_upgrade (such
as Debian pg_upgradecluster) would also need to implement something to
manage this somehow.
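
To make the incompatibility concrete: pg_upgrade refuses to proceed when the old and new clusters disagree on data checksums. A minimal Python sketch of that check follows; the function name, the boolean inputs, and the paraphrased messages are illustrative assumptions, not pg_upgrade's actual code.

```python
from typing import Tuple


def checksum_settings_compatible(old_checksums_on: bool,
                                 new_checksums_on: bool) -> Tuple[bool, str]:
    """Return (ok, reason) for a hypothetical pre-upgrade checksum check."""
    if old_checksums_on == new_checksums_on:
        return True, "checksum settings match"
    if new_checksums_on:
        # The case discussed here: old default (off), new default (on).
        return False, "old cluster does not use data checksums but the new one does"
    return False, "old cluster uses data checksums but the new one does not"


# Old instance with the current default, new instance with the proposed default.
print(checksum_settings_compatible(False, True))
```

With both clusters left at their respective defaults, the check fails, which is the upgrade hurdle described above.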

So I think we need to think through the upgrade experience a bit more.
Unfortunately, pg_checksums hasn't gotten to the point that we were
perhaps once hoping for that you could enable checksums on a live
system. I'm thinking pg_upgrade could have a mode where it adds the
checksum during the upgrade as it copies the files (essentially a subset
of pg_checksums). I think that would be useful for that middle tier of
users who just want a good default experience.

Michael Banck

michael.banck@credativ.de

over 1 year ago

In reply to: Peter Eisentraut (#5)

Re: Enable data checksums by default

On Thu, Aug 08, 2024 at 12:11:38PM +0200, Peter Eisentraut wrote:

So I think we need to think through the upgrade experience a bit more.
Unfortunately, pg_checksums hasn't gotten to the point that we were perhaps
once hoping for that you could enable checksums on a live system. I'm
thinking pg_upgrade could have a mode where it adds the checksum during the
upgrade as it copies the files (essentially a subset of pg_checksums). I
think that would be useful for that middle tier of users who just want a
good default experience.

Well that, or, as a first less ambitious step, pg_upgrade could carry
over the data_checksums setting from the old to the new instance by
essentially disabling it via pg_checksums -d (which is fast) if the
current default (off) is set on the old instance and the new instance
was created with the new one (checksums on).

Probably should include a warning or something in that case, though I
guess a lot of users will read just past it. But at least they are not
worse off than before.
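
A rough Python sketch of that carry-over decision, assuming a hypothetical helper that only builds the command line rather than running it. The function name and arguments are made up for illustration; `--disable` and `--pgdata` are the long forms of pg_checksums' `-d` and `-D` options.

```python
from typing import List, Optional


def carry_over_command(old_checksums_on: bool,
                       new_checksums_on: bool,
                       new_datadir: str) -> Optional[List[str]]:
    """Return the argv that would disable checksums on the new cluster,
    or None when no adjustment is needed (or the case is not covered)."""
    if not old_checksums_on and new_checksums_on:
        # Disabling only updates the control file, which is why it is fast.
        # The new cluster must be cleanly shut down when this runs.
        return ["pg_checksums", "--disable", "--pgdata", new_datadir]
    # Matching settings need nothing; old=on/new=off is outside this proposal.
    return None


print(carry_over_command(False, True, "/path/to/new/data"))
```

A real implementation would also print the warning mentioned above so the user knows the new cluster ended up without checksums.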

Michael

/messages/by-id/E07A611B-9CF3-4FDB-8CE8-A221E39040EC@yesql.se

daniel@yesql.se

over 1 year ago

In reply to: Peter Eisentraut (#5)

Re: Enable data checksums by default

On 8 Aug 2024, at 12:11, Peter Eisentraut <peter@eisentraut.org> wrote:

My understanding was that the reason for some hesitation about adopting data checksums was the performance impact. Not the checksumming itself, but the overhead from hint bit logging. The last time I looked into that, you could get performance impacts on the order of 5% tps. Maybe that's acceptable, and you of course can turn it off if you want the extra performance. But I think this should be discussed in this thread.

That's been my experience as well: the overhead of the checksumming itself is
negligible, but the overhead in WAL can be significant (having hint bits WAL
logged does carry other benefits as well, to be fair).

I think we need to think through the upgrade experience a bit more.

Unfortunately, pg_checksums hasn't gotten to the point that we were perhaps once hoping for that you could enable checksums on a live system.

I don't recall there being any work done (or plans for) using pg_checksums on a
live system. Anyone interested in enabling checksums on a live cluster can
however review the patch for that in:

I'm thinking pg_upgrade could have a mode where it adds the checksum during the upgrade as it copies the files (essentially a subset of pg_checksums). I think that would be useful for that middle tier of users who just want a good default experience.

As a side-note, I implemented this in pg_upgrade at Greenplum (IIRC it was
submitted to -hackers at the time as well) and it worked well with not a lot of
code.

--
Daniel Gustafsson

greg@turnstep.com

over 1 year ago

In reply to: Michael Paquier (#4)

Re: Enable data checksums by default

Thank you for the feedback. Please find attached three separate patches.
One to add a new flag to initdb (--no-data-checksums), one to adjust the
tests to use this flag as needed, and the final to make the actual switch
of the default value (along with tests and docs).

Cheers,
Greg

Robert Haas

robertmhaas@gmail.com

over 1 year ago

In reply to: Peter Eisentraut (#5)

Re: Enable data checksums by default

On Thu, Aug 8, 2024 at 6:11 AM Peter Eisentraut <peter@eisentraut.org> wrote:

About the claim that it's already the de-facto standard. Maybe that is
approximately true for "serious" installations. But AFAICT, the popular
packagings don't enable checksums by default, so there is likely a
significant middle tier between "just trying it out" and serious
production use that don't have it turned on.

+1.

I'm thinking pg_upgrade could have a mode where it adds the
checksum during the upgrade as it copies the files (essentially a subset
of pg_checksums). I think that would be useful for that middle tier of
users who just want a good default experience.

That would be very nice.

--
Robert Haas
EDB: http://www.enterprisedb.com

#10

tomas.vondra@2ndquadrant.com

over 1 year ago

In reply to: Robert Haas (#9)

Re: Enable data checksums by default

On 8/8/24 19:42, Robert Haas wrote:

On Thu, Aug 8, 2024 at 6:11 AM Peter Eisentraut <peter@eisentraut.org> wrote:

About the claim that it's already the de-facto standard. Maybe that is
approximately true for "serious" installations. But AFAICT, the popular
packagings don't enable checksums by default, so there is likely a
significant middle tier between "just trying it out" and serious
production use that don't have it turned on.

+1.

I'm thinking pg_upgrade could have a mode where it adds the
checksum during the upgrade as it copies the files (essentially a subset
of pg_checksums). I think that would be useful for that middle tier of
users who just want a good default experience.

That would be very nice.

Yeah, but it might also disable checksums on the new cluster, which
would work for link mode too. So we'd probably want multiple modes, one
to enable checksums during file copy, one to disable checksums, and one
to just fail for incompatible clusters.
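
The three modes could be sketched roughly as below. This is only an illustration in Python: the enum, the function, and the returned strings are hypothetical, and it assumes that checksums can only be added when files are actually copied (not in link mode).

```python
from enum import Enum


class MismatchPolicy(Enum):
    ENABLE_DURING_COPY = "enable-during-copy"
    DISABLE_ON_NEW = "disable-on-new"
    FAIL = "fail"


def plan_upgrade(old_on: bool, new_on: bool,
                 policy: MismatchPolicy, link_mode: bool) -> str:
    """Decide what a hypothetical pg_upgrade would do about checksums."""
    if old_on == new_on:
        return "proceed"
    if old_on and not new_on:
        # Not one of the cases discussed here; keep failing.
        return "fail"
    # From here on: old cluster without checksums, new cluster with them.
    if policy is MismatchPolicy.DISABLE_ON_NEW:
        return "disable checksums on new cluster"  # works for link mode too
    if policy is MismatchPolicy.ENABLE_DURING_COPY and not link_mode:
        return "add checksums while copying"
    return "fail"


print(plan_upgrade(False, True, MismatchPolicy.ENABLE_DURING_COPY, link_mode=True))
```

Requesting checksum enabling together with link mode falls through to "fail" in this sketch, since no file is rewritten in that mode.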

--
Tomas Vondra

#11

greg@turnstep.com

over 1 year ago

In reply to: Peter Eisentraut (#5)

Re: Enable data checksums by default

On Thu, Aug 8, 2024 at 6:11 AM Peter Eisentraut <peter@eisentraut.org>
wrote:

My understanding was that the reason for some hesitation about adopting
data checksums was the performance impact. Not the checksumming itself,
but the overhead from hint bit logging. The last time I looked into that,
you could get performance impacts on the order of 5% tps. Maybe that's
acceptable, and you of course can turn it off if you want the extra
performance. But I think this should be discussed in this thread.

Fair enough. I think the performance impact is acceptable, as evidenced by
the large number of people that turn it on. And it is easy enough to turn
it off again, either via --no-data-checksums or pg_checksums --disable.
I've come across people who have regretted not throwing a -k into their
initial initdb, but have not yet come across someone who has the opposite
regret. When I did some measurements some time ago, I found numbers much
less than 5%, but of course it depends on a lot of factors.

About the claim that it's already the de-facto standard. Maybe that is
approximately true for "serious" installations. But AFAICT, the popular
packagings don't enable checksums by default, so there is likely a
significant middle tier between "just trying it out" and serious
production use that don't have it turned on.

I would push back on that "significant" a good bit. The number of Postgres
installations in the cloud is very likely to dwarf the total package
installations. Maybe not 10 years ago, but now? Maybe someone from Amazon
can share some numbers. Not that we have any way to compare against package
installs :) But anecdotally the number of people who mention RDS etc. on
the various fora has exploded.

For those uses, this change would render pg_upgrade useless for upgrades
from an old instance with default settings to a new instance with default
settings. And then users would either need to re-initdb with checksums
turned back off, or I suppose run pg_checksums on the old instance before
upgrading? This is significant additional complication.

Meh, re-running initdb with --no-data-checksums seems a fairly low hurdle.

And packagers who have built abstractions on top of pg_upgrade (such as
Debian pg_upgradecluster) would also need to implement something to manage
this somehow.

How does it deal with clusters with checksums enabled now?

I'm thinking pg_upgrade could have a mode where it adds the checksum
during the upgrade as it copies the files (essentially a subset
of pg_checksums). I think that would be useful for that middle tier of
users who just want a good default experience.

Hm...might be a bad experience if it forces a switch out of --link mode.
Perhaps a warning at the end of pg_upgrade that suggests running
pg_checksums on your new cluster if you want to enable checksums?

Cheers,
Greg

#12

Robert Haas

robertmhaas@gmail.com

over 1 year ago

In reply to: Greg Sabino Mullane (#11)

Re: Enable data checksums by default

On Tue, Aug 13, 2024 at 10:42 AM Greg Sabino Mullane <htamfids@gmail.com> wrote:

Fair enough. I think the performance impact is acceptable, as evidenced by the large number of people that turn it on. And it is easy enough to turn it off again, either via --no-data-checksums or pg_checksums --disable.
When I did some measurements some time ago, I found numbers much less than 5%, but of course it depends on a lot of factors.

I think the bad case is when you have a write workload that is
significantly bigger than shared_buffers but still small enough to fit
comfortably in the OS cache. When everything fits in shared_buffers,
you only need to write dirty buffers once per checkpoint cycle, so
making it more expensive isn't necessarily a big deal. When you're
constantly going to disk, that's so expensive that you don't notice
the computational overhead. But when you're in that middle zone where
you keep evicting buffers from PG but not actually having to write
them down to the disk, then I think it's pretty noticeable.
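
The three regimes described above can be written down as a tiny classifier. This is a deliberately crude Python sketch: the function name, the labels, and the hard size thresholds are assumptions for illustration, since real behaviour depends on far more than three numbers.

```python
def checksum_overhead_zone(working_set_bytes: int,
                           shared_buffers_bytes: int,
                           os_cache_bytes: int) -> str:
    """Classify a write workload into the three regimes from the message."""
    if working_set_bytes <= shared_buffers_bytes:
        # Dirty buffers are written about once per checkpoint cycle.
        return "fits in shared_buffers: overhead not a big deal"
    if working_set_bytes <= os_cache_bytes:
        # Buffers keep getting evicted (and checksummed) without hitting disk.
        return "middle zone: overhead likely noticeable"
    # Real disk I/O dominates, hiding the computational cost.
    return "disk-bound: overhead hidden by I/O"


GB = 1024 ** 3
print(checksum_overhead_zone(20 * GB, 8 * GB, 64 * GB))
```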

I've come across people who have regretted not throwing a -k into their initial initdb, but have not yet come across someone who has the opposite regret.

I don't think this is really a fair comparison, because everything
being a little slower all the time is not something that people are
likely to "regret" in the same way that they regret it when a data
corruption issue goes undetected. An undetected data corruption issue
is a single, very painful event that people are likely to notice,
whereas a small performance loss over time kind of blends into the
background. You don't really regret that kind of thing in the same way
that you regret a bad event that happens at a particular moment in
time.

And it's not like we have statistics anywhere that you can look at to
see how much CPU time you spent computing checksums, so if a user DOES
have a performance problem that would not have occurred if checksums
had been disabled, they'll probably never know it.

For those uses, this change would render pg_upgrade useless for upgrades from an old instance with default settings to a new instance with default settings. And then users would either need to re-initdb with checksums turned back off, or I suppose run pg_checksums on the old instance before upgrading? This is significant additional complication.

Meh, re-running initdb with --no-data-checksums seems a fairly low hurdle.

I tend to agree with that, but I would also like to see the sort of
improvements that Peter mentions. It's a lot less work to say "let's
just change the default" and then get mad at anyone who disagrees than
it is to do the engineering to make changing the default less of a
problem. But that kind of engineering really adds a lot of value
compared to just changing the default.

None of that is to say that I'm totally hostile to this change.
Checksums don't actually prevent your data from getting corrupted, or
let you recover it after it does. They just tell you about the
problem, and very often you would have found out anyway. However, they
do have peace-of-mind value. If you've got checksums turned on, you
can verify your checksums regularly and see that they're OK, and
people like that. Whether that's worth the overhead for everyone, I'm
not quite sure.

--
Robert Haas
EDB: http://www.enterprisedb.com

#13

peter_e@gmx.net

over 1 year ago

In reply to: Robert Haas (#9)

Re: Enable data checksums by default

On 08.08.24 19:42, Robert Haas wrote:

I'm thinking pg_upgrade could have a mode where it adds the
checksum during the upgrade as it copies the files (essentially a subset
of pg_checksums). I think that would be useful for that middle tier of
users who just want a good default experience.

That would be very nice.

Here is a demo patch for that. It turned out to be quite simple.

I wrote above about a separate mode for that (like
--copy-and-make-adjustments), but it was just as easy to stick it into
the existing --copy mode.

It would be useful to check what the performance overhead of this is
versus a copy that does not have to make adjustments. I expect it's
very little.

A drawback is that as written this does not work on Windows, because
Windows uses a different code path in copyFile(). I don't know the
reasons for that. But it would need to be figured out.

#14

jakub.wartak@enterprisedb.com

over 1 year ago

In reply to: Greg Sabino Mullane (#3)

Re: Enable data checksums by default

On Wed, Aug 7, 2024 at 4:18 PM Greg Sabino Mullane <htamfids@gmail.com> wrote:

On Wed, Aug 7, 2024 at 4:43 AM Michael Banck <mbanck@gmx.net> wrote:

I think the last time we discussed this the consensus was that
computational overhead of computing the checksums is pretty small for
most systems (so the above change seems warranted regardless of whether
we switch the default), but turning on wal_compression also turns on
wal_log_hints, which can increase WAL by quite a lot. Maybe this is

[..]

Yeah, that seems something beyond this patch? Certainly we should mention wal_compression in the release notes if the default changes. I mean, I feel wal_log_hints should probably default to on as well, but I've honestly never really given it much thought because my fingers are trained to type "initdb -k". I've been using data checksums for roughly a decade now. I think the only time I've NOT used checksums was when I was doing checksum overhead measurements, or hacking on the pg_checksums program.

Maybe I don't understand something, but just to be clear:
wal_compression (mentioned above) does not turn wal_log_hints on;
rather, hint bit WAL logging is implicitly in effect when using data
checksums (via the XLogHintBitIsNeeded() macro). I suppose Michael
was thinking about wal_log_hints earlier (?)
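
For readers who do not have the macro in mind: as far as I recall, XLogHintBitIsNeeded() is simply "data checksums enabled OR wal_log_hints". A Python mirror of that logic, where the function and parameter names are illustrative stand-ins for the C macro and its inputs:

```python
def xlog_hint_bit_is_needed(data_checksums_enabled: bool,
                            wal_log_hints: bool) -> bool:
    """Mirror of the condition behind XLogHintBitIsNeeded() (as recalled)."""
    return data_checksums_enabled or wal_log_hints


# wal_compression is not an input at all: it only affects how full-page
# images are stored in WAL, not whether hint bit changes are logged.
for checksums in (False, True):
    for hints in (False, True):
        print(checksums, hints, xlog_hint_bit_is_needed(checksums, hints))
```

So enabling data checksums has the same effect on hint bit logging as setting wal_log_hints=on, regardless of wal_compression.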

-J.

#15

[1]: /messages/by-id/20190330192543.GH4719@development
[2]: https://www.enterprisedb.com/docs/pgd/4/deployments/tpaexec/

jakub.wartak@enterprisedb.com

over 1 year ago

In reply to: Greg Sabino Mullane (#11)

Re: Enable data checksums by default

Hi Greg and others

On Tue, Aug 13, 2024 at 4:42 PM Greg Sabino Mullane <htamfids@gmail.com> wrote:

On Thu, Aug 8, 2024 at 6:11 AM Peter Eisentraut <peter@eisentraut.org> wrote:

My understanding was that the reason for some hesitation about adopting data checksums was the performance impact. Not the checksumming itself, but the overhead from hint bit logging. The last time I looked into that, you could get performance impacts on the order of 5% tps. Maybe that's acceptable, and you of course can turn it off if you want the extra performance. But I think this should be discussed in this thread.

Fair enough. I think the performance impact is acceptable, as evidenced by the large number of people that turn it on. And it is easy enough to turn it off again, either via --no-data-checksums or pg_checksums --disable. I've come across people who have regretted not throwing a -k into their initial initdb, but have not yet come across someone who has the opposite regret. When I did some measurements some time ago, I found numbers much less than 5%, but of course it depends on a lot of factors.

Same here, and +1 to data_checksums=on by default for new installations.

The best public measurement of the impact that I know of was posted by
Tomas in 2019 [1], where he explicitly mentioned the problem with more
WAL from hints/checksums: SATA disks (low IOPS). My take: it is now 2024,
and most people are using at least SSDs or slow SATA (and in the cloud
they can just change the class of I/O if required to get enough IOPS to
avoid too much throttling), so the price of IOPS has dropped
significantly.

About the claim that it's already the de-facto standard. Maybe that is approximately true for "serious" installations. But AFAICT, the popular packagings don't enable checksums by default, so there is likely a significant middle tier between "just trying it out" and serious
production use that don't have it turned on.

I would push back on that "significant" a good bit. The number of Postgres installations in the cloud is very likely to dwarf the total package installations. Maybe not 10 years ago, but now? Maybe someone from Amazon can share some numbers. Not that we have any way to compare against package installs :) But anecdotally the number of people who mention RDS etc. on the various fora has exploded.

Same here. If it helps the case: 43% of all PostgreSQL DBs
involved in any support case or incident at EDB within the last year had
data_checksums=on (at least if they had collected the data using our ).
That's a surprisingly high number (for something that's off by
default), and it makes me think this is because plenty of customers
are either managed by DBAs who care, or assisted by consultants when
deploying, or simply using TPAexec [2], which has this on by default.

Another thing is that plenty of people run with wal_log_hints=on (without
data_checksums=on) just to have pg_rewind working. As this is a
strictly standby-related tool, it means they don't have WAL/network
bandwidth problems, so the WAL rate in the wild is not high enough to
cause problems. I found 1 or 2 cases within the last year where we
mentioned that high WAL generation was attributed to
wal_log_hints=on/XLOG_FPI, and they apparently still didn't disable it
(we have plenty of cases related to too much WAL, but it's mostly due
to other basic reasons).

-J.

#16

Michael Banck

michael.banck@credativ.de

over 1 year ago

In reply to: Jakub Wartak (#14)

Re: Enable data checksums by default

On Thu, Aug 15, 2024 at 09:49:04AM +0200, Jakub Wartak wrote:

On Wed, Aug 7, 2024 at 4:18 PM Greg Sabino Mullane <htamfids@gmail.com> wrote:

On Wed, Aug 7, 2024 at 4:43 AM Michael Banck <mbanck@gmx.net> wrote:

I think the last time we discussed this the consensus was that
computational overhead of computing the checksums is pretty small for
most systems (so the above change seems warranted regardless of whether
we switch the default), but turning on wal_compression also turns on
wal_log_hints, which can increase WAL by quite a lot. Maybe this is

[..]

Yeah, that seems something beyond this patch? Certainly we should
mention wal_compression in the release notes if the default changes.
I mean, I feel wal_log_hints should probably default to on as well,
but I've honestly never really given it much thought because my
fingers are trained to type "initdb -k". I've been using data
checksums for roughly a decade now. I think the only time I've NOT
used checksums was when I was doing checksum overhead measurements,
or hacking on the pg_checksums program.

Maybe I don't understand something, but just to be clear:
wal_compression (mentioned above) is not turning wal_log_hints on,
just the wal_log_hints needs to be on when using data checksums
(implicitly, by the XLogHintBitIsNeeded() macro). I suppose Michael
was thinking about the wal_log_hints earlier (?)

Uh, I am pretty sure I meant to say "turning on data_checksums also turns
on wal_log_hints", sorry about the confusion.

I guess the connection is that if you turn on wal_log_hints (either
directly or via data_checksums) then the number of FPIs goes up (possibly
significantly), and enabling wal_compression could (partly) remedy that.
But I agree with Greg that such a discussion is probably out-of-scope
for this default change.

Michael

#17

jakub.wartak@enterprisedb.com

over 1 year ago

In reply to: Robert Haas (#12)

Re: Enable data checksums by default

Hi all,

On Tue, Aug 13, 2024 at 10:08 PM Robert Haas <robertmhaas@gmail.com> wrote:

And it's not like we have statistics anywhere that you can look at to
see how much CPU time you spent computing checksums, so if a user DOES
have a performance problem that would not have occurred if checksums
had been disabled, they'll probably never know it.

In the worst case, per-second and per-PID CPU time consumption could be
quantified using eBPF, which is standard on distros now
(requires kernel headers and bpfcc-tools installed). E.g. here 7918
was the PID doing a pgbench-related -c 4 workload with checksums on (sorry
for formatting, but I don't want to use HTML here):

# funclatency-bpfcc --microseconds -i 1 -p 7918 /usr/lib/postgresql/16/bin/postgres:pg_checksum_page
Tracing 1 functions for "/usr/lib/postgresql/16/bin/postgres:pg_checksum_page"... Hit Ctrl-C to end.

     usecs               : count     distribution
         0 -> 1          : 0        |                                        |
         2 -> 3          : 238      |*************                           |
         4 -> 7          : 714      |****************************************|
         8 -> 15         : 2        |                                        |
        16 -> 31         : 5        |                                        |
        32 -> 63         : 0        |                                        |
        64 -> 127        : 1        |                                        |
       128 -> 255        : 0        |                                        |
       256 -> 511        : 1        |                                        |
       512 -> 1023       : 1        |                                        |

avg = 6 usecs, total: 6617 usecs, count: 962

     usecs               : count     distribution
         0 -> 1          : 0        |                                        |
         2 -> 3          : 241      |*************                           |
         4 -> 7          : 706      |****************************************|
         8 -> 15         : 11       |                                        |
        16 -> 31         : 10       |                                        |
        32 -> 63         : 1        |                                        |

avg = 5 usecs, total: 5639 usecs, count: 969

[..refreshes every 1s here..]

So the above can tell us e.g. that pg_checksum_page() took 5639
us out of the 1 s sample interval (with the core pegged at 100% CPU, so
that gives roughly 0.56% CPU utilization for this routine; I'm ignoring
the WAL/hint bit logging impact, of course). One could also write a small
script using bpftrace instead. Disassembly of the Debian version and stock
PGDG builds tells me it's using the full SSE2 instruction set, so that's
nice and optimal too.
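
Each summary line reports the total time spent in pg_checksum_page() during one sampling interval, so dividing by the interval length gives the fraction of one core that the routine consumed. A small Python sketch of that arithmetic, assuming the summary-line format shown in the output above and a 1-second interval (-i 1); the function name is made up for illustration.

```python
import re
from typing import List

# Matches lines such as "avg = 5 usecs, total: 5639 usecs, count: 969".
SUMMARY_RE = re.compile(r"avg = (\d+) usecs, total: (\d+) usecs, count: (\d+)")


def cpu_share_per_interval(output: str, interval_seconds: float = 1.0) -> List[float]:
    """Return, per sampling interval, the fraction of one core spent in the
    traced function (total usecs divided by the interval length in usecs)."""
    shares = []
    for match in SUMMARY_RE.finditer(output):
        total_usecs = int(match.group(2))
        shares.append(total_usecs / (interval_seconds * 1_000_000))
    return shares


sample = """avg = 6 usecs, total: 6617 usecs, count: 962
avg = 5 usecs, total: 5639 usecs, count: 969"""

shares = cpu_share_per_interval(sample)
print([f"{share:.2%}" for share in shares])  # ['0.66%', '0.56%']
```

This only covers the time spent computing checksums in one backend; it says nothing about the extra WAL from hint bit logging.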

For those uses, this change would render pg_upgrade useless for upgrades from an old instance with default settings to a new instance with default settings. And then users would either need to re-initdb with checksums turned back off, or I suppose run pg_checksums on the old instance before upgrading? This is significant additional complication.

Meh, re-running initdb with --no-data-checksums seems a fairly low hurdle.

I tend to agree with that, but I would also like to see the sort of
improvements that Peter mentions.

[..]

None of that is to say that I'm totally hostile to this change.

[..]

Whether that's worth the overhead for everyone, I'm not quite sure.

Without data checksums there's a risk that someone suffers silent bit
corruption and no one notices. Shouldn't integrity of data stand
above performance by default, in this case? (And performance could be
opt-in, if someone really wants it.)

-J.

#18

peter_e@gmx.net

over 1 year ago

In reply to: Peter Eisentraut (#13)

Re: Enable data checksums by default

On 15.08.24 08:38, Peter Eisentraut wrote:

On 08.08.24 19:42, Robert Haas wrote:

I'm thinking pg_upgrade could have a mode where it adds the
checksum during the upgrade as it copies the files (essentially a subset
of pg_checksums). I think that would be useful for that middle tier of
users who just want a good default experience.

That would be very nice.

Here is a demo patch for that. It turned out to be quite simple.

I wrote above about a separate mode for that (like
--copy-and-make-adjustments), but it was just as easy to stick it into
the existing --copy mode.

It would be useful to check what the performance overhead of this is
versus a copy that does not have to make adjustments. I expect it's
very little.

A drawback is that as written this does not work on Windows, because
Windows uses a different code path in copyFile(). I don't know the
reasons for that. But it would need to be figured out.

Here is an updated patch for this. I simplified the logic a bit and
also handle the case where the read() reads less than a round number of
blocks. I did some performance testing. The overhead of computing the
checksums versus a straight --copy without checksum adjustments appears
to be around 5% wall clock time, which seems ok to me. I also looked
around the documentation to see if there is anything to update, but
didn't find anything.

I think if we can work out what to do on Windows, this could be a useful
little feature for facilitating $subject.
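
The general shape of "checksum while copying, tolerating short reads" can be sketched as follows. This is Python, not the patch itself, and several things are assumptions made for illustration: the function names, the placeholder checksum (zlib.crc32 folded to 16 bits, which is NOT PostgreSQL's page checksum algorithm), the checksum field sitting at byte offset 8 of each page, and copying a trailing partial block unchanged. Details such as skipping new (all-zero) pages are ignored.

```python
import io
import zlib

BLCKSZ = 8192        # PostgreSQL's default block size
CHECKSUM_OFFSET = 8  # assumed offset of the 16-bit checksum field in a page


def placeholder_checksum(page: bytes, blkno: int) -> int:
    """Stand-in 16-bit checksum; not the real PostgreSQL algorithm."""
    zeroed = page[:CHECKSUM_OFFSET] + b"\x00\x00" + page[CHECKSUM_OFFSET + 2:]
    return (zlib.crc32(zeroed) ^ blkno) & 0xFFFF


def copy_with_checksums(src, dst, read_size: int = 4 * BLCKSZ) -> int:
    """Copy src to dst, stamping a checksum into every complete block.
    Returns the number of blocks stamped."""
    pending = b""
    blkno = 0
    while True:
        chunk = src.read(read_size)
        if not chunk:
            break
        pending += chunk
        # A read may return less than a round number of blocks: process
        # only whole blocks and keep the partial tail for the next read.
        while len(pending) >= BLCKSZ:
            page, pending = pending[:BLCKSZ], pending[BLCKSZ:]
            csum = placeholder_checksum(page, blkno)
            dst.write(page[:CHECKSUM_OFFSET] + csum.to_bytes(2, "little")
                      + page[CHECKSUM_OFFSET + 2:])
            blkno += 1
    if pending:
        dst.write(pending)  # trailing partial block copied as-is (assumption)
    return blkno


class ShortReader(io.BytesIO):
    """Test helper that never returns more than 5000 bytes per read."""
    def read(self, size=-1):
        return super().read(5000 if size < 0 else min(size, 5000))


data = bytes(range(256)) * 32 * 3  # exactly three 8192-byte blocks
full, short = io.BytesIO(), io.BytesIO()
print(copy_with_checksums(io.BytesIO(data), full),
      copy_with_checksums(ShortReader(data), short),
      full.getvalue() == short.getvalue())
```

The point of the buffering is that the result is identical whether read() delivers whole multiples of the block size or arbitrary fragments.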

#19

jakub.wartak@enterprisedb.com

over 1 year ago

In reply to: Peter Eisentraut (#18)

Re: Enable data checksums by default

On Thu, Aug 22, 2024 at 8:11 AM Peter Eisentraut <peter@eisentraut.org> wrote:

On 15.08.24 08:38, Peter Eisentraut wrote:

On 08.08.24 19:42, Robert Haas wrote:

I'm thinking pg_upgrade could have a mode where it adds the
checksum during the upgrade as it copies the files (essentially a subset
of pg_checksums). I think that would be useful for that middle tier of
users who just want a good default experience.

That would be very nice.

Here is a demo patch for that. It turned out to be quite simple.

I wrote above about a separate mode for that (like
--copy-and-make-adjustments), but it was just as easy to stick it into
the existing --copy mode.

It would be useful to check what the performance overhead of this is
versus a copy that does not have to make adjustments. I expect it's
very little.

A drawback is that as written this does not work on Windows, because
Windows uses a different code path in copyFile(). I don't know the
reasons for that. But it would need to be figured out.

Here is an updated patch for this. I simplified the logic a bit and
also handle the case where the read() reads less than a round number of
blocks. I did some performance testing. The overhead of computing the
checksums versus a straight --copy without checksum adjustments appears
to be around 5% wall clock time, which seems ok to me. I also looked
around the documentation to see if there is anything to update, but
didn't find anything.

I think if we can work out what to do on Windows, this could be a useful
little feature for facilitating $subject.

My take:
1. I wonder whether we should by default calculate/enable the
checksums when doing pg_upgrade --copy from a cluster with
checksums=off. Maybe we should error out on that like we are doing now.
There might still be people who want to have them off, but who would
blindly use the proposed new initdb defaults with checksums on (so this
should be opt-in via some switch, let's say
--copy-and-enable-checksums, so that the user is in full control).
2. WIN32's copyFile() could then stay as it is, and the new
--copy-and-enable-checksums on WIN32 would have to fall back to the
classic loop.

-J.

#20

peter_e@gmx.net

over 1 year ago

In reply to: Greg Sabino Mullane (#8)

Re: Enable data checksums by default

On 08.08.24 19:19, Greg Sabino Mullane wrote:

Thank you for the feedback. Please find attached three separate patches.
One to add a new flag to initdb (--no-data-checksums), one to adjust the
tests to use this flag as needed, and the final to make the actual
switch of the default value (along with tests and docs).

I think we can get started with the initdb --no-data-checksums option.

The 0001 patch is missing documentation and --help output for this
option. Also, some of the tests for the option that are in patch 0003
might be better in patch 0001.

Separately, this

-        may incur a noticeable performance penalty. If set, checksums
+        may incur a small performance penalty. If set, checksums

should perhaps be committed separately. I don't think the patch 0003
really changes the performance penalty. ;-)

#21

greg@turnstep.com

over 1 year ago

In reply to: Peter Eisentraut (#20)

#22

Bruce Momjian

bruce@momjian.us

over 1 year ago

In reply to: Peter Eisentraut (#20)

#23

Nathan Bossart

nathandbossart@gmail.com

over 1 year ago

In reply to: Bruce Momjian (#22)

#24

greg@turnstep.com

over 1 year ago

In reply to: Nathan Bossart (#23)

#25

peter_e@gmx.net

over 1 year ago

In reply to: Greg Sabino Mullane (#24)

#26

Nathan Bossart

nathandbossart@gmail.com

over 1 year ago

In reply to: Peter Eisentraut (#25)

#27

peter_e@gmx.net

over 1 year ago

In reply to: Nathan Bossart (#26)

#28

Nathan Bossart

nathandbossart@gmail.com

over 1 year ago

In reply to: Peter Eisentraut (#27)

#29

daniel@yesql.se

over 1 year ago

In reply to: Nathan Bossart (#28)

#30

peter_e@gmx.net

over 1 year ago

In reply to: Nathan Bossart (#28)

#31

peter_e@gmx.net

over 1 year ago

In reply to: Peter Eisentraut (#30)

#32

peter_e@gmx.net

over 1 year ago

In reply to: Peter Eisentraut (#31)

#33

daniel@yesql.se

over 1 year ago

In reply to: Peter Eisentraut (#32)

#34

Alvaro Herrera

alvherre@2ndquadrant.com

over 1 year ago

In reply to: Peter Eisentraut (#5)

#35

peter_e@gmx.net

over 1 year ago

In reply to: Alvaro Herrera (#34)

#36

Alvaro Herrera

alvherre@2ndquadrant.com

over 1 year ago

In reply to: Peter Eisentraut (#35)

#37

tomas.vondra@2ndquadrant.com

11 months ago

In reply to: Peter Eisentraut (#31)

#38

Christoph Berg

myon@debian.org

11 months ago

In reply to: Tomas Vondra (#37)

#39

peter_e@gmx.net

11 months ago

In reply to: Tomas Vondra (#37)

#40

Heikki Linnakangas

heikki.linnakangas@enterprisedb.com

10 months ago

In reply to: Peter Eisentraut (#39)

#41

peter_e@gmx.net

10 months ago

In reply to: Heikki Linnakangas (#40)

#42

daniel@yesql.se

10 months ago

In reply to: Heikki Linnakangas (#40)

#43

tomas.vondra@2ndquadrant.com

10 months ago

In reply to: Peter Eisentraut (#41)

#44

tomas.vondra@2ndquadrant.com

10 months ago

In reply to: Daniel Gustafsson (#42)

#45

daniel@yesql.se

10 months ago

In reply to: Tomas Vondra (#44)

#46

Bruce Momjian

bruce@momjian.us

10 months ago

In reply to: Heikki Linnakangas (#40)

#47

peter_e@gmx.net

9 months ago

In reply to: Tomas Vondra (#43)

#48

Christoph Berg

myon@debian.org

9 months ago

In reply to: Heikki Linnakangas (#40)

#49

peter_e@gmx.net

9 months ago

In reply to: Christoph Berg (#48)

#50

Bruce Momjian

bruce@momjian.us

9 months ago

In reply to: Peter Eisentraut (#49)

#51

tomas.vondra@2ndquadrant.com

7 months ago

In reply to: Peter Eisentraut (#47)

#52

laurenz.albe@cybertec.at

7 months ago

In reply to: Tomas Vondra (#51)

#53

daniel@yesql.se

7 months ago

In reply to: Laurenz Albe (#52)

#54

Greg Burd

greg@burd.me

7 months ago

In reply to: Daniel Gustafsson (#53)

#55

tomas.vondra@2ndquadrant.com

7 months ago

In reply to: Greg Burd (#54)

#56

laurenz.albe@cybertec.at

7 months ago

In reply to: Greg Burd (#54)

#57

laurenz.albe@cybertec.at

7 months ago

In reply to: Greg Burd (#54)

#58

laurenz.albe@cybertec.at

7 months ago

In reply to: Greg Burd (#54)

#59

greg@turnstep.com

7 months ago

In reply to: Laurenz Albe (#56)

#60

Jeff Davis

pgsql@j-davis.com

7 months ago

In reply to: Tomas Vondra (#55)

#61

Greg Burd

greg@burd.me

7 months ago

In reply to: Laurenz Albe (#56)

#62

Greg Burd

greg@burd.me

7 months ago

In reply to: Jeff Davis (#60)

#63

Ants Aasma

ants.aasma@cybertec.at

7 months ago

In reply to: Tomas Vondra (#55)

#64

Jim Nasby

Jim.Nasby@BlueTreble.com

7 months ago

In reply to: Ants Aasma (#63)

#65