Up to date conventional wisdom re max shared_buffer size?

Started by Jerry Sieversover 8 years ago10 messagesgeneral

gsievers19@comcast.net

over 8 years ago

Briefly, just curious if legacy max values for shared_buffers have
scaled up since 8G was like 25% of RAM?

Pg 9.3 on monster 2T/192 CPU Xenial thrashing

Upgrade pending but we recently started having $interesting performance
issues at times looking like I/O slowness and other times apparently
causing CPU spins.

The DB is 10TB total size with OLTP plus some occasional heavy batching
which frequently correlates with degradation that requires intervention.

Unrelated server problem forced us to relocate from a Debian/Wheezy 3.x
kernel 1T 144 CPU to the even bigger box mentioned earlier. And we wen
up a major kernel version also in the process.

Anyway, shared_buffer coherency generally high but does take big dips
that are sometimes sustained for seconds or even minutes.

shared_buffers only 20G which is relatively very small vs total machine
RAM however we do not have the luxury of scheduled downtime so this and
other settings requiring a full restart are not touched without good
reason.

Thanks

--
Jerry Sievers
Postgres DBA/Development Consulting
e: postgres.consulting@comcast.net
p: 312.241.7800

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

andres@anarazel.de

over 8 years ago

In reply to: Jerry Sievers (#1)

Re: Up to date conventional wisdom re max shared_buffer size?

Hi,

On 2017-09-19 17:00:05 -0500, Jerry Sievers wrote:

Briefly, just curious if legacy max values for shared_buffers have
scaled up since 8G was like 25% of RAM?

It's very workload dependent. I've successfully used PG with roughly 1TB
of shared buffers, where that performed better than lower
settings.

Pg 9.3 on monster 2T/192 CPU Xenial thrashing

Not sure what the word "thrashing" in that sentence means.

Things have improved a lot since 9.3 WRT to scalability, so I'd not
infer too much from 9.3 performance on a larger box.

Upgrade pending but we recently started having $interesting performance
issues at times looking like I/O slowness and other times apparently
causing CPU spins.

That's not something we can really usefully comment on given the amount
of information.

Anyway, shared_buffer coherency generally high but does take big dips
that are sometimes sustained for seconds or even minutes.

"shared_buffer coherency"?

Greetings,

Andres Freund

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

ron.l.johnson@cox.net

over 8 years ago

In reply to: Andres Freund (#2)

Re: Up to date conventional wisdom re max shared_buffer size?

On 09/19/2017 05:00 PM, Jerry Sievers wrote:
[snip]

The DB is 10TB total size with OLTP plus some occasional heavy batching
which frequently correlates with degradation that requires intervention.

Unrelated server problem forced us to relocate from a Debian/Wheezy 3.x
kernel 1T 144 CPU to the even bigger box mentioned earlier. And we wen
up a major kernel version also in the process.

How did you backup/restore a 10TB db?

--
World Peace Through Nuclear Pacification

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Import Notes

Reply to msg id not found: Ba2l1w02A1DF5NZ01a2npYReference msg id not found: Ba2l1w02A1DF5NZ01a2npY | Resolved by subject fallback

gsievers19@comcast.net

over 8 years ago

In reply to: Andres Freund (#2)

Re: Up to date conventional wisdom re max shared_buffer size?

Thanks Andres! See inline...

Andres Freund <andres@anarazel.de> writes:

Hi,

On 2017-09-19 17:00:05 -0500, Jerry Sievers wrote:

Briefly, just curious if legacy max values for shared_buffers have
scaled up since 8G was like 25% of RAM?

It's very workload dependent. I've successfully used PG with roughly 1TB
of shared buffers, where that performed better than lower
settings.

Wow! Ok

Pg 9.3 on monster 2T/192 CPU Xenial thrashing

Not sure what the word "thrashing" in that sentence means.

Cases of dozens or hundreds of sessions running typical statements for
this system but running 100% on their CPUs. Seems to be triggered by
certain heavy weight batch jobs kicking off on this generally OLTP
system.

ISTM there might be LW lock contention happening around some sort of
shared resource where the lock wait implementation is a CPU spinner.

Things have improved a lot since 9.3 WRT to scalability, so I'd not
infer too much from 9.3 performance on a larger box.

Understood. The situation got worse when we moved to the even bigger
box also running a 4.x kernel which I presume was no where near existent
when 9.3 was our current Pg version.

Upgrade pending but we recently started having $interesting performance
issues at times looking like I/O slowness and other times apparently
causing CPU spins.

That's not something we can really usefully comment on given the amount
of information.

Ack'd.

I'd like to strace some of the spinning backends when/if we get another
opportunity to observe the problem to see if by syscall or libfunc name
we can learn more about what's the cause.

Anyway, shared_buffer coherency generally high but does take big dips
that are sometimes sustained for seconds or even minutes.

"shared_buffer coherency"?

As measured querying pg_stat_databases and comparing total reads to read
hits. Run frequently such as once /5-seconds and factored into a hit
percentage. May stay up around 100% for several ticks but then go way
down which may or not sustain.

This is an OLTP app using Rails with hundreds of tables both trivial
n structure as well as having partitions, large payloads... TOAST and
the like.

TPS can measure in the ~5-10k range.

Thx again

Greetings,

Andres Freund

--
Jerry Sievers
Postgres DBA/Development Consulting
e: postgres.consulting@comcast.net
p: 312.241.7800

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

gsievers19@comcast.net

over 8 years ago

In reply to: Ron Johnson (#3)

Re: Up to date conventional wisdom re max shared_buffer size?

Ron Johnson <ron.l.johnson@cox.net> writes:

On 09/19/2017 05:00 PM, Jerry Sievers wrote:
[snip]

The DB is 10TB total size with OLTP plus some occasional heavy batching
which frequently correlates with degradation that requires intervention.

Unrelated server problem forced us to relocate from a Debian/Wheezy 3.x
kernel 1T 144 CPU to the even bigger box mentioned earlier. And we wen
up a major kernel version also in the process.

How did you backup/restore a 10TB db?

We just relocated the SAN volume. Takes about 1 minute :-)

--
World Peace Through Nuclear Pacification

--
Jerry Sievers
Postgres DBA/Development Consulting
e: postgres.consulting@comcast.net
p: 312.241.7800

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

andres@anarazel.de

over 8 years ago

In reply to: Jerry Sievers (#4)

Re: Up to date conventional wisdom re max shared_buffer size?

On 2017-09-20 13:00:34 -0500, Jerry Sievers wrote:

Pg 9.3 on monster 2T/192 CPU Xenial thrashing

Not sure what the word "thrashing" in that sentence means.

Cases of dozens or hundreds of sessions running typical statements for
this system but running 100% on their CPUs. Seems to be triggered by
certain heavy weight batch jobs kicking off on this generally OLTP
system.

ISTM there might be LW lock contention happening around some sort of
shared resource where the lock wait implementation is a CPU spinner.

Yes, we improved that a lot in 9.5, 9.6 and 10. The really bad
scenarios - I've seen 95% cpu time spent in locking - should all be
fixed.

I'd try to make sure that both transparent hugepages and zone reclaim
mode are disabled - the latter probably is already, but the former might
still cause some problems.

Things have improved a lot since 9.3 WRT to scalability, so I'd not
infer too much from 9.3 performance on a larger box.

Understood. The situation got worse when we moved to the even bigger
box also running a 4.x kernel which I presume was no where near existent
when 9.3 was our current Pg version.

I suspect it's more the bigger box than the newer kernel. The more
sockets and cores you have, the more lock contention bites you. That's
because inter-socket / cpu transfers get more expensive with more cores.

Upgrade pending but we recently started having $interesting performance
issues at times looking like I/O slowness and other times apparently
causing CPU spins.

That's not something we can really usefully comment on given the amount
of information.

Ack'd.

I'd like to strace some of the spinning backends when/if we get another
opportunity to observe the problem to see if by syscall or libfunc name
we can learn more about what's the cause.

I think the causes are known, and fixed - don't think there's much you
can do besides upgrading, unless you want to backport a number of
complex patches yourself.

FWIW, usually perf gives better answers than strace in this type of
scenario.

Anyway, shared_buffer coherency generally high but does take big dips
that are sometimes sustained for seconds or even minutes.

"shared_buffer coherency"?

As measured querying pg_stat_databases and comparing total reads to read
hits. Run frequently such as once /5-seconds and factored into a hit
percentage. May stay up around 100% for several ticks but then go way
down which may or not sustain.

This is an OLTP app using Rails with hundreds of tables both trivial
n structure as well as having partitions, large payloads... TOAST and
the like.

TPS can measure in the ~5-10k range.

That's cache hit rate, not coherency ;)

- Andres

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

scott.marlowe@gmail.com

over 8 years ago

In reply to: Jerry Sievers (#1)

Re: Up to date conventional wisdom re max shared_buffer size?

On Tue, Sep 19, 2017 at 4:00 PM, Jerry Sievers <gsievers19@comcast.net> wrote:

Briefly, just curious if legacy max values for shared_buffers have
scaled up since 8G was like 25% of RAM?

Pg 9.3 on monster 2T/192 CPU Xenial thrashing

Upgrade pending but we recently started having $interesting performance
issues at times looking like I/O slowness and other times apparently
causing CPU spins.

Have you looked at things like zone reclaim mode and transparent huge
pages? Both of those can cause odd problems. Also it's usually a good
idea to turn off swap as the linux kernel, presented with lots of ram
and a small (by comparison) swap file sometimes makes bad life choices
and starts using swap for things like storing currently unused shared
buffers or something.

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

ron.l.johnson@cox.net

over 8 years ago

In reply to: Ron Johnson (#3)

Re: Up to date conventional wisdom re max shared_buffer size?

On 09/20/2017 01:05 PM, Jerry Sievers wrote:

Ron Johnson <ron.l.johnson@cox.net> writes:

On 09/19/2017 05:00 PM, Jerry Sievers wrote:
[snip]

The DB is 10TB total size with OLTP plus some occasional heavy batching
which frequently correlates with degradation that requires intervention.

Unrelated server problem forced us to relocate from a Debian/Wheezy 3.x
kernel 1T 144 CPU to the even bigger box mentioned earlier. And we wen
up a major kernel version also in the process.

How did you backup/restore a 10TB db?

We just relocated the SAN volume. Takes about 1 minute :-)

Ah, yes. Major *kernel* version. Never mind... :)

--
World Peace Through Nuclear Pacification

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Import Notes

Reply to msg id not found: Bu541w0403fZUi801u5oTo

gsievers19@comcast.net

over 8 years ago

In reply to: Scott Marlowe (#7)

Re: Up to date conventional wisdom re max shared_buffer size?

Scott Marlowe <scott.marlowe@gmail.com> writes:

On Tue, Sep 19, 2017 at 4:00 PM, Jerry Sievers <gsievers19@comcast.net> wrote:

Briefly, just curious if legacy max values for shared_buffers have
scaled up since 8G was like 25% of RAM?

Pg 9.3 on monster 2T/192 CPU Xenial thrashing

Upgrade pending but we recently started having $interesting performance
issues at times looking like I/O slowness and other times apparently
causing CPU spins.

Have you looked at things like zone reclaim mode and transparent huge
pages? Both of those can cause odd problems. Also it's usually a good
idea to turn off swap as the linux kernel, presented with lots of ram
and a small (by comparison) swap file sometimes makes bad life choices
and starts using swap for things like storing currently unused shared
buffers or something.

Not sure but we're checking into these items. Thanks

--
Jerry Sievers
Postgres DBA/Development Consulting
e: postgres.consulting@comcast.net
p: 312.241.7800

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

gsievers19@comcast.net

over 8 years ago

In reply to: Andres Freund (#6)

Re: Up to date conventional wisdom re max shared_buffer size?

Andres Freund <andres@anarazel.de> writes:

On 2017-09-20 13:00:34 -0500, Jerry Sievers wrote:

Pg 9.3 on monster 2T/192 CPU Xenial thrashing

Not sure what the word "thrashing" in that sentence means.

Cases of dozens or hundreds of sessions running typical statements for
this system but running 100% on their CPUs. Seems to be triggered by
certain heavy weight batch jobs kicking off on this generally OLTP
system.

ISTM there might be LW lock contention happening around some sort of
shared resource where the lock wait implementation is a CPU spinner.

Yes, we improved that a lot in 9.5, 9.6 and 10. The really bad
scenarios - I've seen 95% cpu time spent in locking - should all be
fixed.

Yup, as I suspected from studying the rel notes but some shops including
ours are slow getting enough app dev and QA resources aligned for DB
system major upgrades thus we have ended up doing them 2 versions at a
time prior and this time 3 versions up :-(

I'd try to make sure that both transparent hugepages and zone reclaim
mode are disabled - the latter probably is already, but the former might
still cause some problems.

Ack. We'll research this.

Things have improved a lot since 9.3 WRT to scalability, so I'd not
infer too much from 9.3 performance on a larger box.

Understood. The situation got worse when we moved to the even bigger
box also running a 4.x kernel which I presume was no where near existent
when 9.3 was our current Pg version.

I suspect it's more the bigger box than the newer kernel. The more
sockets and cores you have, the more lock contention bites you. That's
because inter-socket / cpu transfers get more expensive with more cores.

Ack

Upgrade pending but we recently started having $interesting performance
issues at times looking like I/O slowness and other times apparently
causing CPU spins.

That's not something we can really usefully comment on given the amount
of information.

Ack'd.

I'd like to strace some of the spinning backends when/if we get another
opportunity to observe the problem to see if by syscall or libfunc name
we can learn more about what's the cause.

I think the causes are known, and fixed - don't think there's much you
can do besides upgrading, unless you want to backport a number of
complex patches yourself.

FWIW, usually perf gives better answers than strace in this type of
scenario.

Anyway, shared_buffer coherency generally high but does take big dips
that are sometimes sustained for seconds or even minutes.

"shared_buffer coherency"?

As measured querying pg_stat_databases and comparing total reads to read
hits. Run frequently such as once /5-seconds and factored into a hit
percentage. May stay up around 100% for several ticks but then go way
down which may or not sustain.

This is an OLTP app using Rails with hundreds of tables both trivial
n structure as well as having partitions, large payloads... TOAST and
the like.

TPS can measure in the ~5-10k range.

That's cache hit rate, not coherency ;)

My bad

Thanks again.

- Andres

--
Jerry Sievers
Postgres DBA/Development Consulting
e: postgres.consulting@comcast.net
p: 312.241.7800

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general