Postgres Clustering

Started by Alan McKayalmost 17 years ago14 messagesgeneral
Jump to latest
#1Alan McKay
alan.mckay@gmail.com

Hey folks,

I have done some googling and found a few things on the matter. But
am looking for some suggestions from the experts out there.

Got any good pointers for reading material to help me get up to speed
on PostgreSQL clustering? What options are available? What are the
issues? Terminology. I'm pretty new to the whole data-warehouse
thing. And once I do all the reading, I'll even be open to product
recommendations :-)

And in particular since I already have heard of this particular
product - are there any opinions on Continuent?

thanks,
-Alan

--
“Mother Nature doesn’t do bailouts.”
- Glenn Prickett

#2Thomas Kellerer
spam_eater@gmx.net
In reply to: Alan McKay (#1)
Re: Postgres Clustering

Alan McKay wrote on 27.05.2009 19:57:

What options are available?

I guess a good starting point is the Postgres Wiki:

http://wiki.postgresql.org/wiki/Replication%2C_Clustering%2C_and_Connection_Pooling

#3Scott Mead
scott.lists@enterprisedb.com
In reply to: Alan McKay (#1)
Re: [PERFORM] Postgres Clustering

On Wed, May 27, 2009 at 1:57 PM, Alan McKay <alan.mckay@gmail.com> wrote:

Hey folks,

I have done some googling and found a few things on the matter. But
am looking for some suggestions from the experts out there.

Got any good pointers for reading material to help me get up to speed
on PostgreSQL clustering? What options are available? What are the
issues? Terminology. I'm pretty new to the whole data-warehouse
thing. And once I do all the reading, I'll even be open to product
recommendations :-)

And in particular since I already have heard of this particular
product - are there any opinions on Continuent?

What's your specific use case? Different types of clustering behave
differently depending on what you're trying to do.

If you're looking to parallelize large BI type queries something like
GridSQL or PGPool may make sense. If you're looking for more of an OLTP
solution, or multi-master replication, pgCluster will make more sense.

--Scott

#4Daniel van Ham Colchete
daniel.colchete@gmail.com
In reply to: Alan McKay (#1)
Re: [PERFORM] Postgres Clustering

Alan,

here I'm implementing something similar to the Chord protocol [1] on the
application level to partition my data across 6 PostgreSQL servers with N+1
replication. Two up sides on this approch:

1 - When one server is down the load is spread between all the other ones,
instead of going only to the replication pair.
2 - Adding one more server is easy, you only have to reallocate aprox. 1/n
of your data (n=number of servers).

Good luck there!

Best,
Daniel

On Wed, May 27, 2009 at 2:57 PM, Alan McKay <alan.mckay@gmail.com> wrote:

Show quoted text

Hey folks,

I have done some googling and found a few things on the matter. But
am looking for some suggestions from the experts out there.

Got any good pointers for reading material to help me get up to speed
on PostgreSQL clustering? What options are available? What are the
issues? Terminology. I'm pretty new to the whole data-warehouse
thing. And once I do all the reading, I'll even be open to product
recommendations :-)

And in particular since I already have heard of this particular
product - are there any opinions on Continuent?

thanks,
-Alan

--
“Mother Nature doesn’t do bailouts.”
- Glenn Prickett

--
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

#5Eddy Ernesto Baños Fernández
eebanos@estudiantes.uci.cu
In reply to: Alan McKay (#1)
Re: [PERFORM] Postgres Clustering

Try Cybercluster....

-----Mensaje original-----
De: pgsql-performance-owner@postgresql.org
[mailto:pgsql-performance-owner@postgresql.org] En nombre de Alan McKay
Enviado el: miércoles, 27 de mayo de 2009 13:57
Para: pgsql-performance@postgresql.org; pgsql-general@postgresql.org
Asunto: [PERFORM] Postgres Clustering

Hey folks,

I have done some googling and found a few things on the matter. But
am looking for some suggestions from the experts out there.

Got any good pointers for reading material to help me get up to speed
on PostgreSQL clustering? What options are available? What are the
issues? Terminology. I'm pretty new to the whole data-warehouse
thing. And once I do all the reading, I'll even be open to product
recommendations :-)

And in particular since I already have heard of this particular
product - are there any opinions on Continuent?

thanks,
-Alan

--
“Mother Nature doesn’t do bailouts.”
- Glenn Prickett

--
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

#6Greg Smith
gsmith@gregsmith.com
In reply to: Alan McKay (#1)
Re: Postgres Clustering

On Wed, 27 May 2009, Alan McKay wrote:

Got any good pointers for reading material to help me get up to speed
on PostgreSQL clustering? What options are available? What are the
issues? Terminology.

http://wiki.postgresql.org/wiki/Replication%2C_Clustering%2C_and_Connection_Pooling
is where I keep my notes, which has worked out great for me because then
other people fix mistakes and keep everything current.

And in particular since I already have heard of this particular
product - are there any opinions on Continuent?

If what you need is fairly high-level replication where you can handle
some log replication overhead, one of their replicators might work well
for you. I usually work more with OLTP systems, where write volume is too
high for something running that far outside of the database to perform
well enough. You might note that Continuent's solutions advertise
"Scaling of read-intensive applications" rather than write ones.

By the way: cross-posting on these lists is generally frowned upon. It
causes problems for people who reply to you but are aren't on all of the
lists you sent to. If you're not sure what list something should go on,
just send it to -general rather than cc'ing multiple ones.

--
* Greg Smith gsmith@gregsmith.com http://www.gregsmith.com Baltimore, MD

#7Kevin Kempter
kevink@consistentstate.com
In reply to: Eddy Ernesto Baños Fernández (#5)
Re: [PERFORM] Postgres Clustering

On Wednesday 27 May 2009 12:55:51 Eddy Ernesto Baños Fernández wrote:

Try Cybercluster....

-----Mensaje original-----
De: pgsql-performance-owner@postgresql.org
[mailto:pgsql-performance-owner@postgresql.org] En nombre de Alan McKay
Enviado el: miércoles, 27 de mayo de 2009 13:57
Para: pgsql-performance@postgresql.org; pgsql-general@postgresql.org
Asunto: [PERFORM] Postgres Clustering

Hey folks,

I have done some googling and found a few things on the matter. But
am looking for some suggestions from the experts out there.

Got any good pointers for reading material to help me get up to speed
on PostgreSQL clustering? What options are available? What are the
issues? Terminology. I'm pretty new to the whole data-warehouse
thing. And once I do all the reading, I'll even be open to product
recommendations :-)

And in particular since I already have heard of this particular
product - are there any opinions on Continuent?

Continuent works (AFAIK) like pgpool clustering, it sends the same statements
to both/all servers in the cluster but it has no insight to the servers beyond
this, so if via a direct connection server A becomes out of sync with server B
then continuent is oblivious.

Other tools to look at:
- EnterpriseDB's GridSQL
- SLONY
- Command Prompt's PG Replicator

Show quoted text

thanks,
-Alan

--
“Mother Nature doesn’t do bailouts.”
- Glenn Prickett

--
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

#8Alan McKay
alan.mckay@gmail.com
In reply to: Greg Smith (#6)
Re: Postgres Clustering

By the way:  cross-posting on these lists is generally frowned upon. It
causes problems for people who reply to you but are aren't on all of the
lists you sent to.  If you're not sure what list something should go on,
just send it to -general rather than cc'ing multiple ones.

Duly noted!

Thanks for all the input so far folks.

--
“Mother Nature doesn’t do bailouts.”
- Glenn Prickett

#9Alan McKay
alan.mckay@gmail.com
In reply to: Kevin Kempter (#7)
Re: [PERFORM] Postgres Clustering

Continuent works (AFAIK) like pgpool clustering, it sends the same
statements to both/all servers in the cluster but it has no insight to the
servers beyond this, so if via a direct connection server A becomes out of
sync with server B then continuent is oblivious.

So can the same be said for pgpool then?

thanks,
-Alan

--
“Mother Nature doesn’t do bailouts.”
- Glenn Prickett

#10Kevin Kempter
kevink@consistentstate.com
In reply to: Alan McKay (#9)
Re: [PERFORM] Postgres Clustering

On Wednesday 27 May 2009 13:33:55 Alan McKay wrote:

Continuent works (AFAIK) like pgpool clustering, it sends the same
statements to both/all servers in the cluster but it has no insight to
the servers beyond this, so if via a direct connection server A becomes
out of sync with server B then continuent is oblivious.

So can the same be said for pgpool then?

Yes

Show quoted text

thanks,
-Alan

--
“Mother Nature doesn’t do bailouts.”
- Glenn Prickett

#11Dimitri Fontaine
dimitri@2ndQuadrant.fr
In reply to: Alan McKay (#1)
Re: [PERFORM] Postgres Clustering

Hi,

Le 27 mai 09 à 19:57, Alan McKay a écrit :

I have done some googling and found a few things on the matter. But
am looking for some suggestions from the experts out there.

Got any good pointers for reading material to help me get up to speed
on PostgreSQL clustering? What options are available? What are the
issues? Terminology. I'm pretty new to the whole data-warehouse
thing. And once I do all the reading, I'll even be open to product
recommendations :-)

Depending on your exact needs, which the terminology you're using only
allow to guess about, you might enjoy this reading:
http://wiki.postgresql.org/wiki/Image:Moskva_DB_Tools.v3.pdf

--
dim

#12Alan McKay
alan.mckay@gmail.com
In reply to: Dimitri Fontaine (#11)
Re: [PERFORM] Postgres Clustering

Depending on your exact needs, which the terminology you're using only allow
to guess about, you might enjoy this reading:
 http://wiki.postgresql.org/wiki/Image:Moskva_DB_Tools.v3.pdf

Thanks. To be honest I don't even know myself what my needs are yet.
I've only been on the job here for a month now. And one thing I
learned at PGCon last week is that I have a lot of benchmarching work
to do before I can figure out what my needs are!

At this point I'm just trying to read as much as I can on the general
topic of "clustering". What I am most interested in is load
balancing to be able to scale up when required, and of course to be
able to determine ahead of time when that might be :-)

cheers,
-Alan

--
“Mother Nature doesn’t do bailouts.”
- Glenn Prickett

#13Scot Kreienkamp
SKreien@la-z-boy.com
In reply to: Alan McKay (#1)
Re: Postgres Clustering

We didn't have much luck with Continuent. They had to make multiple
code level changes to get their product to work correctly with our app
on PG 8.2. We never did get it successfully implemented. At this point
I'm stuck with WAL shipping as I can't find anything that fits my
constraints.

Thanks,

Scot Kreienkamp

-----Original Message-----
From: pgsql-general-owner@postgresql.org
[mailto:pgsql-general-owner@postgresql.org] On Behalf Of Alan McKay
Sent: Wednesday, May 27, 2009 1:57 PM
To: pgsql-performance@postgresql.org; pgsql-general@postgresql.org
Subject: [GENERAL] Postgres Clustering

Hey folks,

I have done some googling and found a few things on the matter. But
am looking for some suggestions from the experts out there.

Got any good pointers for reading material to help me get up to speed
on PostgreSQL clustering? What options are available? What are the
issues? Terminology. I'm pretty new to the whole data-warehouse
thing. And once I do all the reading, I'll even be open to product
recommendations :-)

And in particular since I already have heard of this particular
product - are there any opinions on Continuent?

thanks,
-Alan

--
"Mother Nature doesn't do bailouts."
- Glenn Prickett

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

#14Tim Uckun
timuckun@gmail.com
In reply to: Eddy Ernesto Baños Fernández (#5)
Re: [PERFORM] Postgres Clustering

2009/5/28 Eddy Ernesto Baños Fernández <eebanos@estudiantes.uci.cu>:

Try Cybercluster....

I looked into that. There is one piece of documentation that is less
than ten pages long. There is no users group, no listserve, no
community that I can discern.

Do you have experience with it and if so could you please share.

Thanks.