Online backup vs Continuous backup

Started by Bruce Momjianover 20 years ago25 messageshackersdocs

bruce@momjian.us

over 20 years ago

hackersdocs

I noticed that we are using the term "Online Backup" in our
documentation when we are talking about continuous backup and PITR.

To me, "online backup" is doing a backup while the system is online
(online-backup), and that is accomplished by pg_dump. I know a lot of
databases us "Online Backup" but I assume this is for historical reasons
because at some time in the past their full backups didn't work while
the database was online. Other systems use the term "Continuous
Logging", but I think that is too easily confused with the server
activity logs.

I suggest the following patch to rename our capability "Continuous
Backup".

-- 
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073

Tom Lane

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: Bruce Momjian (#1)

hackersdocs

Re: Online backup vs Continuous backup

Bruce Momjian <pgman@candle.pha.pa.us> writes:

I suggest the following patch to rename our capability "Continuous
Backup".

This doesn't seem like an improvement. "Online backup" is the standard
terminology AFAIK.

regards, tom lane

Bruce Momjian

bruce@momjian.us

over 20 years ago

In reply to: Tom Lane (#2)

hackersdocs

Re: Online backup vs Continuous backup

Tom Lane wrote:

Bruce Momjian <pgman@candle.pha.pa.us> writes:

I suggest the following patch to rename our capability "Continuous
Backup".

This doesn't seem like an improvement. "Online backup" is the standard
terminology AFAIK.

But why is it the standard terminology? It doesn't seem logical.

-- 
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073

Joshua D. Drake

jd@commandprompt.com

over 20 years ago

In reply to: Bruce Momjian (#1)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

Bruce Momjian wrote:

I noticed that we are using the term "Online Backup" in our
documentation when we are talking about continuous backup and PITR.

To me, "online backup" is doing a backup while the system is online
(online-backup), and that is accomplished by pg_dump. I know a lot of
databases us "Online Backup" but I assume this is for historical reasons
because at some time in the past their full backups didn't work while
the database was online. Other systems use the term "Continuous
Logging", but I think that is too easily confused with the server
activity logs.

I suggest the following patch to rename our capability "Continuous
Backup".

I have never heard the term Continuous backup. Although I have heard
online backup. The problem is that when I hear the term online backup
I think Hot backup which is what we do with pg_dump.

I would just call the other Point in time recovery :)

Show quoted text

Bruce Momjian

bruce@momjian.us

over 20 years ago

In reply to: Joshua D. Drake (#4)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

Joshua D. Drake wrote:

Bruce Momjian wrote:

I noticed that we are using the term "Online Backup" in our
documentation when we are talking about continuous backup and PITR.

To me, "online backup" is doing a backup while the system is online
(online-backup), and that is accomplished by pg_dump. I know a lot of
databases us "Online Backup" but I assume this is for historical reasons
because at some time in the past their full backups didn't work while
the database was online. Other systems use the term "Continuous
Logging", but I think that is too easily confused with the server
activity logs.

I suggest the following patch to rename our capability "Continuous
Backup".

I have never heard the term Continuous backup. Although I have heard
online backup. The problem is that when I hear the term online backup
I think Hot backup which is what we do with pg_dump.

Yes, that is my problem too.

I would just call the other Point in time recovery :)

The problem there is that it is "recovery", and the documentation talks
about the "backup" step. I am thinking we are going to have to add
something like "Continuous backup, also called Online Backup" or
something like that.

-- 
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073

Heikki Linnakangas

heikki.linnakangas@enterprisedb.com

over 20 years ago

In reply to: Bruce Momjian (#5)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

On Mon, 26 Dec 2005, Bruce Momjian wrote:

Joshua D. Drake wrote:

I have never heard the term Continuous backup. Although I have heard
online backup. The problem is that when I hear the term online backup
I think Hot backup which is what we do with pg_dump.

Yes, that is my problem too.

I would just call the other Point in time recovery :)

The problem there is that it is "recovery", and the documentation talks
about the "backup" step. I am thinking we are going to have to add
something like "Continuous backup, also called Online Backup" or
something like that.

FWIW, I believe "log archival" is a common term for that.

- Heikki

Peter Eisentraut

peter_e@gmx.net

over 20 years ago

In reply to: Heikki Linnakangas (#6)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

Am Montag, 26. Dezember 2005 20:46 schrieb Heikki Linnakangas:

FWIW, I believe "log archival" is a common term for that.

I like that better, although there are a lot of logs, so maybe "transaction
log archival".

Considering the list originally referenced:

- File system backup
- SQL dump
- (Transaction log archival?)

This addresses the technical workings of the various backup systems. The fact
that it is or isn't on-line, hot, continuous, fast, or flexible is secondary.

Bruce Momjian

bruce@momjian.us

over 20 years ago

In reply to: Peter Eisentraut (#7)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

Peter Eisentraut wrote:

Am Montag, 26. Dezember 2005 20:46 schrieb Heikki Linnakangas:

FWIW, I believe "log archival" is a common term for that.

I like that better, although there are a lot of logs, so maybe
"transaction log archival".

Yep, when I think of log archiving, I think of saving the server logs,
which thanks to log file rotation, we kind of support.

The problem is that when I think of the transaction log, I think of clog
because it is the transaction status log. :-(

I am thinking anything that says "log" is going to be confusing. How about
"Transaction Archiving"?

Considering the list originally referenced:

- File system backup - SQL dump - (Transaction log archival?)

This addresses the technical workings of the various backup systems.
The fact that it is or isn't on-line, hot, continuous, fast, or flexible
is secondary.

To me, the continuous activity is the significant feature of that backup
method. I chose "Continuous Backup" because it is the continual
activity that is significant. "Continuous Archiving"?

--
  Bruce Momjian                        |  http://candle.pha.pa.us
  pgman@candle.pha.pa.us               |  (610) 359-1001
  +  If your life is a hard drive,     |  13 Roberts Road
  +  Christ can be your backup.        |  Newtown Square, Pennsylvania 19073

Bruce Momjian

bruce@momjian.us

over 20 years ago

In reply to: Bruce Momjian (#1)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

Bruce Momjian <pgman@candle.pha.pa.us> writes:

I noticed that we are using the term "Online Backup" in our
documentation when we are talking about continuous backup and PITR.

To me, "online backup" is doing a backup while the system is online
(online-backup), and that is accomplished by pg_dump.

Well fwiw what pg_dump does isn't even considered a "backup" at all in other
systems. It's a "logical export" or something of that sort.

It's not considered a "backup" because it's saving something different than
the actual physical database. When you restore you get something (hopefully)
logically equivalent but still physically different.

Hot backups which is what you read "online backup" to mean is pretty closely
tied to PITR log archiving backups. One isn't very useful without the other.
Specifically, you can have all the archived logs in the world but if you don't
have a backup to start from they're useless. And in postgres I gather hot
backups aren't very useful if you don't have the transaction logs necessary to
fix any torn pages.

--
greg

#10

Tom Lane

tgl@sss.pgh.pa.us

over 20 years ago

In reply to: Bruce Momjian (#8)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

Bruce Momjian <pgman@candle.pha.pa.us> writes:

This addresses the technical workings of the various backup systems.
The fact that it is or isn't on-line, hot, continuous, fast, or flexible
is secondary.

To me, the continuous activity is the significant feature of that backup
method. I chose "Continuous Backup" because it is the continual
activity that is significant. "Continuous Archiving"?

I like Peter's suggestion as-is; or "WAL Archiving". I don't like
applying adjectives that are not specific to the particular method.
That will just create more confusion down the road if we add another
feature that could also be called "continuous archiving".

regards, tom lane

#11

Andrew Dunstan

andrew@dunslane.net

over 20 years ago

In reply to: Bruce Momjian (#9)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

Greg Stark said:

Bruce Momjian <pgman@candle.pha.pa.us> writes:

I noticed that we are using the term "Online Backup" in our
documentation when we are talking about continuous backup and PITR.

To me, "online backup" is doing a backup while the system is online
(online-backup), and that is accomplished by pg_dump.

Well fwiw what pg_dump does isn't even considered a "backup" at all in
other systems. It's a "logical export" or something of that sort.

It's not considered a "backup" because it's saving something different
than the actual physical database. When you restore you get something
(hopefully) logically equivalent but still physically different.

This seems fairly arbitrary. On that basis anything on a higher level than
dd is not a backup method, ISTM.

cheers

andrew

#12

Bruce Momjian

bruce@momjian.us

over 20 years ago

In reply to: Andrew Dunstan (#11)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

"Andrew Dunstan" <andrew@dunslane.net> writes:

Greg Stark said:

Well fwiw what pg_dump does isn't even considered a "backup" at all in
other systems. It's a "logical export" or something of that sort.

It's not considered a "backup" because it's saving something different
than the actual physical database. When you restore you get something
(hopefully) logically equivalent but still physically different.

This seems fairly arbitrary. On that basis anything on a higher level than
dd is not a backup method, ISTM.

Well most of other options *aren't* any higher level than dd.

Even with PITR you're just storing the transaction logs byte for byte. When
you replay it works at slightly higher level but it's still far from
reinterpreting your data and you'll get (almost) exactly the same bytes back.

I'm not saying pg_dump is a bad thing, but now that Postgres has real backups
there isn't much of a use case for using pg_dump as a stand-in for backups. If
your system crashes and you need to bring up a new system quickly you want to
bring it up in exactly the state the one that crashed was in. You don't want
to pass all the data through a program that interprets the ascii
representation again and hope it comes up with the same data.

There are other reasons to need pg_dump of course. Just not for backups.

--
greg

#13

Aftab Alam

aalam@tatashare.com

over 20 years ago

In reply to: Tom Lane (#10)

hackersdocs

sending mail from Postgres

Hi there,

How can i send mail form postgresql.

any suggestion.

thanx & regards
aftab

#14

Jeremy Drake

jeremyd@apptechsys.com

over 20 years ago

In reply to: Aftab Alam (#13)

hackersdocs

Re: sending mail from Postgres

Code from perl cookbook, wrapped in plperlu wrapper

not very portable, Net::SMTP would probably be better, and result in more
portable perl code.

Didn't test this - but it is copied straight from perl cookbook via
google: http://www.unix.org.ua/orelly/perl/cookbook/ch18_04.htm

CREATE OR REPLACE FUNCTION send_email(from_address text, to_address text,
subject text, body text) RETURNS void AS $$

use Mail::Mailer;
my ($from_address, $to_address, $subject, $body) = @_;

my $mailer = Mail::Mailer->new("sendmail");
$mailer->open({ From => $from_address,
To => $to_address,
Subject => $subject,
})
or die "Can't open: $!";
print $mailer $body;
$mailer->close();

$$ LANGUAGE plperlu VOLATILE STRICT;

On Wed, 28 Dec 2005, Aftab Alam wrote:

Hi there,

How can i send mail form postgresql.

any suggestion.

thanx & regards
aftab

---------------------------(end of broadcast)---------------------------
TIP 6: explain analyze is your friend

--
Don't steal; thou'lt never thus compete successfully in business. Cheat.
-- Ambrose Bierce

#15

Chris Browne

cbbrowne@acm.org

over 20 years ago

In reply to: Bruce Momjian (#1)

hackersdocs

Re: Online backup vs Continuous backup

Bruce Momjian <pgman@candle.pha.pa.us> writes:

I suggest the following patch to rename our capability "Continuous
Backup".

This doesn't seem like an improvement. "Online backup" is the standard
terminology AFAIK.

There's something ambiguous about it, in view that there are two
useful points in time for a backup to be "as of," namely:
a) The start time (which is what pg_dump does), and
b) The end time (which I believe is not uncommon with other systems).

PITR is obviously more like b)...
--
wm(X,Y):-write(X),write('@'),write(Y). wm('cbbrowne','gmail.com').
http://linuxfinances.info/info/slony.html
Keeping instructions and operands in different memories saves .20
(.09) microseconds.

#16

Pavel Stehule

pavel.stehule@gmail.com

over 20 years ago

In reply to: Aftab Alam (#13)

hackersdocs

Re: sending mail from Postgres

Hello,

it depend on your possibilities. Simply, use PL/Perl or PL/sh.

Regards
Pavel Stehule

or use PgSendMail
http://sourceforge.net/project/showfiles.php?group_id=35804

Hi there,

How can i send mail form postgresql.

any suggestion.

thanx & regards
aftab

---------------------------(end of broadcast)---------------------------
TIP 6: explain analyze is your friend

_________________________________________________________________
Citite se osamele? Poznejte nekoho vyjmecneho diky Match.com.
http://www.msn.cz/

#17

David Fetter

david@fetter.org

over 20 years ago

In reply to: Aftab Alam (#13)

hackersdocs

Re: sending mail from Postgres

On Wed, Dec 28, 2005 at 08:52:41AM +0530, Aftab Alam wrote:

Hi there,

How can i send mail form postgresql.

Generally, it is a better idea either to poll your PostgreSQL database
and send mail based on that, or use LISTEN/NOTIFY, rather than
directly sending mail from your database.

Cheers,
D
--
David Fetter david@fetter.org http://fetter.org/
phone: +1 415 235 3778

Remember to vote!

#18

Simon Riggs

simon@2ndQuadrant.com

over 20 years ago

In reply to: Bruce Momjian (#3)

hackersdocs

Re: [HACKERS] Online backup vs Continuous backup

On Mon, 2005-12-26 at 13:46 -0500, Bruce Momjian wrote:

Tom Lane wrote:

Bruce Momjian <pgman@candle.pha.pa.us> writes:

I suggest the following patch to rename our capability "Continuous
Backup".

This doesn't seem like an improvement. "Online backup" is the standard
terminology AFAIK.

But why is it the standard terminology? It doesn't seem logical.

Well, as Greg says its a physical backup that can be done on-line, so
online backup makes perfect sense to me. I've never had somebody say
"that makes no sense" before. Nomenclature is different everywhere, I
accept.

I generally describe it like this:

Logical Backup
- use pg_dump - must be done on-line
Physical Backup
All file copy only
- must be Cold/Off-line backup
All file copy + WAL archiving
- allows Hot/Online or Cold/Offline backup

People understand those terms...

When do I mention PITR? Well, I describe this as Archive Recovery, with
an option to go to end-of-logs, or to a point-in-time.
[In the code, the mode variable is InArchiveRecovery.]

I do think that saying "do you use PITR?" makes little sense. We should
be talking about the backup mode, not the potential future recovery
mode.

I think it would all make more sense if we described the use of
archive_command = something as being in "WAL Archive Mode". That would
then allow us to say:
"You can only take Online Backups while in WAL Archive Mode".
"If you ever wish to perform PITR, you must use WAL Archive Mode".
"If you backed-up in WAL Archive Mode, you can perform an Archive
Recovery".

Best Regards, Simon Riggs

#19

Rick Gigger

rick@alpinenetworking.com

over 20 years ago

In reply to: Tom Lane (#2)

hackersdocs

Re: [DOCS] Online backup vs Continuous backup

How about:

use "Online backup" or "Hot backup" to refer to either method of back
since they are both done while the system is online or hot.

If you want to get specific refer to doing a "sql dump" etc for using
pg_dump
Then use "Incremental backup" to refer to the whole process of the
WAL archival etc
Refer to the actual log files themselves as transaction logs.

That all seems to be pretty intuitive and non-ambiguous non-confusing
to me.

On Dec 26, 2005, at 11:44 AM, Tom Lane wrote:

Show quoted text

Bruce Momjian <pgman@candle.pha.pa.us> writes:

I suggest the following patch to rename our capability "Continuous
Backup".

This doesn't seem like an improvement. "Online backup" is the
standard
terminology AFAIK.

regards, tom lane

---------------------------(end of
broadcast)---------------------------
TIP 4: Have you searched our list archives?

http://archives.postgresql.org

#20

Rick Gigger

rick@alpinenetworking.com

over 20 years ago

In reply to: Simon Riggs (#18)

hackersdocs

Re: [DOCS] Online backup vs Continuous backup

I think it would all make more sense if we described the use of
archive_command = something as being in "WAL Archive Mode". That would
then allow us to say:
"You can only take Online Backups while in WAL Archive Mode".
"If you ever wish to perform PITR, you must use WAL Archive Mode".
"If you backed-up in WAL Archive Mode, you can perform an Archive
Recovery".

It seems to me there are two different context in which one would be
making statements like this. And what we are "allowed to say"
depends greatly on context. These contexts are as follows:

1) Explaining the feature set of postgres to a potential user.
2) Explaining to an actual postgres user how to actually do something.

In the first case it makes the most sense to me to use industry
standard or very intuitive terminology to the extend that it exists.
ie (Transaction Logs vs. WAL). Incremental Backup and Point in Time
Recovery seem to be fairly commonly used and understood database
buzzwords for someone to investigate the feature set of an RDBMS.

In the second case it seems to me that the most important thing is
that you pick terminology that is consistent, unambiguous and clearly
defined. Log archival, PITR, etc are not point and click operations
like they are in say MS SQL Server. This gives us more flexibility
but it also requires a deeper understanding. If someone is unwilling
or unable to to learn whatever terminology you happen to come up with
then it seems to me they shouldn't even be attempting to set up one
of those features. At the same time if the terminology you uses
changes all the time (is not consistent), or if you can't figure out
what any of the terms mean (they are not clearly defined) or if you
use terms like "online backup" to mean both types of backup but then
use it once in a specific circumstance where only one usage is
appropriate (you are using the terms ambiguously) then users will be
confused and it will be your fault not theirs.

Just my 2 cents

Rick Gigger

#21

Bruce Momjian

bruce@momjian.us

over 20 years ago

In reply to: Rick Gigger (#19)

hackersdocs

#22

Peter Eisentraut

peter_e@gmx.net

over 20 years ago