Regarding WAL Format Changes
While reading patch-1 (
<http://archives.postgresql.org/pgsql-hackers/2012-06/txtFskHiYakjO.txt>
1-use-uint64-got-segno.patch) of WAL Format
Changes(http://archives.postgresql.org/message-id/4FDA5136.6080206@enterpris
edb.com), I had few observations which are summarized below:
1. Function header for following functions still contains referece to log,
seg
a. InstallXLogFileSegment()
b. RemoveOldXlogFiles()
c. XLogFileCopy()
d. XLogGetLastRemoved()
e. UpdateLastRemovedPtr()
f. RemoveOldXlogFiles()
2. @@ -2680,8 +2645,8 @@ InstallXLogFileSegment(uint32 *log, uint32 *seg,
char *tmppath,
LWLockRelease(ControlFileLock);
ereport(LOG,
(errcode_for_file_access(),
- errmsg("could not link file \"%s\" to
\"%s\" (initialization of log file %u, segment %u): %m",
- tmppath, path, *log,
*seg)));
+ errmsg("could not link file \"%s\" to
\"%s\" (initialization of log file): %m",
+ tmppath, path)));
If Changed error message can contain log file and segment number, it
would be more clear. That should be easily
deducible from segment number.
3. -RemoveOldXlogFiles(uint32 log, uint32 seg, XLogRecPtr endptr)
+RemoveOldXlogFiles(XLogSegNo segno, XLogRecPtr endptr)
.
.
.
@@ -4016,8 +3953,9 @@ retry:
if (!(((XLogPageHeader) readBuf)->xlp_info &
XLP_FIRST_IS_CONTRECORD))
{
ereport(emode_for_corrupt_record(emode,
*RecPtr),
- (errmsg("there is no
contrecord flag in log file %u, segment %u, offset %u",
- readId,
readSeg, readOff)));
+ (errmsg("there is no
contrecord flag in log segment %s, offset %u",
+
XLogFileNameP(curFileTLI, readSegNo),
+ readOff)));
goto next_record_is_invalid;
}
pageHeaderSize =
XLogPageHeaderSize((XLogPageHeader) readBuf);
@@ -4025,10 +3963,13 @@ retry:
if (contrecord->xl_rem_len == 0 ||
total_len != (contrecord->xl_rem_len +
gotlen))
{
+ char fname[MAXFNAMELEN];
+ XLogFileName(fname, curFileTLI, readSegNo);
ereport(emode_for_corrupt_record(emode,
*RecPtr),
- (errmsg("invalid contrecord
length %u in log file %u, segment %u, offset %u",
+ (errmsg("invalid contrecord
length %u in log segment %s, offset %u",
contrecord->xl_rem_len,
- readId,
readSeg, readOff)));
+
XLogFileNameP(curFileTLI, readSegNo),
+ readOff)));
goto next_record_is_invalid;
}
For the above 2 changed error messages, 'log segment' is used for
filename.
However all similar changes has 'log file' for filename. There are some
places
where 'log segment' is used and other places it is 'log file'.
So is there any particular reason for it?
4. @@ -533,33 +533,17 @@ pg_xlog_location_diff(PG_FUNCTION_ARGS)
- /*
- * Sanity check
- */
- if (loc1.xrecoff > XLogFileSize)
- ereport(ERROR,
- (errcode(ERRCODE_INVALID_PARAMETER_VALUE),
- errmsg("xrecoff \"%X\" is out of valid
range, 0..%X", loc1.xrecoff, XLogFileSize)));
- if (loc2.xrecoff > XLogFileSize)
- ereport(ERROR,
- (errcode(ERRCODE_INVALID_PARAMETER_VALUE),
- errmsg("xrecoff \"%X\" is out of valid
range, 0..%X", loc2.xrecoff, XLogFileSize)));
+ bytes1 = (((uint64)loc1.xlogid) << 32L) + loc1.xrecoff;
+ bytes2 = (((uint64)loc2.xlogid) << 32L) + loc2.xrecoff;
Is there no chance that it can be out of valid range after new changes,
just a doubt?
5.
--- a/src/backend/replication/walreceiver.c
+++ b/src/backend/replication/walreceiver.c
@@ -69,11 +69,12 @@ walrcv_disconnect_type walrcv_disconnect = NULL;
/*
* These variables are used similarly to openLogFile/Id/Seg/Off,
- * but for walreceiver to write the XLOG.
+ * but for walreceiver to write the XLOG. recvFileTLI is the TimeLineID
In the above comments, there is still reference to Id/Seg/Off.
With Regards,
Amit Kapila.
On 27.06.2012 17:14, Amit Kapila wrote:
1. Function header for following functions still contains referece to log,
seg
a. InstallXLogFileSegment()
b. RemoveOldXlogFiles()
c. XLogFileCopy()
d. XLogGetLastRemoved()
e. UpdateLastRemovedPtr()
f. RemoveOldXlogFiles()
Thanks, fixed.
2. @@ -2680,8 +2645,8 @@ InstallXLogFileSegment(uint32 *log, uint32 *seg, char *tmppath, LWLockRelease(ControlFileLock); ereport(LOG, (errcode_for_file_access(), - errmsg("could not link file \"%s\" to \"%s\" (initialization of log file %u, segment %u): %m", - tmppath, path, *log, *seg))); + errmsg("could not link file \"%s\" to \"%s\" (initialization of log file): %m", + tmppath, path))); If Changed error message can contain log file and segment number, it would be more clear. That should be easily deducible from segment number.
That seems redundant. The target file name is calculated from the
segment number, and we're now using the file name instead of log+seg in
other messages too.
3. -RemoveOldXlogFiles(uint32 log, uint32 seg, XLogRecPtr endptr) +RemoveOldXlogFiles(XLogSegNo segno, XLogRecPtr endptr) . . . @@ -4016,8 +3953,9 @@ retry: if (!(((XLogPageHeader) readBuf)->xlp_info& XLP_FIRST_IS_CONTRECORD)) { ereport(emode_for_corrupt_record(emode, *RecPtr), - (errmsg("there is no contrecord flag in log file %u, segment %u, offset %u", - readId, readSeg, readOff))); + (errmsg("there is no contrecord flag in log segment %s, offset %u", + XLogFileNameP(curFileTLI, readSegNo), + readOff)));goto next_record_is_invalid; } pageHeaderSize = XLogPageHeaderSize((XLogPageHeader) readBuf); @@ -4025,10 +3963,13 @@ retry: if (contrecord->xl_rem_len == 0 || total_len != (contrecord->xl_rem_len + gotlen)) { + char fname[MAXFNAMELEN]; + XLogFileName(fname, curFileTLI, readSegNo);ereport(emode_for_corrupt_record(emode, *RecPtr), - (errmsg("invalid contrecord length %u in log file %u, segment %u, offset %u", + (errmsg("invalid contrecord length %u in log segment %s, offset %u",contrecord->xl_rem_len, - readId, readSeg, readOff))); + XLogFileNameP(curFileTLI, readSegNo), + readOff)));goto next_record_is_invalid;
}For the above 2 changed error messages, 'log segment' is used for
filename.
However all similar changes has 'log file' for filename. There are some
places
where 'log segment' is used and other places it is 'log file'.
So is there any particular reason for it?
Not really. There are several messages that use "log file %s", and also
several places that use "log segment %s" Should we make it consistent
and use either "log segment" or "log file" everywhere?
4. @@ -533,33 +533,17 @@ pg_xlog_location_diff(PG_FUNCTION_ARGS) - /* - * Sanity check - */ - if (loc1.xrecoff> XLogFileSize) - ereport(ERROR, - (errcode(ERRCODE_INVALID_PARAMETER_VALUE), - errmsg("xrecoff \"%X\" is out of valid range, 0..%X", loc1.xrecoff, XLogFileSize))); - if (loc2.xrecoff> XLogFileSize) - ereport(ERROR, - (errcode(ERRCODE_INVALID_PARAMETER_VALUE), - errmsg("xrecoff \"%X\" is out of valid range, 0..%X", loc2.xrecoff, XLogFileSize))); + bytes1 = (((uint64)loc1.xlogid)<< 32L) + loc1.xrecoff; + bytes2 = (((uint64)loc2.xlogid)<< 32L) + loc2.xrecoff;Is there no chance that it can be out of valid range after new changes,
just a doubt?
No. Not in the sense it used to be, anyway, the XLogFileSize check is no
longer relevant. Perhaps we should check for InvalidXLogRecPtr or that
the pointer doesn't point e.g in the middle of a page header. But then
again, this calculation works fine with both of those cases, so I see no
reason to make it stricter.
5. --- a/src/backend/replication/walreceiver.c +++ b/src/backend/replication/walreceiver.c @@ -69,11 +69,12 @@ walrcv_disconnect_type walrcv_disconnect = NULL;/* * These variables are used similarly to openLogFile/Id/Seg/Off, - * but for walreceiver to write the XLOG. + * but for walreceiver to write the XLOG. recvFileTLI is the TimeLineIDIn the above comments, there is still reference to Id/Seg/Off.
Thanks, fixed.
--
Heikki Linnakangas
EnterpriseDB http://www.enterprisedb.com
Import Notes
Reply to msg id not found: 4feb15b2.4abd440a.0769.3a71SMTPIN_ADDED@mx.google.comReference msg id not found: 4feb15b2.4abd440a.0769.3a71SMTPIN_ADDED@mx.google.com | Resolved by subject fallback
Excerpts from Heikki Linnakangas's message of mié jun 27 10:56:13 -0400 2012:
On 27.06.2012 17:14, Amit Kapila wrote:
For the above 2 changed error messages, 'log segment' is used for
filename.
However all similar changes has 'log file' for filename. There are some
places
where 'log segment' is used and other places it is 'log file'.
So is there any particular reason for it?Not really. There are several messages that use "log file %s", and also
several places that use "log segment %s" Should we make it consistent
and use either "log segment" or "log file" everywhere?
I think it would be better to use "log segment" for WAL segments. That
way we don't cause confusion with the regular text/csv log output files.
Heck, maybe even "WAL segment" instead of "log".
As a translator, I can't have a single, clear explanation of what "log
file" is because there are multiple meanings. It would be better not to
depend on context.
--
Álvaro Herrera <alvherre@commandprompt.com>
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support
On Wed, Jun 27, 2012 at 11:56 PM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:
On 27.06.2012 17:14, Amit Kapila wrote:
1. Function header for following functions still contains referece to log,
seg
a. InstallXLogFileSegment()
b. RemoveOldXlogFiles()
c. XLogFileCopy()
d. XLogGetLastRemoved()
e. UpdateLastRemovedPtr()
f. RemoveOldXlogFiles()Thanks, fixed.
There is still reference to log/seg in the comment of InstallXLogFileSegment().
The attached patch should be applied?
Regards,
--
Fujii Masao
Attachments:
logseg2segno_v1.patchapplication/octet-stream; name=logseg2segno_v1.patchDownload
*** a/src/backend/access/transam/xlog.c
--- b/src/backend/access/transam/xlog.c
***************
*** 2588,2598 **** XLogFileCopy(XLogSegNo destsegno, TimeLineID srcTLI, XLogSegNo srcsegno)
*
* tmppath: initial name of file to install. It will be renamed into place.
*
! * find_free: if TRUE, install the new segment at the first empty log/seg
* number at or after the passed numbers. If FALSE, install the new segment
* exactly where specified, deleting any existing segment file there.
*
! * *max_advance: maximum number of log/seg slots to advance past the starting
* point. Fail if no free slot is found in this range. On return, reduced
* by the number of slots skipped over. (Irrelevant, and may be NULL,
* when find_free is FALSE.)
--- 2588,2598 ----
*
* tmppath: initial name of file to install. It will be renamed into place.
*
! * find_free: if TRUE, install the new segment at the first empty segno
* number at or after the passed numbers. If FALSE, install the new segment
* exactly where specified, deleting any existing segment file there.
*
! * *max_advance: maximum number of segno slots to advance past the starting
* point. Fail if no free slot is found in this range. On return, reduced
* by the number of slots skipped over. (Irrelevant, and may be NULL,
* when find_free is FALSE.)
On 27.06.2012 18:55, Fujii Masao wrote:
On Wed, Jun 27, 2012 at 11:56 PM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:On 27.06.2012 17:14, Amit Kapila wrote:
1. Function header for following functions still contains referece to log,
seg
a. InstallXLogFileSegment()
b. RemoveOldXlogFiles()
c. XLogFileCopy()
d. XLogGetLastRemoved()
e. UpdateLastRemovedPtr()
f. RemoveOldXlogFiles()Thanks, fixed.
There is still reference to log/seg in the comment of InstallXLogFileSegment().
The attached patch should be applied?
Thanks, applied. Sorry for being so sloppy with this..
--
Heikki Linnakangas
EnterpriseDB http://www.enterprisedb.com
Heikki Linnakangas <heikki.linnakangas@enterprisedb.com> writes:
So is there any particular reason for it?
Not really. There are several messages that use "log file %s", and also
several places that use "log segment %s" Should we make it consistent
and use either "log segment" or "log file" everywhere?
+1 for uniformity. I think I'd vote for using "file" and eliminating
the "segment" terminology altogether, but the other direction would be
okay too, and might require fewer changes.
IIRC, in the original coding "segment" meant 16MB worth of WAL while
"file" was sometimes used to denote 4GB worth (ie, the boundaries where
we had to increment the high half of the LSN struct). Now that 4GB
boundaries are not special, there's no reason to retain the "file"
concept or terminology.
regards, tom lane
On Thu, Jun 28, 2012 at 1:13 AM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:
On 27.06.2012 18:55, Fujii Masao wrote:
On Wed, Jun 27, 2012 at 11:56 PM, Heikki Linnakangas
<heikki.linnakangas@enterprisedb.com> wrote:On 27.06.2012 17:14, Amit Kapila wrote:
1. Function header for following functions still contains referece to
log,
seg
a. InstallXLogFileSegment()
b. RemoveOldXlogFiles()
c. XLogFileCopy()
d. XLogGetLastRemoved()
e. UpdateLastRemovedPtr()
f. RemoveOldXlogFiles()Thanks, fixed.
There is still reference to log/seg in the comment of
InstallXLogFileSegment().
The attached patch should be applied?Thanks, applied. Sorry for being so sloppy with this..
No problem. WAL format change you did is really nice!
Regards,
--
Fujii Masao
From: Heikki Linnakangas [mailto:heikki.linnakangas@enterprisedb.com]
Sent: Wednesday, June 27, 2012 8:26 PM
On 27.06.2012 17:14, Amit Kapila wrote:
2. @@ -2680,8 +2645,8 @@ InstallXLogFileSegment(uint32 *log, uint32 *seg, char *tmppath, LWLockRelease(ControlFileLock); ereport(LOG, (errcode_for_file_access(), - errmsg("could not link file \"%s\" to \"%s\" (initialization of log file %u, segment %u): %m", - tmppath, path, *log, *seg))); + errmsg("could not link file \"%s\" to \"%s\" (initialization of log file): %m", + tmppath, path))); If Changed error message can contain log file and segment number, it would be more clear. That should be easily deducible from segment number.
That seems redundant. The target file name is calculated from the
segment number, and we're now using the file name instead of log+seg in
other messages too.
errmsg("could not link file \"%s\" to \"%s\" (initialization of log file):
%m", + tmppath, path)));
In this if we try to get the meaning of second part of message
"(initialization of log file)", it was much
better previously as in this message it refers 2 files and previously it was
clear initialization of which log
file failed. So we can mention file name in second part of message
"(initialization of log file)" as well.
3. -RemoveOldXlogFiles(uint32 log, uint32 seg, XLogRecPtr endptr)
For the above 2 changed error messages, 'log segment' is used for
filename.
However all similar changes has 'log file' for filename. There are
some
places
where 'log segment' is used and other places it is 'log file'.
So is there any particular reason for it?
Not really. There are several messages that use "log file %s", and also
several places that use "log segment %s" Should we make it consistent
and use either "log segment" or "log file" everywhere?
'file' seems to be better option as some users may not be even aware of
segments, they would be using default values of segments and they can relate
to 'file' easily.
Also using 'WAL' instead of 'log' as suggested by Alvaro is good if others
also thinks same.
With Regards,
Amit Kapila.