Regression tests fail with tzdata 2024b

Started by Wolfgang Waltherover 1 year ago11 messageshackers
Jump to latest
#1Wolfgang Walther
walther@technowledgy.de

Building --with-system-tzdata and the latest tzdata 2024b fails the
regression tests for me (see attached .diffs). This seems to be because
of [1]https://github.com/eggert/tz/commit/a0b09c0230089252acf2eb0f1ba922e99f7f4a03, which changed the way "PST8PDT" is handled. This is the timezone
that the regression tests are run with.

From 2024b on, "PST8PDT" is the same as "America/Los_Angeles", so by
changing the regression tests to use the latter as the default, we're
getting consistent output on at least 2024a and 2024b.

Patch attached.

Best,

Wolfgang

[1]: https://github.com/eggert/tz/commit/a0b09c0230089252acf2eb0f1ba922e99f7f4a03
https://github.com/eggert/tz/commit/a0b09c0230089252acf2eb0f1ba922e99f7f4a03

Attachments:

regression.diffstext/plain; charset=UTF-8; name=regression.diffsDownload+90-90
v1-0001-Run-regression-tests-with-timezone-America-Los_An.patchtext/x-patch; charset=UTF-8; name=v1-0001-Run-regression-tests-with-timezone-America-Los_An.patchDownload+96-96
#2Tom Lane
tgl@sss.pgh.pa.us
In reply to: Wolfgang Walther (#1)
Re: Regression tests fail with tzdata 2024b

Wolfgang Walther <walther@technowledgy.de> writes:

Building --with-system-tzdata and the latest tzdata 2024b fails the
regression tests for me (see attached .diffs). This seems to be because
of [1], which changed the way "PST8PDT" is handled. This is the timezone
that the regression tests are run with.

That's quite annoying, especially since it was not mentioned in the
2024b release notes. (I had read the notes and concluded that 2024b
didn't require any immediate attention on our part.)

From 2024b on, "PST8PDT" is the same as "America/Los_Angeles", so by
changing the regression tests to use the latter as the default, we're
getting consistent output on at least 2024a and 2024b.

I'm fairly un-thrilled with this answer, not least because it exposes
that zone's idiosyncratic "LMT" offset of -7:52:58 for years before
1883. (I'm surprised that that seems to affect only one or two
regression results.) Also, as a real place to a greater extent
than "PST8PDT" is, it's more subject to historical revisionism when
somebody turns up evidence of local law having been different than
TZDB currently thinks.

We may not have a lot of choice though. I experimented with using
full POSIX notation, that is "PST8PDT,M3.2.0,M11.1.0", but that is
actually worse in terms of the number of test diffs, since it doesn't
match the DST transition dates that the tests expect for years before
2007. Another objection is that the tests would then not exercise any
of the mainstream tzdb-file-reading code paths within the timezone
code itself.

Grumble.

regards, tom lane

#3Wolfgang Walther
walther@technowledgy.de
In reply to: Tom Lane (#2)
Re: Regression tests fail with tzdata 2024b

Tom Lane:

Also, as a real place to a greater extent
than "PST8PDT" is, it's more subject to historical revisionism when
somebody turns up evidence of local law having been different than
TZDB currently thinks.

I now tried all versions of tzdata which we had in tree back to 2018g,
they all work fine with the same regression test output. 2018g was an
arbitrary cutoff, I just didn't try any further.

In the end, we don't need a default timezone that will never change. We
just need one that didn't change in a reasonable number of releases
going backwards. Once America/Los_Angeles is changed, we need to switch
to a different zone, which could be one that wouldn't work today. Kind
of a sliding window.

One positive might be: With this timezone, we are more likely to see
relevant changes mentioned in the upstream release notes.

Best,

Wolfgang

#4Tom Lane
tgl@sss.pgh.pa.us
In reply to: Wolfgang Walther (#3)
Re: Regression tests fail with tzdata 2024b

Wolfgang Walther <walther@technowledgy.de> writes:

Tom Lane:

Also, as a real place to a greater extent
than "PST8PDT" is, it's more subject to historical revisionism when
somebody turns up evidence of local law having been different than
TZDB currently thinks.

I now tried all versions of tzdata which we had in tree back to 2018g,
they all work fine with the same regression test output. 2018g was an
arbitrary cutoff, I just didn't try any further.

Yeah, my belly-aching above is just about hypothetical future
instability. In reality, I'm sure America/Los_Angeles is pretty
well researched and so it's unlikely that there will be unexpected
changes in its zone history.

In the end, we don't need a default timezone that will never change.

We do, really. For example, there's a nonzero chance the USA will
cancel DST altogether at some future time. (This would be driven by
politicians who don't remember the last time, but there's no shortage
of those.) That'd likely break some of our test results, and even if
it happened not to, it'd still be bad because we'd probably lose some
coverage of the DST-transition-related code paths in src/timezone/.
So I'd really be way happier with a test timezone that never changes
but does include DST behavior. I thought PST8PDT fit those
requirements pretty well, and I'm annoyed at Eggert for having tossed
it overboard for no benefit whatever. But I don't run tzdb, so
here we are.

We just need one that didn't change in a reasonable number of
releases going backwards.

We've had this sort of fire-drill before, e.g. commit 8d7af8fbe.
It's no fun, and the potential surface area for unexpected changes
is now much greater than the few tests affected by that change.

But here we are, so I pushed your patch with a couple of other
cosmetic bits. There are still a couple of references to PST8PDT
in the tree, but they don't appear to care what the actual meaning
of that zone is, so I left them be.

regards, tom lane

#5Sven Klemm
sven@timescale.com
In reply to: Tom Lane (#4)
Re: Regression tests fail with tzdata 2024b

On Mon, Sep 16, 2024 at 7:09 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:

Wolfgang Walther <walther@technowledgy.de> writes:

Tom Lane:

Also, as a real place to a greater extent
than "PST8PDT" is, it's more subject to historical revisionism when
somebody turns up evidence of local law having been different than
TZDB currently thinks.

I now tried all versions of tzdata which we had in tree back to 2018g,
they all work fine with the same regression test output. 2018g was an
arbitrary cutoff, I just didn't try any further.

Yeah, my belly-aching above is just about hypothetical future
instability. In reality, I'm sure America/Los_Angeles is pretty
well researched and so it's unlikely that there will be unexpected
changes in its zone history.

In the end, we don't need a default timezone that will never change.

We do, really. For example, there's a nonzero chance the USA will
cancel DST altogether at some future time. (This would be driven by
politicians who don't remember the last time, but there's no shortage
of those.) That'd likely break some of our test results, and even if
it happened not to, it'd still be bad because we'd probably lose some
coverage of the DST-transition-related code paths in src/timezone/.
So I'd really be way happier with a test timezone that never changes
but does include DST behavior. I thought PST8PDT fit those
requirements pretty well, and I'm annoyed at Eggert for having tossed
it overboard for no benefit whatever. But I don't run tzdb, so
here we are.

We just need one that didn't change in a reasonable number of
releases going backwards.

We've had this sort of fire-drill before, e.g. commit 8d7af8fbe.
It's no fun, and the potential surface area for unexpected changes
is now much greater than the few tests affected by that change.

But here we are, so I pushed your patch with a couple of other
cosmetic bits. There are still a couple of references to PST8PDT
in the tree, but they don't appear to care what the actual meaning
of that zone is, so I left them be.

This is an unfortunate change as this will break extensions tests using
pg_regress for testing. We run our tests against multiple minor versions
and this getting backported means our tests will fail with the next minor
pg release. Is there a workaround available to make the timezone for
pg_regress configurable without going into every test?

Regards, Sven Klemm

#6Tom Lane
tgl@sss.pgh.pa.us
In reply to: Sven Klemm (#5)
Re: Regression tests fail with tzdata 2024b

Sven Klemm <sven@timescale.com> writes:

This is an unfortunate change as this will break extensions tests using
pg_regress for testing. We run our tests against multiple minor versions
and this getting backported means our tests will fail with the next minor
pg release. Is there a workaround available to make the timezone for
pg_regress configurable without going into every test?

Configurable to what? If your test cases are dependent on the
historical behavior of PST8PDT, you're out of luck, because that
simply isn't available anymore (or won't be once 2024b reaches
your platform, anyway).

regards, tom lane

#7Sven Klemm
sven@timescale.com
In reply to: Tom Lane (#6)
Re: Regression tests fail with tzdata 2024b

On Mon, Sep 16, 2024 at 5:19 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:

Configurable to what? If your test cases are dependent on the
historical behavior of PST8PDT, you're out of luck, because that
simply isn't available anymore (or won't be once 2024b reaches
your platform, anyway).

I was wondering whether the timezone used by pg_regress could be made
configurable.

--
Regards, Sven Klemm

#8Tom Lane
tgl@sss.pgh.pa.us
In reply to: Sven Klemm (#7)
Re: Regression tests fail with tzdata 2024b

Sven Klemm <sven@timescale.com> writes:

On Mon, Sep 16, 2024 at 5:19 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:

Configurable to what? If your test cases are dependent on the
historical behavior of PST8PDT, you're out of luck, because that
simply isn't available anymore (or won't be once 2024b reaches
your platform, anyway).

I was wondering whether the timezone used by pg_regress could be made
configurable.

Yes, I understood that you were suggesting that. My point is that
it wouldn't do you any good: you will still have to change any
regression test cases that depend on behavior PST8PDT has/had that
is different from America/Los_Angeles. That being the case,
I don't see much value in making it configurable.

regards, tom lane

#9Wolfgang Walther
walther@technowledgy.de
In reply to: Tom Lane (#8)
Re: Regression tests fail with tzdata 2024b

Tom Lane:

I was wondering whether the timezone used by pg_regress could be made
configurable.

Yes, I understood that you were suggesting that. My point is that
it wouldn't do you any good: you will still have to change any
regression test cases that depend on behavior PST8PDT has/had that
is different from America/Los_Angeles. That being the case,
I don't see much value in making it configurable.

Just changing it back to PST8PDT wouldn't really help as Tom pointed
out. You'd still get different results depending on which tzdata version
you are running with.

The core regression tests need to be run with a timezone that tests
special cases in the timezone handling code. But that might not be true
for extensions - all they want could be a stable output across major and
minor versions of postgres and versions of tzdata. It could be helpful
to set pg_regress' timezone to UTC, for example?

Best,

Wolfgang

#10Tom Lane
tgl@sss.pgh.pa.us
In reply to: Wolfgang Walther (#9)
Re: Regression tests fail with tzdata 2024b

Wolfgang Walther <walther@technowledgy.de> writes:

The core regression tests need to be run with a timezone that tests
special cases in the timezone handling code. But that might not be true
for extensions - all they want could be a stable output across major and
minor versions of postgres and versions of tzdata. It could be helpful
to set pg_regress' timezone to UTC, for example?

I would not recommend that choice. It would mask simple errors such
as failing to apply the correct conversion between timestamptz and
timestamp values. Also, if you have test cases that are affected by
this issue at all, you probably have a need/desire to test things like
DST transitions.

regards, tom lane

#11Sven Klemm
sven@timescale.com
In reply to: Tom Lane (#10)
Re: Regression tests fail with tzdata 2024b

On Tue, Sep 17, 2024 at 4:15 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:

Wolfgang Walther <walther@technowledgy.de> writes:

The core regression tests need to be run with a timezone that tests
special cases in the timezone handling code. But that might not be true
for extensions - all they want could be a stable output across major and
minor versions of postgres and versions of tzdata. It could be helpful
to set pg_regress' timezone to UTC, for example?

I would not recommend that choice. It would mask simple errors such
as failing to apply the correct conversion between timestamptz and
timestamp values. Also, if you have test cases that are affected by
this issue at all, you probably have a need/desire to test things like
DST transitions.

As far as I'm aware timescaledb does not rely on specifics of tzdata
version but just needs a stable setting for timezone. I guess I'll adjust
our tests to not depend on upstream pg_regress timezone.

--
Regards, Sven Klemm