Buildfarm: Enabling injection points on basilisk/dogfish (Alpine / musl)

Started by Wolfgang Walther9 months ago6 messages
#1Wolfgang Walther
walther@technowledgy.de

I recently enabled more features on my two buildfarm animals basilisk
and dogfish, which are running on Alpine with musl-libc in a docker
container.

--with-libnuma and --with-liburing seemed to work fine and have been
enabled for the last few runs, but --enable-injection-points does not [1]https://github.com/technowledgy/postgresql-buildfarm-alpine/pull/100.

With injection points enabled, I get the following errors in test_aio:

[15:14:45.408](0.000s) not ok 187 - worker: first hard IO error is
reported: expected stderr
[15:14:45.409](0.000s)
[15:14:45.409](0.000s) #   Failed test 'worker: first hard IO error is
reported: expected stderr'
#   at t/001_aio.pl line 810.
[15:14:45.409](0.000s) #                   'psql:<stdin>:88: ERROR: 
could not read blocks 2..2 in file "base/5/16408": I/O error'
#     doesn't match '(?^:ERROR:.*could not read blocks 2\.\.2 in file
\"base/.*\": Input/output error)'

5 more tests in that suite fail with similar errors. Full logs in the
artifact downloads of the CI run [2]https://github.com/technowledgy/postgresql-buildfarm-alpine/actions/runs/14420687470?pr=100.

It seems like it's just the error message that is different and has
"I/O" instead of "Input/output"?

On a more general note, does enabling injection points make any sense
here? I see that coverage in the build farm is not very big. IIUC, those
are a development tool, so might not be relevant, because nobody is
developing on Alpine / musl?

Best,

Wolfgang

[1]: https://github.com/technowledgy/postgresql-buildfarm-alpine/pull/100

[2]: https://github.com/technowledgy/postgresql-buildfarm-alpine/actions/runs/14420687470?pr=100
https://github.com/technowledgy/postgresql-buildfarm-alpine/actions/runs/14420687470?pr=100

#2Noah Misch
noah@leadboat.com
In reply to: Wolfgang Walther (#1)
Re: Buildfarm: Enabling injection points on basilisk/dogfish (Alpine / musl)

On Sat, Apr 12, 2025 at 07:51:06PM +0200, Wolfgang Walther wrote:

With injection points enabled, I get the following errors in test_aio:

[15:14:45.408](0.000s) not ok 187 - worker: first hard IO error is reported:
expected stderr
[15:14:45.409](0.000s)
[15:14:45.409](0.000s) #�� Failed test 'worker: first hard IO error is
reported: expected stderr'
#�� at t/001_aio.pl line 810.
[15:14:45.409](0.000s) #������������������ 'psql:<stdin>:88: ERROR:� could
not read blocks 2..2 in file "base/5/16408": I/O error'
#���� doesn't match '(?^:ERROR:.*could not read blocks 2\.\.2 in file
\"base/.*\": Input/output error)'

It seems like it's just the error message that is different and has "I/O"
instead of "Input/output"?

Looks like it.

On a more general note, does enabling injection points make any sense here?

Yes, it does.

I see that coverage in the build farm is not very big. IIUC, those are a
development tool, so might not be relevant, because nobody is developing on
Alpine / musl?

No, whether anyone develops on the platform is not a factor. One hasn't fully
tested PostgreSQL until one builds with injection points.

#3Andrew Dunstan
andrew@dunslane.net
In reply to: Noah Misch (#2)
1 attachment(s)
Re: Buildfarm: Enabling injection points on basilisk/dogfish (Alpine / musl)

On 2025-04-12 Sa 10:10 PM, Noah Misch wrote:

On Sat, Apr 12, 2025 at 07:51:06PM +0200, Wolfgang Walther wrote:

With injection points enabled, I get the following errors in test_aio:

[15:14:45.408](0.000s) not ok 187 - worker: first hard IO error is reported:
expected stderr
[15:14:45.409](0.000s)
[15:14:45.409](0.000s) #   Failed test 'worker: first hard IO error is
reported: expected stderr'
#   at t/001_aio.pl line 810.
[15:14:45.409](0.000s) #                   'psql:<stdin>:88: ERROR:  could
not read blocks 2..2 in file "base/5/16408": I/O error'
#     doesn't match '(?^:ERROR:.*could not read blocks 2\.\.2 in file
\"base/.*\": Input/output error)'
It seems like it's just the error message that is different and has "I/O"
instead of "Input/output"?

Looks like it.

On a more general note, does enabling injection points make any sense here?

Yes, it does.

I see that coverage in the build farm is not very big. IIUC, those are a
development tool, so might not be relevant, because nobody is developing on
Alpine / musl?

No, whether anyone develops on the platform is not a factor. One hasn't fully
tested PostgreSQL until one builds with injection points.

Here's a simple fix ... also removes some unnecessary escaping and
leaning toothpick syndrome.

cheers

andrew

--
Andrew Dunstan
EDB: https://www.enterprisedb.com

Attachments:

aio-regex-fix.patchtext/x-patch; charset=UTF-8; name=aio-regex-fix.patchDownload
diff --git a/src/test/modules/test_aio/t/001_aio.pl b/src/test/modules/test_aio/t/001_aio.pl
index c136d8ee8f5..ef4e5247e5b 100644
--- a/src/test/modules/test_aio/t/001_aio.pl
+++ b/src/test/modules/test_aio/t/001_aio.pl
@@ -813,7 +813,7 @@ SELECT invalidate_rel_block('tbl_ok', 2);
 		"first hard IO error is reported",
 		qq(SELECT count(*) FROM tbl_ok),
 		qr/^$/,
-		qr/ERROR:.*could not read blocks 2\.\.2 in file \"base\/.*\": Input\/output error/
+		qr!ERROR:.*could not read blocks 2\.\.2 in file "base/.*": (?:I/O|Input/output) error!
 	);
 
 	psql_like(
@@ -822,7 +822,7 @@ SELECT invalidate_rel_block('tbl_ok', 2);
 		"second hard IO error is reported",
 		qq(SELECT count(*) FROM tbl_ok),
 		qr/^$/,
-		qr/ERROR:.*could not read blocks 2\.\.2 in file \"base\/.*\": Input\/output error/
+		qr!ERROR:.*could not read blocks 2\.\.2 in file "base/.*": (?:I/O|Input/output) error!
 	);
 
 	$psql->query_safe(qq(SELECT inj_io_short_read_detach()));
#4Wolfgang Walther
walther@technowledgy.de
In reply to: Andrew Dunstan (#3)
Re: Buildfarm: Enabling injection points on basilisk/dogfish (Alpine / musl)

Andrew Dunstan:

On 2025-04-12 Sa 10:10 PM, Noah Misch wrote:

On Sat, Apr 12, 2025 at 07:51:06PM +0200, Wolfgang Walther wrote:

With injection points enabled, I get the following errors in test_aio:

[15:14:45.408](0.000s) not ok 187 - worker: first hard IO error is
reported:
expected stderr
[15:14:45.409](0.000s)
[15:14:45.409](0.000s) #   Failed test 'worker: first hard IO error is
reported: expected stderr'
#   at t/001_aio.pl line 810.
[15:14:45.409](0.000s) # 'psql:<stdin>:88: ERROR:  could
not read blocks 2..2 in file "base/5/16408": I/O error'
#     doesn't match '(?^:ERROR:.*could not read blocks 2\.\.2 in file
\"base/.*\": Input/output error)'
It seems like it's just the error message that is different and has
"I/O"
instead of "Input/output"?

Looks like it.

On a more general note, does enabling injection points make any
sense here?

Yes, it does.

I see that coverage in the build farm is not very big. IIUC, those
are a
development tool, so might not be relevant, because nobody is
developing on
Alpine / musl?

No, whether anyone develops on the platform is not a factor. One
hasn't fully
tested PostgreSQL until one builds with injection points.

Here's a simple fix ... also removes some unnecessary escaping and
leaning toothpick syndrome.

Confirmed - this works!

Thanks,

Wolfgang

#5Andres Freund
andres@anarazel.de
In reply to: Wolfgang Walther (#4)
Re: Buildfarm: Enabling injection points on basilisk/dogfish (Alpine / musl)

Hi,

On April 13, 2025 7:27:33 PM GMT+02:00, Wolfgang Walther <walther@technowledgy.de> wrote:

Andrew Dunstan:

On 2025-04-12 Sa 10:10 PM, Noah Misch wrote:

On Sat, Apr 12, 2025 at 07:51:06PM +0200, Wolfgang Walther wrote:

With injection points enabled, I get the following errors in test_aio:

[15:14:45.408](0.000s) not ok 187 - worker: first hard IO error is reported:
expected stderr
[15:14:45.409](0.000s)
[15:14:45.409](0.000s) #   Failed test 'worker: first hard IO error is
reported: expected stderr'
#   at t/001_aio.pl line 810.
[15:14:45.409](0.000s) # 'psql:<stdin>:88: ERROR:  could
not read blocks 2..2 in file "base/5/16408": I/O error'
#     doesn't match '(?^:ERROR:.*could not read blocks 2\.\.2 in file
\"base/.*\": Input/output error)'
It seems like it's just the error message that is different and has "I/O"
instead of "Input/output"?

Looks like it.

On a more general note, does enabling injection points make any sense here?

Yes, it does.

I see that coverage in the build farm is not very big. IIUC, those are a
development tool, so might not be relevant, because nobody is developing on
Alpine / musl?

No, whether anyone develops on the platform is not a factor. One hasn't fully
tested PostgreSQL until one builds with injection points.

Here's a simple fix ... also removes some unnecessary escaping and leaning toothpick syndrome.

Confirmed - this works!

Thanks for testing and writing up a fix. Andrew, would you be ok applying it? I've been traveling the last 24h and should probably not handing sharp commit bits tonight.

I'm not too surprised about failures like this, when writing the tests up I was worried about different formulations. But after seeing freebsd, glibc Linux, netbsd, openbsd windows all working the same I thought we were in the clear.

Greetings,

Andres
--
Sent from my Android device with K-9 Mail. Please excuse my brevity.

#6Andrew Dunstan
andrew@dunslane.net
In reply to: Andres Freund (#5)
Re: Buildfarm: Enabling injection points on basilisk/dogfish (Alpine / musl)

On 2025-04-13 Su 1:51 PM, Andres Freund wrote:

Hi,

On April 13, 2025 7:27:33 PM GMT+02:00, Wolfgang Walther <walther@technowledgy.de> wrote:

Andrew Dunstan:

On 2025-04-12 Sa 10:10 PM, Noah Misch wrote:

On Sat, Apr 12, 2025 at 07:51:06PM +0200, Wolfgang Walther wrote:

With injection points enabled, I get the following errors in test_aio:

[15:14:45.408](0.000s) not ok 187 - worker: first hard IO error is reported:
expected stderr
[15:14:45.409](0.000s)
[15:14:45.409](0.000s) #   Failed test 'worker: first hard IO error is
reported: expected stderr'
#   at t/001_aio.pl line 810.
[15:14:45.409](0.000s) # 'psql:<stdin>:88: ERROR:  could
not read blocks 2..2 in file "base/5/16408": I/O error'
#     doesn't match '(?^:ERROR:.*could not read blocks 2\.\.2 in file
\"base/.*\": Input/output error)'
It seems like it's just the error message that is different and has "I/O"
instead of "Input/output"?

Looks like it.

On a more general note, does enabling injection points make any sense here?

Yes, it does.

I see that coverage in the build farm is not very big. IIUC, those are a
development tool, so might not be relevant, because nobody is developing on
Alpine / musl?

No, whether anyone develops on the platform is not a factor. One hasn't fully
tested PostgreSQL until one builds with injection points.

Here's a simple fix ... also removes some unnecessary escaping and leaning toothpick syndrome.

Confirmed - this works!

Thanks for testing and writing up a fix. Andrew, would you be ok applying it? I've been traveling the last 24h and should probably not handing sharp commit bits tonight.

I'm not too surprised about failures like this, when writing the tests up I was worried about different formulations. But after seeing freebsd, glibc Linux, netbsd, openbsd windows all working the same I thought we were in the clear.

pushed.

cheers

andrew

--
Andrew Dunstan
EDB: https://www.enterprisedb.com