Regression test fails when BLCKSZ is 1kB
I compiled postgreSQL with 1kB block size and regresion test fails. Main problem
is that output is correct but in different order. See attachment.
I think affected test should contain order by keyword.
Any comments?
Zdenek
Attachments:
regression.diffstext/plain; name=regression.diffsDownload
*** ./expected/join.out Wed Jan 9 21:42:28 2008
--- ./results/join.out Mon Apr 21 13:50:53 2008
***************
*** 214,226 ****
WHERE t1.a = t2.d;
xxx | a | e
-----+---+----
- | 0 |
| 1 | -1
| 2 | 2
- | 2 | 4
| 3 | -3
| 5 | -5
| 5 | -5
(7 rows)
--
--- 214,226 ----
WHERE t1.a = t2.d;
xxx | a | e
-----+---+----
| 1 | -1
| 2 | 2
| 3 | -3
+ | 2 | 4
| 5 | -5
| 5 | -5
+ | 0 |
(7 rows)
--
***************
*** 1569,1581 ****
FROM J1_TBL INNER JOIN J2_TBL USING (i);
xxx | i | j | t | k
-----+---+---+-------+----
- | 0 | | zero |
| 1 | 4 | one | -1
| 2 | 3 | two | 2
- | 2 | 3 | two | 4
| 3 | 2 | three | -3
| 5 | 0 | five | -5
| 5 | 0 | five | -5
(7 rows)
-- Same as above, slightly different syntax
--- 1569,1581 ----
FROM J1_TBL INNER JOIN J2_TBL USING (i);
xxx | i | j | t | k
-----+---+---+-------+----
| 1 | 4 | one | -1
| 2 | 3 | two | 2
| 3 | 2 | three | -3
+ | 2 | 3 | two | 4
| 5 | 0 | five | -5
| 5 | 0 | five | -5
+ | 0 | | zero |
(7 rows)
-- Same as above, slightly different syntax
***************
*** 1583,1595 ****
FROM J1_TBL JOIN J2_TBL USING (i);
xxx | i | j | t | k
-----+---+---+-------+----
- | 0 | | zero |
| 1 | 4 | one | -1
| 2 | 3 | two | 2
- | 2 | 3 | two | 4
| 3 | 2 | three | -3
| 5 | 0 | five | -5
| 5 | 0 | five | -5
(7 rows)
SELECT '' AS "xxx", *
--- 1583,1595 ----
FROM J1_TBL JOIN J2_TBL USING (i);
xxx | i | j | t | k
-----+---+---+-------+----
| 1 | 4 | one | -1
| 2 | 3 | two | 2
| 3 | 2 | three | -3
+ | 2 | 3 | two | 4
| 5 | 0 | five | -5
| 5 | 0 | five | -5
+ | 0 | | zero |
(7 rows)
SELECT '' AS "xxx", *
***************
*** 1625,1637 ****
FROM J1_TBL NATURAL JOIN J2_TBL;
xxx | i | j | t | k
-----+---+---+-------+----
- | 0 | | zero |
| 1 | 4 | one | -1
| 2 | 3 | two | 2
- | 2 | 3 | two | 4
| 3 | 2 | three | -3
| 5 | 0 | five | -5
| 5 | 0 | five | -5
(7 rows)
SELECT '' AS "xxx", *
--- 1625,1637 ----
FROM J1_TBL NATURAL JOIN J2_TBL;
xxx | i | j | t | k
-----+---+---+-------+----
| 1 | 4 | one | -1
| 2 | 3 | two | 2
| 3 | 2 | three | -3
+ | 2 | 3 | two | 4
| 5 | 0 | five | -5
| 5 | 0 | five | -5
+ | 0 | | zero |
(7 rows)
SELECT '' AS "xxx", *
***************
*** 1638,1650 ****
FROM J1_TBL t1 (a, b, c) NATURAL JOIN J2_TBL t2 (a, d);
xxx | a | b | c | d
-----+---+---+-------+----
- | 0 | | zero |
| 1 | 4 | one | -1
| 2 | 3 | two | 2
- | 2 | 3 | two | 4
| 3 | 2 | three | -3
| 5 | 0 | five | -5
| 5 | 0 | five | -5
(7 rows)
SELECT '' AS "xxx", *
--- 1638,1650 ----
FROM J1_TBL t1 (a, b, c) NATURAL JOIN J2_TBL t2 (a, d);
xxx | a | b | c | d
-----+---+---+-------+----
| 1 | 4 | one | -1
| 2 | 3 | two | 2
| 3 | 2 | three | -3
+ | 2 | 3 | two | 4
| 5 | 0 | five | -5
| 5 | 0 | five | -5
+ | 0 | | zero |
(7 rows)
SELECT '' AS "xxx", *
***************
*** 1651,1659 ****
FROM J1_TBL t1 (a, b, c) NATURAL JOIN J2_TBL t2 (d, a);
xxx | a | b | c | d
-----+---+---+------+---
- | 0 | | zero |
| 2 | 3 | two | 2
| 4 | 1 | four | 2
(3 rows)
-- mismatch number of columns
--- 1651,1659 ----
FROM J1_TBL t1 (a, b, c) NATURAL JOIN J2_TBL t2 (d, a);
xxx | a | b | c | d
-----+---+---+------+---
| 2 | 3 | two | 2
| 4 | 1 | four | 2
+ | 0 | | zero |
(3 rows)
-- mismatch number of columns
***************
*** 1662,1674 ****
FROM J1_TBL t1 (a, b) NATURAL JOIN J2_TBL t2 (a);
xxx | a | b | t | k
-----+---+---+-------+----
- | 0 | | zero |
| 1 | 4 | one | -1
| 2 | 3 | two | 2
- | 2 | 3 | two | 4
| 3 | 2 | three | -3
| 5 | 0 | five | -5
| 5 | 0 | five | -5
(7 rows)
--
--- 1662,1674 ----
FROM J1_TBL t1 (a, b) NATURAL JOIN J2_TBL t2 (a);
xxx | a | b | t | k
-----+---+---+-------+----
| 1 | 4 | one | -1
| 2 | 3 | two | 2
| 3 | 2 | three | -3
+ | 2 | 3 | two | 4
| 5 | 0 | five | -5
| 5 | 0 | five | -5
+ | 0 | | zero |
(7 rows)
--
***************
*** 1678,1690 ****
FROM J1_TBL JOIN J2_TBL ON (J1_TBL.i = J2_TBL.i);
xxx | i | j | t | i | k
-----+---+---+-------+---+----
- | 0 | | zero | 0 |
| 1 | 4 | one | 1 | -1
| 2 | 3 | two | 2 | 2
- | 2 | 3 | two | 2 | 4
| 3 | 2 | three | 3 | -3
| 5 | 0 | five | 5 | -5
| 5 | 0 | five | 5 | -5
(7 rows)
SELECT '' AS "xxx", *
--- 1678,1690 ----
FROM J1_TBL JOIN J2_TBL ON (J1_TBL.i = J2_TBL.i);
xxx | i | j | t | i | k
-----+---+---+-------+---+----
| 1 | 4 | one | 1 | -1
| 2 | 3 | two | 2 | 2
| 3 | 2 | three | 3 | -3
+ | 2 | 3 | two | 2 | 4
| 5 | 0 | five | 5 | -5
| 5 | 0 | five | 5 | -5
+ | 0 | | zero | 0 |
(7 rows)
SELECT '' AS "xxx", *
***************
*** 1691,1699 ****
FROM J1_TBL JOIN J2_TBL ON (J1_TBL.i = J2_TBL.k);
xxx | i | j | t | i | k
-----+---+---+------+---+---
- | 0 | | zero | | 0
| 2 | 3 | two | 2 | 2
| 4 | 1 | four | 2 | 4
(3 rows)
--
--- 1691,1699 ----
FROM J1_TBL JOIN J2_TBL ON (J1_TBL.i = J2_TBL.k);
xxx | i | j | t | i | k
-----+---+---+------+---+---
| 2 | 3 | two | 2 | 2
| 4 | 1 | four | 2 | 4
+ | 0 | | zero | | 0
(3 rows)
--
***************
*** 1762,1774 ****
FROM J1_TBL RIGHT OUTER JOIN J2_TBL USING (i);
xxx | i | j | t | k
-----+---+---+-------+----
- | 0 | | zero |
| 1 | 4 | one | -1
| 2 | 3 | two | 2
- | 2 | 3 | two | 4
| 3 | 2 | three | -3
| 5 | 0 | five | -5
| 5 | 0 | five | -5
| | | |
| | | | 0
(9 rows)
--- 1762,1774 ----
FROM J1_TBL RIGHT OUTER JOIN J2_TBL USING (i);
xxx | i | j | t | k
-----+---+---+-------+----
| 1 | 4 | one | -1
| 2 | 3 | two | 2
| 3 | 2 | three | -3
+ | 2 | 3 | two | 4
| 5 | 0 | five | -5
| 5 | 0 | five | -5
+ | 0 | | zero |
| | | |
| | | | 0
(9 rows)
***************
*** 1777,1789 ****
FROM J1_TBL RIGHT JOIN J2_TBL USING (i);
xxx | i | j | t | k
-----+---+---+-------+----
- | 0 | | zero |
| 1 | 4 | one | -1
| 2 | 3 | two | 2
- | 2 | 3 | two | 4
| 3 | 2 | three | -3
| 5 | 0 | five | -5
| 5 | 0 | five | -5
| | | |
| | | | 0
(9 rows)
--- 1777,1789 ----
FROM J1_TBL RIGHT JOIN J2_TBL USING (i);
xxx | i | j | t | k
-----+---+---+-------+----
| 1 | 4 | one | -1
| 2 | 3 | two | 2
| 3 | 2 | three | -3
+ | 2 | 3 | two | 4
| 5 | 0 | five | -5
| 5 | 0 | five | -5
+ | 0 | | zero |
| | | |
| | | | 0
(9 rows)
======================================================================
*** ./expected/arrays.out Thu Mar 20 22:42:48 2008
--- ./results/arrays.out Mon Apr 21 13:50:52 2008
***************
*** 139,146 ****
a | b | c
---------------+-----------------------+-------------------
{16,25,3,4,5} | {{{113,142},{1,147}}} | {}
- {16,25,23} | {{3,4},{4,5}} | {foobar,new_word}
[4:4]={NULL} | {3,4} | {foo,new_word}
(3 rows)
--
--- 139,146 ----
a | b | c
---------------+-----------------------+-------------------
{16,25,3,4,5} | {{{113,142},{1,147}}} | {}
[4:4]={NULL} | {3,4} | {foo,new_word}
+ {16,25,23} | {{3,4},{4,5}} | {foobar,new_word}
(3 rows)
--
======================================================================
*** ./expected/misc.out Mon Apr 21 13:50:12 2008
--- ./results/misc.out Mon Apr 21 13:50:55 2008
***************
*** 356,361 ****
--- 356,362 ----
e | 18 | | |
e | | hi elisa | |
e | | | -4 |
+ f | 25 | | -9 |
f | 19 | hi claire | -5 |
f | 20 | hi mike | -6 |
f | 21 | hi marcel | |
***************
*** 362,368 ****
f | 22 | | -7 |
f | | hi keith | -8 |
f | 24 | hi marc | |
- f | 25 | | -9 |
f | 26 | | |
f | | hi allison | -10 |
f | | hi jeff | |
--- 363,368 ----
***************
*** 416,421 ****
--- 416,422 ----
e | 18 |
e | |
e | |
+ f | 25 |
f | 19 |
f | 20 |
f | 21 |
***************
*** 422,428 ****
f | 22 |
f | |
f | 24 |
- f | 25 |
f | 26 |
f | |
f | |
--- 423,428 ----
======================================================================
*** ./expected/polymorphism.out Fri Jan 11 19:39:41 2008
--- ./results/polymorphism.out Mon Apr 21 13:51:03 2008
***************
*** 355,362 ****
select f3, myaggp01a(*) from t group by f3;
f3 | myaggp01a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 355,362 ----
select f3, myaggp01a(*) from t group by f3;
f3 | myaggp01a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 363,370 ****
select f3, myaggp03a(*) from t group by f3;
f3 | myaggp03a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 363,370 ----
select f3, myaggp03a(*) from t group by f3;
f3 | myaggp03a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 371,378 ****
select f3, myaggp03b(*) from t group by f3;
f3 | myaggp03b
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 371,378 ----
select f3, myaggp03b(*) from t group by f3;
f3 | myaggp03b
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 379,386 ****
select f3, myaggp05a(f1) from t group by f3;
f3 | myaggp05a
----+-----------
- b | {1,2,3}
c | {1,2}
a | {1,2,3}
(3 rows)
--- 379,386 ----
select f3, myaggp05a(f1) from t group by f3;
f3 | myaggp05a
----+-----------
c | {1,2}
+ b | {1,2,3}
a | {1,2,3}
(3 rows)
***************
*** 387,394 ****
select f3, myaggp06a(f1) from t group by f3;
f3 | myaggp06a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 387,394 ----
select f3, myaggp06a(f1) from t group by f3;
f3 | myaggp06a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 395,402 ****
select f3, myaggp08a(f1) from t group by f3;
f3 | myaggp08a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 395,402 ----
select f3, myaggp08a(f1) from t group by f3;
f3 | myaggp08a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 403,410 ****
select f3, myaggp09a(f1) from t group by f3;
f3 | myaggp09a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 403,410 ----
select f3, myaggp09a(f1) from t group by f3;
f3 | myaggp09a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 411,418 ****
select f3, myaggp09b(f1) from t group by f3;
f3 | myaggp09b
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 411,418 ----
select f3, myaggp09b(f1) from t group by f3;
f3 | myaggp09b
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 419,426 ****
select f3, myaggp10a(f1) from t group by f3;
f3 | myaggp10a
----+-----------
- b | {1,2,3}
c | {1,2}
a | {1,2,3}
(3 rows)
--- 419,426 ----
select f3, myaggp10a(f1) from t group by f3;
f3 | myaggp10a
----+-----------
c | {1,2}
+ b | {1,2,3}
a | {1,2,3}
(3 rows)
***************
*** 427,434 ****
select f3, myaggp10b(f1) from t group by f3;
f3 | myaggp10b
----+-----------
- b | {1,2,3}
c | {1,2}
a | {1,2,3}
(3 rows)
--- 427,434 ----
select f3, myaggp10b(f1) from t group by f3;
f3 | myaggp10b
----+-----------
c | {1,2}
+ b | {1,2,3}
a | {1,2,3}
(3 rows)
***************
*** 435,442 ****
select f3, myaggp20a(f1) from t group by f3;
f3 | myaggp20a
----+-----------
- b | {1,2,3}
c | {1,2}
a | {1,2,3}
(3 rows)
--- 435,442 ----
select f3, myaggp20a(f1) from t group by f3;
f3 | myaggp20a
----+-----------
c | {1,2}
+ b | {1,2,3}
a | {1,2,3}
(3 rows)
***************
*** 443,450 ****
select f3, myaggp20b(f1) from t group by f3;
f3 | myaggp20b
----+-----------
- b | {1,2,3}
c | {1,2}
a | {1,2,3}
(3 rows)
--- 443,450 ----
select f3, myaggp20b(f1) from t group by f3;
f3 | myaggp20b
----+-----------
c | {1,2}
+ b | {1,2,3}
a | {1,2,3}
(3 rows)
***************
*** 451,458 ****
select f3, myaggn01a(*) from t group by f3;
f3 | myaggn01a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 451,458 ----
select f3, myaggn01a(*) from t group by f3;
f3 | myaggn01a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 459,466 ****
select f3, myaggn01b(*) from t group by f3;
f3 | myaggn01b
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 459,466 ----
select f3, myaggn01b(*) from t group by f3;
f3 | myaggn01b
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 467,474 ****
select f3, myaggn03a(*) from t group by f3;
f3 | myaggn03a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 467,474 ----
select f3, myaggn03a(*) from t group by f3;
f3 | myaggn03a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 475,482 ****
select f3, myaggn05a(f1) from t group by f3;
f3 | myaggn05a
----+-----------
- b | {1,2,3}
c | {1,2}
a | {1,2,3}
(3 rows)
--- 475,482 ----
select f3, myaggn05a(f1) from t group by f3;
f3 | myaggn05a
----+-----------
c | {1,2}
+ b | {1,2,3}
a | {1,2,3}
(3 rows)
***************
*** 483,490 ****
select f3, myaggn05b(f1) from t group by f3;
f3 | myaggn05b
----+-----------
- b | {1,2,3}
c | {1,2}
a | {1,2,3}
(3 rows)
--- 483,490 ----
select f3, myaggn05b(f1) from t group by f3;
f3 | myaggn05b
----+-----------
c | {1,2}
+ b | {1,2,3}
a | {1,2,3}
(3 rows)
***************
*** 491,498 ****
select f3, myaggn06a(f1) from t group by f3;
f3 | myaggn06a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 491,498 ----
select f3, myaggn06a(f1) from t group by f3;
f3 | myaggn06a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 499,506 ****
select f3, myaggn06b(f1) from t group by f3;
f3 | myaggn06b
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 499,506 ----
select f3, myaggn06b(f1) from t group by f3;
f3 | myaggn06b
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 507,514 ****
select f3, myaggn08a(f1) from t group by f3;
f3 | myaggn08a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 507,514 ----
select f3, myaggn08a(f1) from t group by f3;
f3 | myaggn08a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 515,522 ****
select f3, myaggn08b(f1) from t group by f3;
f3 | myaggn08b
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 515,522 ----
select f3, myaggn08b(f1) from t group by f3;
f3 | myaggn08b
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 523,530 ****
select f3, myaggn09a(f1) from t group by f3;
f3 | myaggn09a
----+-----------
- b | {}
c | {}
a | {}
(3 rows)
--- 523,530 ----
select f3, myaggn09a(f1) from t group by f3;
f3 | myaggn09a
----+-----------
c | {}
+ b | {}
a | {}
(3 rows)
***************
*** 531,538 ****
select f3, myaggn10a(f1) from t group by f3;
f3 | myaggn10a
----+-----------
- b | {1,2,3}
c | {1,2}
a | {1,2,3}
(3 rows)
--- 531,538 ----
select f3, myaggn10a(f1) from t group by f3;
f3 | myaggn10a
----+-----------
c | {1,2}
+ b | {1,2,3}
a | {1,2,3}
(3 rows)
======================================================================
Am Montag, 21. April 2008 schrieb Zdenek Kotala:
I compiled postgreSQL with 1kB block size and regresion test fails. Main
problem is that output is correct but in different order. See attachment.
This was previously reported:
http://archives.postgresql.org/pgsql-hackers/2006-11/msg00901.php
I think affected test should contain order by keyword.
For previously established reasons, we don't want to add ORDER BY clauses to
every test that might fail under exceptional circumstances so we test all
plan types equally. I think very small block sizes are fairly exceptional,
unless you have a reason up your sleeve why they are a good idea.
On Mon, Apr 21, 2008 at 5:55 PM, Peter Eisentraut <peter_e@gmx.net> wrote:
For previously established reasons, we don't want to add ORDER BY clauses to
every test that might fail under exceptional circumstances so we test all
plan types equally. I think very small block sizes are fairly exceptional,
unless you have a reason up your sleeve why they are a good idea.
Now that we have autovacuum on by default, we might get into random
failures because of re-ordering. Though I don't seem to recall anybody
complaining yet, it could just be that we are lucky or our regression
suite don't have long enough running tests to give autovacuum chance
to recycle some of the dead tuples.
Thanks,
Pavan
--
Pavan Deolasee
EnterpriseDB http://www.enterprisedb.com
"Pavan Deolasee" <pavan.deolasee@gmail.com> writes:
Now that we have autovacuum on by default, we might get into random
failures because of re-ordering. Though I don't seem to recall anybody
complaining yet, it could just be that we are lucky or our regression
suite don't have long enough running tests to give autovacuum chance
to recycle some of the dead tuples.
No, the reason you don't see that is that plain VACUUM doesn't move
tuples around.
regards, tom lane
Peter Eisentraut napsal(a):
Am Montag, 21. April 2008 schrieb Zdenek Kotala:
I compiled postgreSQL with 1kB block size and regresion test fails. Main
problem is that output is correct but in different order. See attachment.This was previously reported:
http://archives.postgresql.org/pgsql-hackers/2006-11/msg00901.phpI think affected test should contain order by keyword.
For previously established reasons, we don't want to add ORDER BY clauses to
every test that might fail under exceptional circumstances so we test all
plan types equally. I think very small block sizes are fairly exceptional,
unless you have a reason up your sleeve why they are a good idea.
I'm only testing behavior with different block size and I think it is not good
idea to support only 8kB for regtest. When 4kB is used then PG fails in Join
regresion test and with 16kB, 32kB it fails because:
*** ./expected/bitmapops.out Fri Apr 11 00:25:26 2008
--- ./results/bitmapops.out Mon Apr 21 15:30:18 2008
***************
*** 20,25 ****
--- 20,26 ----
set enable_seqscan=false;
-- Lower work_mem to trigger use of lossy bitmaps
set work_mem = 64;
+ ERROR: 64 is outside the valid range for parameter "work_mem" (256 .. 2097151)
-- Test bitmap-and.
SELECT count(*) FROM bmscantest WHERE a = 1 AND b = 1;
count
Zdenek
Am Montag, 21. April 2008 schrieb Zdenek Kotala:
I'm only testing behavior with different block size and I think it is not
good idea to support only 8kB for regtest. When 4kB is used then PG fails
in Join regresion test and with 16kB, 32kB it fails because:*** ./expected/bitmapops.out Fri Apr 11 00:25:26 2008 --- ./results/bitmapops.out Mon Apr 21 15:30:18 2008 *************** *** 20,25 **** --- 20,26 ---- set enable_seqscan=false; -- Lower work_mem to trigger use of lossy bitmaps set work_mem = 64; + ERROR: 64 is outside the valid range for parameter "work_mem" (256 .. 2097151) -- Test bitmap-and. SELECT count(*) FROM bmscantest WHERE a = 1 AND b = 1; count
This should probably be fixed by using a unit specification on work_mem. Do
you want to prepare a patch?
On Mon, Apr 21, 2008 at 02:25:31PM +0200, Peter Eisentraut wrote:
I think affected test should contain order by keyword.
For previously established reasons, we don't want to add ORDER BY clauses to
every test that might fail under exceptional circumstances so we test all
plan types equally. I think very small block sizes are fairly exceptional,
unless you have a reason up your sleeve why they are a good idea.
I wonder if it would be feasable to, whenever a regression test fails
to sort both files and compare again. This should tell you if the
difference are *only* rearrangement automatically, without having to
eyeball the output.
Have a nice day,
--
Martijn van Oosterhout <kleptog@svana.org> http://svana.org/kleptog/
Show quoted text
Please line up in a tree and maintain the heap invariant while
boarding. Thank you for flying nlogn airlines.
Am Montag, 21. April 2008 schrieb Martijn van Oosterhout:
I wonder if it would be feasable to, whenever a regression test fails
to sort both files and compare again. This should tell you if the
difference are *only* rearrangement automatically, without having to
eyeball the output.
That sounds like it should be worth a try.
Peter Eisentraut wrote:
Am Montag, 21. April 2008 schrieb Martijn van Oosterhout:
I wonder if it would be feasable to, whenever a regression test fails
to sort both files and compare again. This should tell you if the
difference are *only* rearrangement automatically, without having to
eyeball the output.That sounds like it should be worth a try.
I think we need first to identify cases where we don't care that much
about output order. Teaching pg-regress the new check shouldn't be very
hard.
cheers
andrew
Peter Eisentraut <peter_e@gmx.net> writes:
Am Montag, 21. April 2008 schrieb Martijn van Oosterhout:
I wonder if it would be feasable to, whenever a regression test fails
to sort both files and compare again. This should tell you if the
difference are *only* rearrangement automatically, without having to
eyeball the output.
That sounds like it should be worth a try.
That sounds like a pretty bad idea, since it would treat ordering
differences as insignificant even when they aren't --- for example,
an ordering difference in the output of a query that *has* an
ORDER BY is usually a bug.
regards, tom lane
On Mon, Apr 21, 2008 at 8:20 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
No, the reason you don't see that is that plain VACUUM doesn't move
tuples around.
I know. But plain VACUUM can free up dead space which can be used for
subsequent updates/inserts and that can cause reordering. For example:
Case 1.
Insert 100 records --- goes into block 1 .. 10
Delete 100 records
Insert 100 more records --- goes into 11 .. 20
Case 2.
Insert 100 records --- goes into block 1 .. 10
Delete 100 records
*Autovacuum triggers*
Insert 100 more records -- goes into block 1 .. 10
Thanks,
Pavan
--
Pavan Deolasee
EnterpriseDB http://www.enterprisedb.com
On Mon, Apr 21, 2008 at 10:54 PM, Pavan Deolasee
<pavan.deolasee@gmail.com> wrote:
Case 1.
Insert 100 records --- goes into block 1 .. 10
Delete 100 records
Insert 100 more records --- goes into 11 .. 20Case 2.
Insert 100 records --- goes into block 1 .. 10
Delete 100 records
*Autovacuum triggers*
Insert 100 more records -- goes into block 1 .. 10
Its probably not a very neat example because in this simplistic case,
the ordering would still be same, but we can easily construct a
slightly complex example to prove the point.
Thanks,
Pavan
--
Pavan Deolasee
EnterpriseDB http://www.enterprisedb.com
Peter Eisentraut <peter_e@gmx.net> writes:
Am Montag, 21. April 2008 schrieb Zdenek Kotala:
set work_mem = 64;
+ ERROR: 64 is outside the valid range for parameter "work_mem" (256 ..
2097151) -- Test bitmap-and.
This should probably be fixed by using a unit specification on work_mem. Do
you want to prepare a patch?
The problem is that guc.c enforces a lower limit of 8*BLCKSZ on the
work_mem setting. Unless we add an explicit unit specifier for "blocks"
to GUC's vocabulary, there doesn't seem to be any way to name that value
in the SET command. And it's not entirely clear that the SET would
still have the desired effect for this test, anyway, if it were getting
translated to 256K or more.
Another possible answer is to change the minimum to be just 64K always.
I'm not certain that it's really sensible to tie the minimum work_mem to
BLCKSZ --- I don't think we do anything where work_mem is controlling a
pool of page buffers, do we?
regards, tom lane
Am Montag, 21. April 2008 schrieb Tom Lane:
That sounds like a pretty bad idea, since it would treat ordering
differences as insignificant even when they aren't --- for example,
an ordering difference in the output of a query that *has* an
ORDER BY is usually a bug.
Well, we wouldn't treat ordering differences as OK, but we could print
foo ... FAILED (only ordering differences)
which might give a clue.
Then again, the effort to make this bulletproof might be more than continuing
to field the occasional question about the issue.
Peter Eisentraut napsal(a):
Am Montag, 21. April 2008 schrieb Tom Lane:
That sounds like a pretty bad idea, since it would treat ordering
differences as insignificant even when they aren't --- for example,
an ordering difference in the output of a query that *has* an
ORDER BY is usually a bug.Well, we wouldn't treat ordering differences as OK, but we could print
foo ... FAILED (only ordering differences)
which might give a clue.
When you are able detect ordering difference you are able also check if it is
important for the test or not without any extra effort. Only what we need is put
some flag to test that order is not important.
Then again, the effort to make this bulletproof might be more than continuing
to field the occasional question about the issue.
Regression test MUST BE bulletproof. If you get a error you must know that it is
really error (in postgresql or regtest) and must be fixed. When you start to
ignore some errors because it can happen sometimes you fall in the trap soon.
Zdenek
Andrew Dunstan napsal(a):
Peter Eisentraut wrote:
Am Montag, 21. April 2008 schrieb Martijn van Oosterhout:
I wonder if it would be feasable to, whenever a regression test fails
to sort both files and compare again. This should tell you if the
difference are *only* rearrangement automatically, without having to
eyeball the output.That sounds like it should be worth a try.
I think we need first to identify cases where we don't care that much
about output order. Teaching pg-regress the new check shouldn't be very
hard.
It seems to me only ORDER BY clauses must return sort order. Or are there any
other cases?
Zdenek
Tom Lane napsal(a):
Another possible answer is to change the minimum to be just 64K always.
I'm not certain that it's really sensible to tie the minimum work_mem to
BLCKSZ --- I don't think we do anything where work_mem is controlling a
pool of page buffers, do we?
Yeah, I try to find all usage and it seems everything is related to tuplestore,
Bitmap or Hash join. I think we can set 64K set limit without any problem.
By the way is any reason to have work_mem * 1024 "everywhere" when we have unit
support in GUC?
Zdenek
On Tue, Apr 22, 2008 at 10:31:53AM +0200, Zdenek Kotala wrote:
When you are able detect ordering difference you are able also check if it
is important for the test or not without any extra effort. Only what we
need is put some flag to test that order is not important.
Not true. Sorting the file is going jumble all the results together.
Since we perform many tests in one file, you're not going to be able to
seperate them.
Regression test MUST BE bulletproof. If you get a error you must know that
it is really error (in postgresql or regtest) and must be fixed. When you
start to ignore some errors because it can happen sometimes you fall in the
trap soon.
I think people are misunderstanding. You posted a bunch of diffs with
that comment that they *appeared* to only be ordering differences. How
good did you check? If an 8 become a 9 chances are you'd miss it.
Having a second test checking the sorted results would at least
preclude the chance that there really is something wrong.
It was a guide, not a way of getting out of tests.
Have a nice day,
--
Martijn van Oosterhout <kleptog@svana.org> http://svana.org/kleptog/
Show quoted text
Please line up in a tree and maintain the heap invariant while
boarding. Thank you for flying nlogn airlines.
Martijn van Oosterhout napsal(a):
On Tue, Apr 22, 2008 at 10:31:53AM +0200, Zdenek Kotala wrote:
When you are able detect ordering difference you are able also check if it
is important for the test or not without any extra effort. Only what we
need is put some flag to test that order is not important.Not true. Sorting the file is going jumble all the results together.
Since we perform many tests in one file, you're not going to be able to
seperate them.
Each statement result must be sort separately, otherwise it should hide
problems. For example one statement return A instead of B and second returns B
instead of A. When sort will be used on whole file then it will be reported as a
ordering problem.
Show quoted text
Regression test MUST BE bulletproof. If you get a error you must know that
it is really error (in postgresql or regtest) and must be fixed. When you
start to ignore some errors because it can happen sometimes you fall in the
trap soon.I think people are misunderstanding. You posted a bunch of diffs with
that comment that they *appeared* to only be ordering differences. How
good did you check? If an 8 become a 9 chances are you'd miss it.
Having a second test checking the sorted results would at least
preclude the chance that there really is something wrong.It was a guide, not a way of getting out of tests.
Have a nice day,
On Tue, Apr 22, 2008 at 4:25 PM, Martijn van Oosterhout <kleptog@svana.org>
wrote:
On Tue, Apr 22, 2008 at 10:31:53AM +0200, Zdenek Kotala wrote:
When you are able detect ordering difference you are able also check if
it
is important for the test or not without any extra effort. Only what we
need is put some flag to test that order is not important.Not true. Sorting the file is going jumble all the results together.
Since we perform many tests in one file, you're not going to be able to
seperate them.Regression test MUST BE bulletproof. If you get a error you must know
that
it is really error (in postgresql or regtest) and must be fixed. When
you
start to ignore some errors because it can happen sometimes you fall in
the
trap soon.
I think people are misunderstanding. You posted a bunch of diffs with
that comment that they *appeared* to only be ordering differences. How
good did you check? If an 8 become a 9 chances are you'd miss it.
Having a second test checking the sorted results would at least
preclude the chance that there really is something wrong.It was a guide, not a way of getting out of tests.
In the past, I had faced and tried to work on this exact problem... here's
what I had in mind:
in the .expected file, we would demarcate the section of lines we expect to
come in any order, by using two special markers. Then, when comparing the
actual output with expected output, we would take the demarcated group of
lines, and the corresponding lines from actual output, and compare them
after sorting.
For eg.
foo.expected:
select * from tenk where col1 <= 3 limit 3;
col1 | col2 | col3
-------------------------
?unsorted_result_start
1 | 10 | 100
2 | 20 | 200
3 | 30 | 300
?unsorted_result_end
foo.out:
select * from tenk where col1 <= 3 limit 3;
col1 | col2 | col3
-------------------------
3 | 30 | 300
2 | 20 | 200
1 | 10 | 100
So, the diff program should discard the lines beginning with '?' (meta
character), and then sort and match exactly the same number of lines.
There's another option of putting these '?' lines in a separate file
with corresponding begin/end line numbers of the unsorted group, and using
this as a parameter to the diffing program.
Of course, this needs a change in the (standard) diff that we use from
pg_regress!
Best regards,
--
gurjeet[.singh]@EnterpriseDB.com
singh.gurjeet@{ gmail | hotmail | indiatimes | yahoo }.com
EnterpriseDB http://www.enterprisedb.com
Mail sent from my BlackLaptop device
I was wandering if there where any intentions of moving some of the currently internal numeric function prototypes to the Numeric header file. I believe such functions like cmp_numerics, add_var, etc would come in handy when writing user defined C code.
Would there be many ill effects by doing this?
What should I look out for if I try it?
_________________________________________________________________
You dream job is up for grabs. Grab it.
http://mycareer.com.au/?s_cid=596065
Aaron Spiteri wrote:
------------------------------------------------------------------------
I was wandering if there where any intentions of moving some of
the currently internal numeric function prototypes to the Numeric
header file. I believe such functions like cmp_numerics, add_var,
etc would come in handy when writing user defined C code.Would there be many ill effects by doing this?
What should I look out for if I try it?
Umm ... you mean like this in builtins.h?
extern Datum numeric_cmp(PG_FUNCTION_ARGS);
cheers
andrew
"Aaron Spiteri" <azzmosphere@hotmail.com> writes:
I was wandering if there where any intentions of moving some of the
currently internal numeric function prototypes to the Numeric header file. I
believe such functions like cmp_numerics, add_var, etc would come in handy
when writing user defined C code.
All the var stuff is internal to the numeric data type operators. What kind of
user defined code are you writing?
--
Gregory Stark
EnterpriseDB http://www.enterprisedb.com
Ask me about EnterpriseDB's 24x7 Postgres support!
Zdenek Kotala <Zdenek.Kotala@Sun.COM> writes:
Regression test MUST BE bulletproof.
I'm sorry, but this is not, never has been, and never will be an
iron-clad project rule. When you get a failure you are supposed
to inspect it to see if it's a problem.
regards, tom lane
Zdenek Kotala <Zdenek.Kotala@Sun.COM> writes:
By the way is any reason to have work_mem * 1024 "everywhere" when we have unit
support in GUC?
Well, would you like to be able to set work_mem higher than 4GB on large
machines?
regards, tom lane
Tom Lane napsal(a):
Zdenek Kotala <Zdenek.Kotala@Sun.COM> writes:
Regression test MUST BE bulletproof.
I'm sorry, but this is not, never has been, and never will be an
iron-clad project rule. When you get a failure you are supposed
to inspect it to see if it's a problem.
Yes, but when you find it you should fix or report a problem in postgresql or
regression test. Keep it untouched is not good idea. It is what I meant. Try to
look on Apache Derby unit/regression tests.
Zdenek
Tom Lane napsal(a):
Zdenek Kotala <Zdenek.Kotala@Sun.COM> writes:
By the way is any reason to have work_mem * 1024 "everywhere" when we have unit
support in GUC?Well, would you like to be able to set work_mem higher than 4GB on large
machines?
I see, another int64 issues.
Thanks Zdenek
Zdenek Kotala <Zdenek.Kotala@Sun.COM> writes:
Tom Lane napsal(a):
Zdenek Kotala <Zdenek.Kotala@Sun.COM> writes:
Regression test MUST BE bulletproof.
I'm sorry, but this is not, never has been, and never will be an
iron-clad project rule. When you get a failure you are supposed
to inspect it to see if it's a problem.
Yes, but when you find it you should fix or report a problem in postgresql or
regression test. Keep it untouched is not good idea.
There are times when it's the most effective answer, though. I'm not
prepared to invest very large amounts of effort to fix corner-case
regression test problems, if those problems don't represent any actual
bug in the software. There are more productive ways to spend our time.
This is especially true if a proposed fix has negative consequences
beyond just the time to implement it --- reducing the scope of test
coverage or making the tests run longer would count as negative
consequences IMHO.
regards, tom lane
On Apr 21, 2008, at 7:25 AM, Peter Eisentraut wrote:
Am Montag, 21. April 2008 schrieb Zdenek Kotala:
I compiled postgreSQL with 1kB block size and regresion test
fails. Main
problem is that output is correct but in different order. See
attachment.This was previously reported:
http://archives.postgresql.org/pgsql-hackers/2006-11/msg00901.phpI think affected test should contain order by keyword.
For previously established reasons, we don't want to add ORDER BY
clauses to
every test that might fail under exceptional circumstances so we
test all
plan types equally. I think very small block sizes are fairly
exceptional,
unless you have a reason up your sleeve why they are a good idea.
What if we used the OFFSET 0 trick to force the ordering to occur
outside of what we're testing? IE:
SELECT * FROM (query we're testing OFFSET 0) ORDER BY blah;
--
Decibel!, aka Jim C. Nasby, Database Architect decibel@decibel.org
Give your computer some brain candy! www.distributed.net Team #1828
Attachments:
I wrote:
Another possible answer is to change the minimum to be just 64K always.
I'm not certain that it's really sensible to tie the minimum work_mem to
BLCKSZ --- I don't think we do anything where work_mem is controlling a
pool of page buffers, do we?
I've committed this change in HEAD. There's no desire to back-patch it
is there?
regards, tom lane
Tom Lane napsal(a):
I wrote:
Another possible answer is to change the minimum to be just 64K always.
I'm not certain that it's really sensible to tie the minimum work_mem to
BLCKSZ --- I don't think we do anything where work_mem is controlling a
pool of page buffers, do we?I've committed this change in HEAD. There's no desire to back-patch it
is there?
Back patch is not necessary. It is important for head, because
--with-blocksize=BLOCKSIZE allows to run buildfarm with different blocksize and
I think users will start to play more with this option.
Zdenek