Improper use about DatumGetInt32

ashutosh.bapat@enterprisedb.com

over 5 years ago

In reply to: Tom Lane (#7)

Re: Improper use about DatumGetInt32

On Wed, Sep 23, 2020 at 1:41 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:

Robert Haas <robertmhaas@gmail.com> writes:

On Mon, Sep 21, 2020 at 3:53 PM Andres Freund <andres@anarazel.de> wrote:

I think we mostly use it for the few places where we currently expose
data as a signed integer on the SQL level, but internally actually treat
it as unsigned data.

So why is the right solution to that not DatumGetInt32() + a cast to uint32?

You're ignoring the xid use-case, for which DatumGetUInt32 actually is
the right thing.

There is DatumGetTransactionId(), which should be used instead.
That made me check whether there's a PG_GETARG_TRANSACTIONID(), and yes, it's
there, but it is defined only in xid.c. So pg_xact_commit_timestamp(),
pg_xact_commit_timestamp_origin() and pg_get_multixact_members() use
PG_GETARG_UINT32. IMO those should be changed to use
PG_GETARG_TRANSACTIONID. That would require moving
PG_GETARG_TRANSACTIONID somewhere outside xid.c; maybe fmgr.h, where the
other PG_GETARG_* macros are.

I tend to agree though that if the SQL argument is
of a signed type, the least API-abusing answer is a signed DatumGetXXX
macro followed by whatever cast you need.

I looked for uses of PG_GETARG_UINT32(), which is the counterpart
of DatumGetUInt32(), and found some buggy usages apart from the ones
listed above that can be converted to PG_GETARG_TRANSACTIONID.
normal_rand(), for example, returns a huge number of rows and takes
forever if we pass a negative first argument to it. Someone could
misuse that for a DoS attack, or someone could simply pass a negative
value by accident and the query would take forever.
explain analyze select count(*) from normal_rand(-1000000, 1.0, 1.0);
                                                               QUERY PLAN
-----------------------------------------------------------------------------------------------------------------------------------------
 Aggregate  (cost=12.50..12.51 rows=1 width=8) (actual time=2077574.718..2077574.719 rows=1 loops=1)
   ->  Function Scan on normal_rand  (cost=0.00..10.00 rows=1000 width=0) (actual time=1005176.149..1729994.366 rows=4293967296 loops=1)
 Planning Time: 0.346 ms
 Execution Time: 2079034.835 ms

get_raw_page() does a similar thing, but the effect is not as dangerous:
SELECT octet_length(get_raw_page('test1', 'main', -1)) AS main_1;
ERROR: block number 4294967295 is out of range for relation "test1"
The same applies to bt_page_stats() and bt_page_items().
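
The row count and the block number reported above both follow from reading a signed 32-bit value as unsigned. A minimal standalone sketch of that arithmetic, using plain C types rather than the PostgreSQL macros (read_as_uint32 is a hypothetical name used only for illustration):

```c
#include <stdint.h>

/*
 * Hypothetical helper: reinterpret a signed 32-bit SQL integer as unsigned,
 * which is the effect of fetching an int4 argument through an unsigned
 * accessor. The conversion is well defined in C: the value is reduced
 * modulo 2^32.
 */
static uint32_t
read_as_uint32(int32_t sql_value)
{
	return (uint32_t) sql_value;
}
```

With this, read_as_uint32(-1000000) is 4294967296 - 1000000 = 4293967296, the row count in the plan, and read_as_uint32(-1) is 4294967295, the block number in the error message.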

PFA patches to correct those.

There's also the Oracle-compatible chr(), which uses PG_GETARG_UINT32() but
(accidentally?) reports negative inputs correctly, because
it filters out very large values and reports them using %d. It's
arguable whether we should change that, so I have left it untouched.
But I think we should change that as well and get rid of
PG_GETARG_UINT32 altogether. This will prevent any future misuse.

--
Best Wishes,
Ashutosh Bapat

alvherre@2ndquadrant.com

over 5 years ago

In reply to: Ashutosh Bapat (#8)

Re: Improper use about DatumGetInt32

On 2020-Sep-23, Ashutosh Bapat wrote:

You're ignoring the xid use-case, for which DatumGetUInt32 actually is
the right thing.

There is DatumGetTransactionId(), which should be used instead.
That made me check whether there's a PG_GETARG_TRANSACTIONID(), and yes, it's
there, but it is defined only in xid.c. So pg_xact_commit_timestamp(),
pg_xact_commit_timestamp_origin() and pg_get_multixact_members() use
PG_GETARG_UINT32. IMO those should be changed to use
PG_GETARG_TRANSACTIONID. That would require moving
PG_GETARG_TRANSACTIONID somewhere outside xid.c; maybe fmgr.h, where the
other PG_GETARG_* macros are.

Hmm, yeah, I think this would be a good idea.

get_raw_page() does a similar thing, but the effect is not as dangerous:
SELECT octet_length(get_raw_page('test1', 'main', -1)) AS main_1;
ERROR: block number 4294967295 is out of range for relation "test1"
The same applies to bt_page_stats() and bt_page_items().

Hmm, but page numbers above signed INT_MAX are valid. So this would
prevent reading all legitimate pages past that.
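
To make the objection concrete, here is a small standalone sketch. The BlockNumber typedef and MaxBlockNumber constant are local stand-ins mirroring the PostgreSQL definitions (a 32-bit unsigned block number with 0xFFFFFFFF reserved as invalid), and block_reachable_via_int4 is a hypothetical helper, not an existing function:

```c
#include <stdbool.h>
#include <stdint.h>

/* Local stand-ins for illustration only. */
typedef uint32_t BlockNumber;
#define MaxBlockNumber ((BlockNumber) 0xFFFFFFFE)

/*
 * Hypothetical helper: can this block be named by a non-negative int4
 * argument? Anything above INT32_MAX cannot, even though it is a valid
 * block number.
 */
static bool
block_reachable_via_int4(BlockNumber blkno)
{
	return blkno <= (BlockNumber) INT32_MAX;
}
```

Under these assumptions, every block from 2147483648 up to MaxBlockNumber is valid but unreachable if the argument is treated as a strictly signed 32-bit integer.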

#10

ashutosh.bapat@enterprisedb.com

over 5 years ago

In reply to: Alvaro Herrera (#9)

Re: Improper use about DatumGetInt32

On Fri, 16 Oct 2020 at 19:26, Alvaro Herrera <alvherre@alvh.no-ip.org> wrote:

On 2020-Sep-23, Ashutosh Bapat wrote:

You're ignoring the xid use-case, for which DatumGetUInt32 actually is
the right thing.

There is DatumGetTransactionId(), which should be used instead.
That made me check whether there's a PG_GETARG_TRANSACTIONID(), and yes, it's
there, but it is defined only in xid.c. So pg_xact_commit_timestamp(),
pg_xact_commit_timestamp_origin() and pg_get_multixact_members() use
PG_GETARG_UINT32. IMO those should be changed to use
PG_GETARG_TRANSACTIONID. That would require moving
PG_GETARG_TRANSACTIONID somewhere outside xid.c; maybe fmgr.h, where the
other PG_GETARG_* macros are.

Hmm, yeah, I think this would be a good idea.

The patch 0003 does that.

get_raw_page() does a similar thing, but the effect is not as dangerous:
SELECT octet_length(get_raw_page('test1', 'main', -1)) AS main_1;
ERROR: block number 4294967295 is out of range for relation "test1"
The same applies to bt_page_stats() and bt_page_items().

Hmm, but page numbers above signed INT_MAX are valid. So this would
prevent reading all legitimate pages past that.

According to https://www.postgresql.org/docs/12/datatype-numeric.html,
these functions shouldn't accept values higher than INT_MAX, since such
values are outside the integer data type's range. But maybe it's a convenient
way to avoid using bigint. Anyway, those changes are separate, in the 0002
patch, which can be discarded as a whole. For now I am keeping it in the set.

--
Best Wishes,
Ashutosh

#11

peter_e@gmx.net

over 5 years ago

In reply to: Ashutosh Bapat (#10)

Re: Improper use about DatumGetInt32

I have committed 0003.

For 0001, normal_rand(), I think you should reject negative arguments
with an error.

For 0002, I think you should change the block number arguments to int8,
same as other contrib modules do.
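
The first suggestion, rejecting a negative count before it is ever treated as unsigned, might look roughly like the following standalone sketch. check_row_count is a hypothetical name, and returning false stands in for raising an error in real backend code:

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical validator: read the argument as signed, reject negative
 * values, and only then convert to the unsigned count used internally.
 * Returns false where real code would report an error; *count is left
 * untouched in that case.
 */
static bool
check_row_count(int32_t requested, uint32_t *count)
{
	if (requested < 0)
		return false;
	*count = (uint32_t) requested;
	return true;
}
```

The point of the ordering is that the sign test happens on the signed value, so -1000000 is refused instead of becoming 4293967296.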

#12

Anastasia Lubennikova

a.lubennikova@postgrespro.ru

over 5 years ago

In reply to: Peter Eisentraut (#11)

Re: Improper use about DatumGetInt32

On 02.11.2020 18:59, Peter Eisentraut wrote:

I have committed 0003.

For 0001, normal_rand(), I think you should reject negative arguments
with an error.

I've updated 0001. The change is trivial, see attached.

For 0002, I think you should change the block number arguments to
int8, same as other contrib modules do.

Agree. It will need a bit more work, though. Probably a new version of
pageinspect contrib, as the public API will change.
Ashutosh, are you going to continue working on it?

--
Anastasia Lubennikova
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company

#13

peter_e@gmx.net

over 5 years ago

In reply to: Peter Eisentraut (#11)

Re: Improper use about DatumGetInt32

On 2020-11-02 16:59, Peter Eisentraut wrote:

I have committed 0003.

For 0001, normal_rand(), I think you should reject negative arguments
with an error.

I have committed a fix for that.

For 0002, I think you should change the block number arguments to int8,
same as other contrib modules do.

Looking further into this, almost all of pageinspect needs to be updated
to handle block numbers larger than INT_MAX correctly. Attached is a
patch for this. It is meant to work like other contrib modules, such as
pg_freespace and pg_visibility. I haven't tested this much yet.

#14

alvherre@2ndquadrant.com

over 5 years ago

In reply to: Peter Eisentraut (#13)

Re: Improper use about DatumGetInt32

On 2020-Nov-25, Peter Eisentraut wrote:

bt_page_stats(PG_FUNCTION_ARGS)
{
text *relname = PG_GETARG_TEXT_PP(0);
- uint32 blkno = PG_GETARG_UINT32(1);
+ int64 blkno = PG_GETARG_INT64(1);

As a matter of style, I think it'd be better to have an int64 variable
that gets the value from PG_GETARG_INT64(), then you cast that to
another variable that's a BlockNumber and use that throughout the rest
of the code. So you'd avoid changes like this:

static bytea *get_raw_page_internal(text *relname, ForkNumber forknum,
-									BlockNumber blkno);
+									int64 blkno);

where the previous coding was correct, and the new one is dubious and it
forces you to add unnecessary range checks in that function:


@@ -144,11 +144,16 @@ get_raw_page_internal(text *relname, ForkNumber forknum, BlockNumber blkno)
(errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
errmsg("cannot access temporary tables of other sessions")));
+	if (blkno < 0 || blkno > MaxBlockNumber)
+		ereport(ERROR,
+				(errcode(ERRCODE_INVALID_PARAMETER_VALUE),
+				 errmsg("invalid block number")));
+

#15

peter_e@gmx.net

over 5 years ago

In reply to: Alvaro Herrera (#14)

Re: Improper use about DatumGetInt32

On 2020-11-25 20:04, Alvaro Herrera wrote:

On 2020-Nov-25, Peter Eisentraut wrote:

bt_page_stats(PG_FUNCTION_ARGS)
{
text *relname = PG_GETARG_TEXT_PP(0);
- uint32 blkno = PG_GETARG_UINT32(1);
+ int64 blkno = PG_GETARG_INT64(1);

As a matter of style, I think it'd be better to have an int64 variable
that gets the value from PG_GETARG_INT64(), then you cast that to
another variable that's a BlockNumber and use that throughout the rest
of the code. So you'd avoid changes like this:
static bytea *get_raw_page_internal(text *relname, ForkNumber forknum,
-									BlockNumber blkno);
+									int64 blkno);
where the previous coding was correct, and the new one is dubious and it
forces you to add unnecessary range checks in that function:
@@ -144,11 +144,16 @@ get_raw_page_internal(text *relname, ForkNumber forknum, BlockNumber blkno)
(errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
errmsg("cannot access temporary tables of other sessions")));
+	if (blkno < 0 || blkno > MaxBlockNumber)
+		ereport(ERROR,
+				(errcode(ERRCODE_INVALID_PARAMETER_VALUE),
+				 errmsg("invalid block number")));
+

The point of the patch is to have the range check somewhere. If you
just cast it, then you won't notice out of range arguments. Note that
other contrib modules that take block numbers work the same way.
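
A standalone sketch of the difference between a bare cast and a checked conversion, assuming local stand-ins for BlockNumber and MaxBlockNumber; cast_only and cast_checked are hypothetical names used only here:

```c
#include <stdbool.h>
#include <stdint.h>

/* Local stand-ins for illustration only. */
typedef uint32_t BlockNumber;
#define MaxBlockNumber ((BlockNumber) 0xFFFFFFFE)

/* A bare cast keeps only the low 32 bits, so bad input goes unnoticed. */
static BlockNumber
cast_only(int64_t arg)
{
	return (BlockNumber) arg;
}

/* The checked variant reports out-of-range input instead of wrapping. */
static bool
cast_checked(int64_t arg, BlockNumber *blkno)
{
	if (arg < 0 || arg > MaxBlockNumber)
		return false;
	*blkno = (BlockNumber) arg;
	return true;
}
```

For example, cast_only(4294967301) quietly yields block 5, while cast_checked() refuses the same input.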

#16

alvherre@2ndquadrant.com

over 5 years ago

In reply to: Peter Eisentraut (#15)

Re: Improper use about DatumGetInt32

On 2020-Nov-26, Peter Eisentraut wrote:

The point of the patch is to have the range check somewhere. If you just
cast it, then you won't notice out of range arguments. Note that other
contrib modules that take block numbers work the same way.

I'm not saying not to do that; just saying we should not propagate it to
places that don't need it. get_raw_page gets its page number from
PG_GETARG_INT64(), and the range check should be there. But then it
calls get_raw_page_internal, and it could pass a BlockNumber -- there's
no need to pass an int64. So get_raw_page_internal does not need a
range check.

#17

peter_e@gmx.net

over 5 years ago

In reply to: Alvaro Herrera (#16)

Re: Improper use about DatumGetInt32

On 2020-11-26 14:27, Alvaro Herrera wrote:

On 2020-Nov-26, Peter Eisentraut wrote:

The point of the patch is to have the range check somewhere. If you just
cast it, then you won't notice out of range arguments. Note that other
contrib modules that take block numbers work the same way.

I'm not saying not to do that; just saying we should not propagate it to
places that don't need it. get_raw_page gets its page number from
PG_GETARG_INT64(), and the range check should be there. But then it
calls get_raw_page_internal, and it could pass a BlockNumber -- there's
no need to pass an int64. So get_raw_page_internal does not need a
range check.

Yeah, I had it like that for a moment, but then you need to duplicate
the check in get_raw_page() and get_raw_page_fork(). I figured since
get_raw_page_internal() does all the other argument checking also, it
seems sensible to put the block range check there too. But it's not a
big deal either way.

#18

ashutosh.bapat@enterprisedb.com

over 5 years ago

In reply to: Anastasia Lubennikova (#12)

Re: Improper use about DatumGetInt32

On Wed, Nov 25, 2020 at 8:13 PM Anastasia Lubennikova <a.lubennikova@postgrespro.ru> wrote:

On 02.11.2020 18:59, Peter Eisentraut wrote:

I have committed 0003.

For 0001, normal_rand(), I think you should reject negative arguments
with an error.

I've updated 0001. The change is trivial, see attached.

For 0002, I think you should change the block number arguments to
int8, same as other contrib modules do.

Agree. It will need a bit more work, though. Probably a new version of
pageinspect contrib, as the public API will change.
Ashutosh, are you going to continue working on it?

Sorry I was away on Diwali vacation so couldn't address Peter's comments in
time. Thanks for taking this further. I will review Peter's patch.

--
Best Wishes,
Ashutosh

#19

ashutosh.bapat@enterprisedb.com

over 5 years ago

In reply to: Peter Eisentraut (#17)

Re: Improper use about DatumGetInt32

On Thu, Nov 26, 2020 at 9:57 PM Peter Eisentraut <peter.eisentraut@enterprisedb.com> wrote:

On 2020-11-26 14:27, Alvaro Herrera wrote:

On 2020-Nov-26, Peter Eisentraut wrote:

The point of the patch is to have the range check somewhere. If you just
cast it, then you won't notice out of range arguments. Note that other
contrib modules that take block numbers work the same way.

I'm not saying not to do that; just saying we should not propagate it to
places that don't need it. get_raw_page gets its page number from
PG_GETARG_INT64(), and the range check should be there. But then it
calls get_raw_page_internal, and it could pass a BlockNumber -- there's
no need to pass an int64. So get_raw_page_internal does not need a
range check.

Yeah, I had it like that for a moment, but then you need to duplicate
the check in get_raw_page() and get_raw_page_fork(). I figured since
get_raw_page_internal() does all the other argument checking also, it
seems sensible to put the block range check there too. But it's not a
big deal either way.

FWIW, my 2c. Though I agree with both sides, I
prefer get_raw_page_internal() accepting BlockNumber, since that's what it
deals with and not the entire int8.

--
Best Wishes,
Ashutosh

#20

peter_e@gmx.net

over 5 years ago

In reply to: Ashutosh Bapat (#19)

Re: Improper use about DatumGetInt32

On 2020-11-27 13:37, Ashutosh Bapat wrote:

Yeah, I had it like that for a moment, but then you need to duplicate
the check in get_raw_page() and get_raw_page_fork(). I figured since
get_raw_page_internal() does all the other argument checking also, it
seems sensible to put the block range check there too. But it's not a
big deal either way.

FWIW, my 2c. Though I agree with both sides, I
prefer get_raw_page_internal() accepting BlockNumber, since that's what
it deals with and not the entire int8.

Patch updated this way. I agree it's better that way.
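
The layering agreed on here can be sketched in standalone C as follows. This is only an illustration of the shape, not the committed patch: raw_page_entry and raw_page_internal are hypothetical names, the types are local stand-ins, and returning false takes the place of reporting an invalid block number:

```c
#include <stdbool.h>
#include <stdint.h>

/* Local stand-ins for illustration only. */
typedef uint32_t BlockNumber;
#define MaxBlockNumber ((BlockNumber) 0xFFFFFFFE)

/*
 * Internal worker: it takes a BlockNumber, so it needs no range check.
 * Stand-in body: it just returns the block number it was asked to read.
 */
static BlockNumber
raw_page_internal(BlockNumber blkno)
{
	return blkno;
}

/*
 * SQL-facing entry point: it receives the int8 argument as int64,
 * validates it once, and narrows it before calling the worker.
 */
static bool
raw_page_entry(int64_t arg, BlockNumber *result)
{
	if (arg < 0 || arg > MaxBlockNumber)
		return false;	/* real code would report an invalid block number */
	*result = raw_page_internal((BlockNumber) arg);
	return true;
}
```

The trade-off discussed in the thread still applies: with more than one entry point, each one has to perform the check (or share a helper) before calling the worker.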

#21

alvherre@2ndquadrant.com

over 5 years ago

In reply to: Peter Eisentraut (#20)

#22