Is float8 a reference type?

Started by Paul Jungwirthover 8 years ago9 messagesgeneral

pj@illuminatedcomputing.com

over 8 years ago

The docs say that a Datum can be 4 bytes or 8 depending on the machine:

https://www.postgresql.org/docs/9.5/static/sql-createtype.html

Is a Datum always 8 bytes for 64-bit architectures?

And if so, can my C extension skip a loop like this when compiling
there, and just do a memcpy (or even a cast)?:

float8 *floats;
Datum *datums;

datums = palloc(arrlen * sizeof(Datum));
for (i = 0; i < arrlen; i++) {
datums[i] = Float8GetDatum(floats[i]);
}

Thanks!
Paul

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

pavel.stehule@gmail.com

over 8 years ago

In reply to: Paul Jungwirth (#1)

Re: Is float8 a reference type?

2017-09-23 4:52 GMT+02:00 Paul A Jungwirth <pj@illuminatedcomputing.com>:

The docs say that a Datum can be 4 bytes or 8 depending on the machine:

https://www.postgresql.org/docs/9.5/static/sql-createtype.html

Is a Datum always 8 bytes for 64-bit architectures?

And if so, can my C extension skip a loop like this when compiling
there, and just do a memcpy (or even a cast)?:

yes, it is 8 bytes on 64-bit.

I don't think so it is good idea to write 64bit only extensions.

Show quoted text

float8 *floats;
Datum *datums;

datums = palloc(arrlen * sizeof(Datum));
for (i = 0; i < arrlen; i++) {
datums[i] = Float8GetDatum(floats[i]);
}

Thanks!
Paul

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

pj@illuminatedcomputing.com

over 8 years ago

In reply to: Paul Jungwirth (#1)

Re: Is float8 a reference type?

On Fri, Sep 22, 2017 at 7:52 PM, Paul A Jungwirth
<pj@illuminatedcomputing.com> wrote:

Is a Datum always 8 bytes for 64-bit architectures?

Never mind, I found this in `pg_config.h`:

/* float8, int8, and related values are passed by value if 'true', by
reference if 'false' */
#define FLOAT8PASSBYVAL true

Paul

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

pj@illuminatedcomputing.com

over 8 years ago

In reply to: Pavel Stehule (#2)

Re: Is float8 a reference type?

On Fri, Sep 22, 2017 at 8:05 PM, Pavel Stehule <pavel.stehule@gmail.com> wrote:

yes, it is 8 bytes on 64-bit.

Thanks!

I don't think so it is good idea to write 64bit only extensions.

I agree, but how about this?:

if (FLOAT8PASSBYVAL) {
datums = (Datum *)floats;
} else {
datums = palloc0(arrlen * sizeof(Datum));
for (i = 0; i < arrlen; i++) {
datums[i] = Float8GetDatum(floats[i]);
}
}

Thanks,
Paul

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

pavel.stehule@gmail.com

over 8 years ago

In reply to: Paul Jungwirth (#4)

Re: Is float8 a reference type?

2017-09-23 5:10 GMT+02:00 Paul A Jungwirth <pj@illuminatedcomputing.com>:

On Fri, Sep 22, 2017 at 8:05 PM, Pavel Stehule <pavel.stehule@gmail.com>
wrote:

yes, it is 8 bytes on 64-bit.

Thanks!

I don't think so it is good idea to write 64bit only extensions.

I agree, but how about this?:

if (FLOAT8PASSBYVAL) {
datums = (Datum *)floats;
} else {
datums = palloc0(arrlen * sizeof(Datum));
for (i = 0; i < arrlen; i++) {
datums[i] = Float8GetDatum(floats[i]);
}
}

it can work.

You have to solve deallocation in only one path. palloc0 is not necessary
in this case.

Show quoted text

Thanks,
Paul

tgl@sss.pgh.pa.us

over 8 years ago

In reply to: Paul Jungwirth (#4)

Re: Is float8 a reference type?

Paul A Jungwirth <pj@illuminatedcomputing.com> writes:

On Fri, Sep 22, 2017 at 8:05 PM, Pavel Stehule <pavel.stehule@gmail.com> wrote:

I don't think so it is good idea to write 64bit only extensions.

I agree, but how about this?:

"Premature optimization is the root of all evil". Do you have good reason
to think that it's worth your time to write unsafe/unportable code? Do
you know that your compiler doesn't turn Float8GetDatum into a no-op
already? (Mine does, on a 64-bit machine.)

regards, tom lane

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

pj@illuminatedcomputing.com

over 8 years ago

In reply to: Tom Lane (#6)

Re: Is float8 a reference type?

On Fri, Sep 22, 2017 at 8:38 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:

"Premature optimization is the root of all evil". Do you have good reason
to think that it's worth your time to write unsafe/unportable code? Do
you know that your compiler doesn't turn Float8GetDatum into a no-op
already? (Mine does, on a 64-bit machine.)

Ha ha, thank you for keeping me honest! But can you explain what is
unsafe about the cast? For a little more context: I've loaded a float8
array from a file, but I need to pass a Datum array to
construct_md_array. With an 8-byte Datum, I can just pass the original
float array, right? But with smaller Datums I need to go through the
array and convert each element. (I'm not really worried about these
files being moved between machines, so I'm willing to make the on-disk
format the same as the in-memory format.)

Since I'm expecting ~10 million elements per array, it seems like
skipping the conversion will have a real effect. I checked the
assembly and do see a difference (on both Mac+clang and Linux+gcc).
Here is the Mac command line:

platter:floatfile paul$ clang -Wall -Wmissing-prototypes
-Wpointer-arith -Wdeclaration-after-statement -Wendif-labels
-Wmissing-format-attribute -Wformat-security -fno-strict-aliasing
-fwrapv -Wno-unused-command-line-argument -O2 -I. -I./
-I/usr/local/Cellar/postgresql@9.6/9.6.3/include/server
-I/usr/local/Cellar/postgresql@9.6/9.6.3/include/internal
-I/usr/local/opt/gettext/include -I/usr/local/opt/openldap/include
-I/usr/local/opt/openssl/include -I/usr/local/opt/readline/include
-I/usr/local/opt/tcl-tk/include -g -S -o floatfile.s floatfile.c

Here is the assembly for the cast:

.loc 2 391 23 is_stmt 1 ## floatfile.c:391:23
movq -48(%rbp), %r15
Ltmp176:
##DEBUG_VALUE: load_floatfile:datums <- %R15

Here is the assembly for the loop (after just changing the code to `if
(FLOAT8PASSBYVAL && false)`):

.loc 2 393 21 is_stmt 1 ## floatfile.c:393:21
movslq %r15d, %r13
.loc 2 393 28 is_stmt 0 ## floatfile.c:393:28
leaq (,%r13,8), %rdi
.loc 2 393 14 ## floatfile.c:393:14
callq _palloc
movq %rax, %r12
Ltmp177:
##DEBUG_VALUE: load_floatfile:i <- 0
.loc 2 394 19 is_stmt 1 discriminator 1 ## floatfile.c:394:19
testl %r15d, %r15d
Ltmp178:
.loc 2 394 5 is_stmt 0 discriminator 1 ## floatfile.c:394:5
je LBB7_11
Ltmp179:
## BB#9:
##DEBUG_VALUE: load_floatfile:arrlen <- %R15D
##DEBUG_VALUE: load_floatfile:nulls <- [%RBP+-80]
##DEBUG_VALUE: load_floatfile:floats <- [%RBP+-72]
##DEBUG_VALUE: load_floatfile:filename <- %RBX
.loc 2 0 5 discriminator 1 ## floatfile.c:0:5
movq -72(%rbp), %rbx
Ltmp180:
##DEBUG_VALUE: load_floatfile:floats <- %RBX
xorl %r14d, %r14d
Ltmp181:
.p2align 4, 0x90
LBB7_10: ## =>This Inner Loop Header: Depth=1
##DEBUG_VALUE: load_floatfile:floats <- %RBX
##DEBUG_VALUE: load_floatfile:arrlen <- %R15D
##DEBUG_VALUE: load_floatfile:nulls <- [%RBP+-80]
.loc 2 395 34 is_stmt 1 ## floatfile.c:395:34
movsd (%rbx,%r14,8), %xmm0 ## xmm0 = mem[0],zero
.loc 2 395 19 is_stmt 0 ## floatfile.c:395:19
callq _Float8GetDatum
.loc 2 395 17 ## floatfile.c:395:17
movq %rax, (%r12,%r14,8)
Ltmp182:
.loc 2 394 30 is_stmt 1 discriminator 2 ## floatfile.c:394:30
incq %r14
.loc 2 394 19 is_stmt 0 discriminator 1 ## floatfile.c:394:19
cmpq %r13, %r14
Ltmp183:
.loc 2 394 5 discriminator 1 ## floatfile.c:394:5
jl LBB7_10
Ltmp184:
LBB7_11:
##DEBUG_VALUE: load_floatfile:arrlen <- %R15D
##DEBUG_VALUE: load_floatfile:nulls <- [%RBP+-80]

I get the same results on gcc too: the palloc, the loop, and even
`call Float8GetDatum@PLT`.

I'll do some timing of each version too, but it doesn't look like a
pointless optimization. I'd still like to know what is unsafe about it
though.

Thanks!
Paul

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

tgl@sss.pgh.pa.us

over 8 years ago

In reply to: Paul Jungwirth (#7)

Re: Is float8 a reference type?

Paul A Jungwirth <pj@illuminatedcomputing.com> writes:

Since I'm expecting ~10 million elements per array, it seems like
skipping the conversion will have a real effect. I checked the
assembly and do see a difference (on both Mac+clang and Linux+gcc).

I wonder whether you're using up-to-date Postgres headers (ones
where Float8GetDatum is a static inline function). For me, both
of those platforms recognize it as a no-op --- in fact, clang
turns a loop like

for (i = 0; i < n; i++) {
datums[i] = Float8GetDatum(floats[i]);
}

into something that looks suspiciously like an inlined, loop-unrolled
memcpy().

regards, tom lane

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

pj@illuminatedcomputing.com

over 8 years ago

In reply to: Tom Lane (#8)

Re: Is float8 a reference type?

On Sat, Sep 23, 2017 at 9:40 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:

I wonder whether you're using up-to-date Postgres headers (ones
where Float8GetDatum is a static inline function).

I'm building against 9.6.3 on both machines. I'm not doing anything
special to change the compilation options. Here is my whole Makefile:

MODULES = floatfile
EXTENSION = floatfile
EXTENSION_VERSION = 1.0.0
DATA = floatfile--$(EXTENSION_VERSION).sql

PG_CONFIG = pg_config
PGXS := $(shell $(PG_CONFIG) --pgxs)
include $(PGXS)

But what I'm really interested in is this: What are the bad things
that can happen if I do `datums = (Datum *)floats`, as long as it's
only when Datums are 8 bytes wide? Is there a platform with
pass-by-val float8s where that won't work?

Thanks,
Paul

--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general