leakproof

Started by Andrew Dunstanover 14 years ago35 messageshackers

andrew@dunslane.net

over 14 years ago

I missed all the fun while the "leakproof" addition to function
attributes was being decided, so I know I'm late to the party. Today I
had to go and look up what it actually meant. I have to say that I was a
bit surprised. I expected it to refer to memory management in some way.
I don't honestly think "leakproof" as a term is going to convey much to
lots of people. Can we come up with a more descriptive term?

cheers

andrew

Robert Haas

robertmhaas@gmail.com

over 14 years ago

In reply to: Andrew Dunstan (#1)

Re: leakproof

On Sun, Feb 19, 2012 at 5:29 PM, Andrew Dunstan <andrew@dunslane.net> wrote:

I missed all the fun while the "leakproof" addition to function attributes
was being decided, so I know I'm late to the party. Today I had to go and
look up what it actually meant. I have to say that I was a bit surprised. I
expected it to refer to memory management in some way. I don't honestly
think "leakproof" as a term is going to convey much to lots of people. Can
we come up with a more descriptive term?

We bikeshed on that topic a while back and nobody suggested anything
that got more than 1 or 2 votes. But I'm still happy to rename it if
we can come up with something better, because I'm not in love with it
either.

Having now spent far too much time in bed with that patch, I'm feeling
like the concept that we are really looking for there is what some
languages call "pure" - that is, there must be no side effects,
whether by throwing exceptions or otherwise.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Tom Lane

tgl@sss.pgh.pa.us

over 14 years ago

In reply to: Robert Haas (#2)

Re: leakproof

Robert Haas <robertmhaas@gmail.com> writes:

On Sun, Feb 19, 2012 at 5:29 PM, Andrew Dunstan <andrew@dunslane.net> wrote:

Can we come up with a more descriptive term?

We bikeshed on that topic a while back and nobody suggested anything
that got more than 1 or 2 votes. But I'm still happy to rename it if
we can come up with something better, because I'm not in love with it
either.

Having now spent far too much time in bed with that patch, I'm feeling
like the concept that we are really looking for there is what some
languages call "pure" - that is, there must be no side effects,
whether by throwing exceptions or otherwise.

Hmm, "pure" doesn't sound bad to me. Nice and short.

regards, tom lane

Don Baccus

dhogaza@pacifier.com

over 14 years ago

In reply to: Tom Lane (#3)

Re: leakproof

On Feb 19, 2012, at 5:42 PM, Tom Lane wrote:

Robert Haas <robertmhaas@gmail.com> writes:

Having now spent far too much time in bed with that patch, I'm feeling
like the concept that we are really looking for there is what some
languages call "pure" - that is, there must be no side effects,
whether by throwing exceptions or otherwise.

Hmm, "pure" doesn't sound bad to me. Nice and short.

Technically, "pure" is stronger than "has no side effects":

http://en.wikipedia.org/wiki/Pure_function

Result can't depend on state (for instance, database contents), either. This is the typical definition used in functional programming.

gcc extends this to allow use of global variables in a "pure" function (the stricter definition is met by "const" functions). PG has "immutable", so a slightly weaker "pure" probably wouldn't be terribly confusing given the gcc precedent (probably across their family of compilers).

"D" adopts the stricter definition of "pure".

So there's some confusion around the term.

But …

I picked up this thread after "leakproof" was settled on and was curious as to what "leakproof" was supposed to be as I didn't read the earlier posts. I assumed it meant "doesn't leak memory", which seems admirable and typical and not needful of an attribute on the function declaration.

"pure" is definitely less confusing IMO, if it's congruent with the weaker sense of "pure" that's found in some languages/implementations.

----
Don Baccus
http://donb.photo.net
http://birdnotes.net
http://openacs.org

Bruce Momjian

bruce@momjian.us

over 14 years ago

In reply to: Don Baccus (#4)

Re: leakproof

I suspect this is wrong for similar reasons as "pure" but I'll throw
it out there: "hermetic"

--
greg

Tom Lane

tgl@sss.pgh.pa.us

over 14 years ago

In reply to: Don Baccus (#4)

Re: leakproof

Don Baccus <dhogaza@pacifier.com> writes:

On Feb 19, 2012, at 5:42 PM, Tom Lane wrote:

Hmm, "pure" doesn't sound bad to me. Nice and short.

Technically, "pure" is stronger than "has no side effects":
http://en.wikipedia.org/wiki/Pure_function
Result can't depend on state (for instance, database contents), either. This is the typical definition used in functional programming.

Well, that condition is subsumed in our idea of an immutable function.
It's not clear to me whether pure/leakproof functions are meant to be a
strict subset of immutable functions, but if they are then they meet
this stricter definition. On the other hand, if pure/leakproof functions
don't have to be immutable but only stable, then the stricter definition
corresponds to "pure immutable". That still doesn't sound too bad, as
long as we define our terms clearly in the docs.

regards, tom lane

Don Baccus

dhogaza@pacifier.com

over 14 years ago

In reply to: Tom Lane (#6)

Re: leakproof

On Feb 19, 2012, at 7:24 PM, Tom Lane wrote:

Don Baccus <dhogaza@pacifier.com> writes:

On Feb 19, 2012, at 5:42 PM, Tom Lane wrote:

Hmm, "pure" doesn't sound bad to me. Nice and short.

Technically, "pure" is stronger than "has no side effects":
http://en.wikipedia.org/wiki/Pure_function
Result can't depend on state (for instance, database contents), either. This is the typical definition used in functional programming.

Well, that condition is subsumed in our idea of an immutable function.

Yes, I said that myself, perhaps you didn't bother to read closely?

It's not clear to me whether pure/leakproof functions are meant to be a
strict subset of immutable functions

Superset, not subset, unless my guessing is wrong. How could "pure" be a subset of "immutable"?

OK, at this point, proponents will explain why ...

But if you're not clear as to what a "leakproof" function is meant to be. then I suggest the definition must be defined very clearly, so everyone understands what it is meant to be.

, but if they are then they meet
this stricter definition. On the other hand, if pure/leakproof functions
don't have to be immutable but only stable, then the stricter definition
corresponds to "pure immutable". That still doesn't sound too bad, as
long as we define our terms clearly in the docs.

Sure, let those making the proposal make things clear.

Just speaking as a gadfly who's not posted here for probably close on 10 years …

----
Don Baccus
http://donb.photo.net
http://birdnotes.net
http://openacs.org

Robert Haas

robertmhaas@gmail.com

over 14 years ago

In reply to: Tom Lane (#6)

Re: leakproof

On Sun, Feb 19, 2012 at 10:24 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:

Don Baccus <dhogaza@pacifier.com> writes:

On Feb 19, 2012, at 5:42 PM, Tom Lane wrote:

Hmm, "pure" doesn't sound bad to me. Nice and short.

Technically, "pure" is stronger than "has no side effects":
http://en.wikipedia.org/wiki/Pure_function
Result can't depend on state (for instance, database contents), either. This is the typical definition used in functional programming.

Well, that condition is subsumed in our idea of an immutable function.
It's not clear to me whether pure/leakproof functions are meant to be a
strict subset of immutable functions, but if they are then they meet
this stricter definition. On the other hand, if pure/leakproof functions
don't have to be immutable but only stable, then the stricter definition
corresponds to "pure immutable". That still doesn't sound too bad, as
long as we define our terms clearly in the docs.

For the present application (pushdown into security views), we really
only care whether the function has side effects, such as throwing an
error or mutating global state. So, in theory, even a volatile
function could be leakproof - it could read (but not write) some piece
of global, volatile state. In practice, I'm not sure those cases are
important at all. Right now, the only things marked as leakproof are
relational operators that might be indexable, precisely so that we
might be able to push an indexable qual down far enough to allow an
index scan, even in the presence of an intervening security view.
Maybe someone will want to push down a qual like x > now() or x >
clock_timestamp(), but I guess I can't get that excited about that.
There are so few leakproof functions that the chances of making
pushdown work safely for much of anything beyond col = const seem
remote. So, my tea leaves are telling me that if we want to make pure
a subset of immutable, that probably isn't going to cause a problem.
However, I am not a CTLR (certified tea leaf reader).

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Yeb Havinga

yebhavinga@gmail.com

over 14 years ago

In reply to: Don Baccus (#7)

Re: leakproof

On 2012-02-20 06:37, Don Baccus wrote:

On Feb 19, 2012, at 7:24 PM, Tom Lane wrote:

It's not clear to me whether pure/leakproof functions are meant to be a
strict subset of immutable functions

Superset, not subset, unless my guessing is wrong. How could "pure" be a subset of "immutable"?

If immutable functions are not necessarily leakproof/pure, and all
leakproof/pure functions are immutable.

If the latter is not the case, "pure" leads to confusion as well.

What about "discreet"?

-- Yeb

#10

Laurenz Albe

laurenz.albe@cybertec.at

over 14 years ago

In reply to: Bruce Momjian (#5)

Re: leakproof

Greg Stark wrote:

I suspect this is wrong for similar reasons as "pure" but I'll throw
it out there: "hermetic"

I personally have no problem with "leakproof", but if it does not tickle
the right associations with many people:

What about "secure"? It is also not self-explanatory, but it might give
people the idea that it's more a problem of restricting access to
information
than anything else.

On the downside, "security" has been over- and abused so much already.

Yours,
Laurenz Albe

#11

Peter Eisentraut

peter_e@gmx.net

over 14 years ago

In reply to: Robert Haas (#8)

Re: leakproof

On mån, 2012-02-20 at 01:17 -0500, Robert Haas wrote:

For the present application (pushdown into security views), we really
only care whether the function has side effects, such as throwing an
error or mutating global state.

How about [NO] SIDEEFFECTS?

#12

Tom Lane

tgl@sss.pgh.pa.us

over 14 years ago

In reply to: Peter Eisentraut (#11)

Re: leakproof

Peter Eisentraut <peter_e@gmx.net> writes:

On mån, 2012-02-20 at 01:17 -0500, Robert Haas wrote:

For the present application (pushdown into security views), we really
only care whether the function has side effects, such as throwing an
error or mutating global state.

How about [NO] SIDEEFFECTS?

Well, that's already stated to be one of the requirements for being
immutable or stable, so I think we need a term that's a bit stronger.

The real issue here is that the notion of what is a side effect is
much much broader than what we have used in the past. I don't think
we clarify that by continuing to use the same term "side effect".

regards, tom lane

#13

Merlin Moncure

mmoncure@gmail.com

over 14 years ago

In reply to: Don Baccus (#4)

Re: leakproof

On Sun, Feb 19, 2012 at 8:20 PM, Don Baccus <dhogaza@pacifier.com> wrote:

On Feb 19, 2012, at 5:42 PM, Tom Lane wrote:

Robert Haas <robertmhaas@gmail.com> writes:

Having now spent far too much time in bed with that patch, I'm feeling
like the concept that we are really looking for there is what some
languages call "pure" - that is, there must be no side effects,
whether by throwing exceptions or otherwise.

Hmm, "pure" doesn't sound bad to me. Nice and short.

Technically, "pure" is stronger than "has no side effects":

+1 pure.

merlin

#14

Jeff Janes

jeff.janes@gmail.com

over 14 years ago

In reply to: Don Baccus (#4)

Re: leakproof

On Sun, Feb 19, 2012 at 6:20 PM, Don Baccus <dhogaza@pacifier.com> wrote:

On Feb 19, 2012, at 5:42 PM, Tom Lane wrote:

Robert Haas <robertmhaas@gmail.com> writes:

Having now spent far too much time in bed with that patch, I'm feeling
like the concept that we are really looking for there is what some
languages call "pure" - that is, there must be no side effects,
whether by throwing exceptions or otherwise.

Hmm, "pure" doesn't sound bad to me. Nice and short.

Technically, "pure" is stronger than "has no side effects":

http://en.wikipedia.org/wiki/Pure_function

Result can't depend on state (for instance, database contents), either. This is the typical definition used in functional programming.

gcc extends this to allow use of global variables in a "pure" function (the stricter definition is met by "const" functions). PG has "immutable", so a slightly weaker "pure" probably wouldn't be terribly confusing given the gcc precedent (probably across their family of compilers).

"D" adopts the stricter definition of "pure".

So there's some confusion around the term.

But …

I picked up this thread after "leakproof" was settled on and was curious as to what "leakproof" was supposed to be as I didn't read the earlier posts. I assumed it meant "doesn't leak memory", which seems admirable and typical and not needful of an attribute on the function declaration.

"pure" is definitely less confusing IMO, if it's congruent with the weaker sense of "pure" that's found in some languages/implementations.

I don't think that "pure" is sufficient to be leakproof. For example,
if I have a function which is pure but which takes an unusually long
time to evaluate for some unique pathological combination of
arguments, I don't think that it would be considered leakproof.

Cheers,

Jeff

#15

Robert Haas

robertmhaas@gmail.com

over 14 years ago

In reply to: Jeff Janes (#14)

Re: leakproof

On Tue, Feb 21, 2012 at 11:54 AM, Jeff Janes <jeff.janes@gmail.com> wrote:

I don't think that "pure" is sufficient to be leakproof. For example,
if I have a function which is pure but which takes an unusually long
time to evaluate for some unique pathological combination of
arguments, I don't think that it would be considered leakproof.

I think we've made up our mind to largely ignore such cases, though.
There may be information that gets leaked either through the choice of
plan or the time it takes to execute it, but plugging that type of
hole completely is almost impossible. Suppose the security view
filters for rows where user = 'rhaas' and the evil rhaas requests rows
where id = 5. The planner chooses an index-scan on id = 5 and then
applies a filter of user = 'rhaas' to the result. There's probably
some slight timing difference between the case where no id = 5 row
exists, and the case where it exists but does not have user = 'rhaas',
so a sufficiently dedicated hacker might be able to employ a timing
attack to determine the existence of such a row. Also, by jiggering
with page costs and EXPLAIN, they might be able to get some idea of
the number of rows in the table.

I don't believe that this makes the feature useless, though. In most
cases, and especially when using surrogate keys (which I typically do,
whether Josh Berkus likes it or not), the existence of the row is not
a terribly interesting piece of information: it's the content of the
row that's sensitive. Neither table-level security nor column-level
security attempt to prevent you from determining that the table or
column exists; they just hide its contents. And if you need to bar
EXPLAIN so people can't peek at query plans, it's certainly possible
to do that using ProcessUtility_hook. If you're worried about more
subtle timing attacks, you should probably rethink the idea of letting
people log into the database in the first place.

Anyway, to your point, I suppose I might hesitate to mark factorial
leak-proof even if it didn't throw an error on overflow, because the
time it takes to return an answer for larger inputs does grow rather
rapidly. But it's kind of a moot point because the error makes it not
leak-proof anyway. So maybe we're just splitting hairs here, however
we decide to label this.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

#16

Tom Lane

tgl@sss.pgh.pa.us

over 14 years ago

In reply to: Robert Haas (#15)

Re: leakproof

Robert Haas <robertmhaas@gmail.com> writes:

Anyway, to your point, I suppose I might hesitate to mark factorial
leak-proof even if it didn't throw an error on overflow, because the
time it takes to return an answer for larger inputs does grow rather
rapidly. But it's kind of a moot point because the error makes it not
leak-proof anyway. So maybe we're just splitting hairs here, however
we decide to label this.

Speaking of hair-splitting ...

A strict interpretation of "no errors can be thrown" would, for example,
rule out any function that either takes or returns a varlena datatype,
since those are potentially going to throw out-of-memory errors if they
can't palloc a result value or a temporary detoasted input value.
I don't suppose that we want that, which means that this rule of thumb
is wrong in detail, and there had better be some more subtle definition
of what is okay or not, perhaps along the line of "must not throw any
errors that reveal anything useful about the input value". Have we got
such a definition? (I confess to not having followed this patch very
closely.)

regards, tom lane

#17

Andrew Dunstan

andrew@dunslane.net

over 14 years ago

In reply to: Tom Lane (#16)

Re: leakproof

On 02/22/2012 10:21 AM, Tom Lane wrote:

Robert Haas<robertmhaas@gmail.com> writes:

Anyway, to your point, I suppose I might hesitate to mark factorial
leak-proof even if it didn't throw an error on overflow, because the
time it takes to return an answer for larger inputs does grow rather
rapidly. But it's kind of a moot point because the error makes it not
leak-proof anyway. So maybe we're just splitting hairs here, however
we decide to label this.

Speaking of hair-splitting ...

A strict interpretation of "no errors can be thrown" would, for example,
rule out any function that either takes or returns a varlena datatype,
since those are potentially going to throw out-of-memory errors if they
can't palloc a result value or a temporary detoasted input value.
I don't suppose that we want that, which means that this rule of thumb
is wrong in detail, and there had better be some more subtle definition
of what is okay or not, perhaps along the line of "must not throw any
errors that reveal anything useful about the input value". Have we got
such a definition? (I confess to not having followed this patch very
closely.)

Yeah.

Returning to the original point, I've come to the conclusion that "pure"
isn't the right way to go. The trouble with "leakproof" is that it
doesn't point to what it is that's not leaking, which is information
rather than memory, as many might imagine (and I did) without further
hints. I'm not sure any single English word would be as descriptive as
I'd like.

cheers

andrew

#18

Kevin Grittner

Kevin.Grittner@wicourts.gov

over 14 years ago

In reply to: Andrew Dunstan (#17)

Re: leakproof

Andrew Dunstan <andrew@dunslane.net> wrote:

Returning to the original point, I've come to the conclusion that
"pure" isn't the right way to go. The trouble with "leakproof" is
that it doesn't point to what it is that's not leaking, which is
information rather than memory, as many might imagine (and I did)
without further hints. I'm not sure any single English word would
be as descriptive as I'd like.

Discreet?

http://www.merriam-webster.com/dictionary/discreet

I guess the risk is that people would confuse it with "discrete".

-Kevin

#19

Andrew Dunstan

andrew@dunslane.net

over 14 years ago

In reply to: Kevin Grittner (#18)

Re: leakproof

On 02/22/2012 11:14 AM, Kevin Grittner wrote:

Andrew Dunstan<andrew@dunslane.net> wrote:

Returning to the original point, I've come to the conclusion that
"pure" isn't the right way to go. The trouble with "leakproof" is
that it doesn't point to what it is that's not leaking, which is
information rather than memory, as many might imagine (and I did)
without further hints. I'm not sure any single English word would
be as descriptive as I'd like.

Discreet?

http://www.merriam-webster.com/dictionary/discreet

I guess the risk is that people would confuse it with "discrete".

Yes, too confusing.

"silent" might be better along those lines.

cheers

andrew

#20

Marc Munro

marc@bloodnok.com

over 14 years ago

In reply to: Andrew Dunstan (#19)

Re: leakproof

On Wed, 2012-02-22 at 12:44 -0400, Andrew Dunstan wrote:

Returning to the original point, I've come to the conclusion that
"pure"
isn't the right way to go. The trouble with "leakproof" is that it
doesn't point to what it is that's not leaking, which is information
rather than memory, as many might imagine (and I did) without further
hints. I'm not sure any single English word would be as descriptive as
I'd like.

As the developer of veil I feel marginally qualified to bikeshed here:
how about "silent"? A silent function being one that will not blab.

There are also quite a few synonyms in the thesaurus for trustworthy. I
kind of like "honorable" or "righteous" myself.

__
Marc

Import Notes

Reply to msg id not found: 20120222164542.2A4B412DA9@bloodnok.comReference msg id not found: 20120222164542.2A4B412DA9@bloodnok.com | Resolved by subject fallback

#21

Andrew Dunstan

andrew@dunslane.net

over 14 years ago

In reply to: Marc Munro (#20)

#22

Robert Haas

robertmhaas@gmail.com

over 14 years ago

In reply to: Tom Lane (#16)

#23

Tom Lane