Remove redundant DISTINCT when GROUP BY already guarantees uniqueness

Started by Ilia Evdokimov16 days ago7 messageshackers

ilya.evdokimov@tantorlabs.com

16 days ago

Hi hackers,

When a query contains both DISTINCT and GROUP BY, planner currently
always creates a separate plan node to eliminate duplicate output rows.
However, if GROUP BY is present, each output row already corresponds to
exactly one group - meaning the rows produced by GROUP BY are already
unique on their group keys. Any DISTINCT clause that covers all those
group keys therefore adds no filtering and the de-duplication step is
pure overhead.

This patch teaches the planner to detect this situation and skip
creating the DISTINCT paths entirely. The optimization applies when
three conditions hold:
- the query uses a plan GROUP BY (not GROUPING SETS, which can produce
extra NULL-filled rows across sets),
- the clause is plain DISTINCT (not DISTINCT ON, which has different
semantics),
- and every GROUP BY key appears in the DISTINCT clause.

The last condition is necessary because if a GROUP BY key is absent from
the DISTINCT list, two different groups could still produce identical
output tuple (e.g. `SELECT DISTINCT 1 FROM t GROUP BY a`)

Example:

```
CREATE TABLE t (a INT, b INT, c INT);
INSERT INTO t VALUES (1,1,10),(1,2,20),(2,1,30),(2,2,40),(1,1,50);
ANALYZE t;

EXPLAIN (COSTS OFF) SELECT DISTINCT a, b, sum(c) FROM t GROUP BY a, b;
```

Before patch:
```
QUERY PLAN
----------------------------------
Unique
-> Sort
Sort Key: a, b, (sum(c))
-> HashAggregate
Group Key: a, b
-> Seq Scan on t
(6 rows)
```

After patch
```
QUERY PLAN
---------------------
HashAggregate
Group Key: a, b
-> Seq Scan on t
(3 rows)
```

Any feedback or suggestions for additional cases are welcome.

--
Best regards.
Ilia Evdokimov,
Tantor Labs LLC,
https://tantorlabs.com/

Zsolt Parragi

zsolt.parragi@percona.com

16 days ago

In reply to: Ilia Evdokimov (#1)

Re: Remove redundant DISTINCT when GROUP BY already guarantees uniqueness

Hello!

+static bool
+distinct_redundant_by_groupby(Query *parse)

I think this also should check hasTargetSRFs:

CREATE TABLE t (a int, b int, c int);
INSERT INTO t VALUES (1,1,10),(1,2,20),(2,1,30),(2,2,40),(1,1,50);
SELECT DISTINCT a, unnest(ARRAY[1,1]) AS u FROM t GROUP BY a ORDER BY a, u;

Also, query_is_distinct_for already does something similar (and already handles hasTargetSRFs) - would it be possible to extract the common part instead of duplicating the logic?

Andres Taylor

andres@taylor.se

13 days ago

In reply to: Zsolt Parragi (#2)

Re: Remove redundant DISTINCT when GROUP BY already guarantees uniqueness

Hi Ilia, Zsolt,

+1 to Zsolt's hasTargetSRFs point -- I built v1 and reproduced the wrong
results:

CREATE TABLE t (a int, b int, c int);
INSERT INTO t VALUES (1,1,10),(1,2,20),(2,1,30),(2,2,40),(1,1,50);
SELECT DISTINCT a, unnest(ARRAY[1,1]) AS u FROM t GROUP BY a ORDER BY a, u;

on master: (1,1),(2,1) # DISTINCT collapses the unnest dups
with v1: (1,1),(1,1),(2,1),(2,1) # DISTINCT removed, dups leak

The SRF in the tlist is expanded after grouping but before DISTINCT, so
the DISTINCT is not redundant. Window functions are safe (they don't
change cardinality), so hasTargetSRFs is the right and sufficient guard.

On Zsolt's suggetion to reuse query_is_distinct_for(): I think that's
the right direction, and it fixes more than the SRF case. The GROUP BY
branch there also checks equality_ops_are_compatible(opid, sgc->eqop)
and collations_agree_on_equality(...), neither of which the new
distinct_redundant_by_groupby() checks (it matches on tleSortGroupRef
alone). So reusing it closes the SRF, opclass, and collation gaps at
once.

One wrinkle if you go that way: query_is_distinct_for() can't be called
directly for this, because its first branch inspects query->distinctClause
and would return true trivially (the query's own DISTINCT "proves"
distinctness on its own columns). The reusable part is specifically the
GROUP-BY branch (plus the hasTargetSRFs precheck). Extracting that into a
helper that takes the distinct columns and is called from both sites
seems like the cleanest shape.

FWIW there's a complementary case this structure could host later:
DISTINCT made redundant by a unique index/constraint
(SELECT DISTINCT pk FROM t), proven via relation_has_unique_index_for().
That one needs a NULL gate that the GROUP BY case doesn't, so it's
separate logic -- but a shared "is this DISTINCT redundant?" entry point
with pluggable proofs would fit both. Happy to follow up with that as a
separate patch once this lands.

I tried signing up as a reviewer, but I need to wait for my one week
cool-off period first. I sign up ASAP.

Thanks for working on this!

Show quoted text

On Thu, Jun 11, 2026 at 6:01 PM Zsolt Parragi <zsolt.parragi@percona.com> wrote:

Hello!
+static bool
+distinct_redundant_by_groupby(Query *parse)
I think this also should check hasTargetSRFs:

CREATE TABLE t (a int, b int, c int);
INSERT INTO t VALUES (1,1,10),(1,2,20),(2,1,30),(2,2,40),(1,1,50);
SELECT DISTINCT a, unnest(ARRAY[1,1]) AS u FROM t GROUP BY a ORDER BY a, u;

Also, query_is_distinct_for already does something similar (and
already handles hasTargetSRFs) - would it be possible to extract the
common part instead of duplicating the logic?

David Rowley

dgrowleyml@gmail.com

12 days ago

In reply to: Ilia Evdokimov (#1)

Re: Remove redundant DISTINCT when GROUP BY already guarantees uniqueness

On Mon, 8 Jun 2026 at 23:12, Ilia Evdokimov
<ilya.evdokimov@tantorlabs.com> wrote:

When a query contains both DISTINCT and GROUP BY, planner currently
always creates a separate plan node to eliminate duplicate output rows.
However, if GROUP BY is present, each output row already corresponds to
exactly one group - meaning the rows produced by GROUP BY are already
unique on their group keys. Any DISTINCT clause that covers all those
group keys therefore adds no filtering and the de-duplication step is
pure overhead.

I think only doing something just for that case is likely too narrow a scope.

In [1]/messages/by-id/CAKJS1f-uSUr9dw6j48S5g3zMS=w-vp2oCNZzfriW2yVLAXj9iw@mail.gmail.com, I wrote:

Sometime in the future, I'd like to see some sort of UniqueKeys List
in RelOptInfo which is initially populated by using looking at the
unique indexes during build_simple_rel(). The idea is that these
UniqueKeys would get propagated into RELOPT_JOINRELs when the join
condition matches a set of unique keys on the other side of the join.
e.g. If the outer side of the join had a UniqueKey matching the join
condition then we could take all of the UniqueKeys from the RelOptInfo
for the inner side of the join (and vice versa). This infrastructure
would allow us to no-op a DISTINCT when the RelOptInfo has targetlist
items which completely cover at least one UniqueKey set, or not bother
performing a sort for a GROUP BY when the GROUP BY clause is covered
by a UniqueKey.

There was some work on that by Andy Fan after I wrote a prototype for
it. I didn't have much spare bandwidth to take a serious look at that
patch at the time. However, I still think it's the best way to solve
this as it allows the relevant UniqueKeys to be propagated up each
join level and we'll know exactly what the results are unique on at
each join level. Doing this would allow many more optimisations than
just skipping redundant DISTINCT columns or operations.

One example is: SELECT t1.id,sum(t1.col) FROM t1 INNER JOIN t2 ON
t1.t2_id = t2.unique_column GROUP BY t1.id; This would be able to do
a simple Group Aggregate without sorting as there's guaranteed to be 1
row per group.

It also allows the unique join optimisation to be based on the unique
keys rather than calculating that from looking at unique indexes or
the properties of a subquery.

I believe the latest work on UniqueKeys is in [2]/messages/by-id/flat/7mlamswjp81p.fsf@e18c07352.et15sqa.

David

[1]: /messages/by-id/CAKJS1f-uSUr9dw6j48S5g3zMS=w-vp2oCNZzfriW2yVLAXj9iw@mail.gmail.com
[2]: /messages/by-id/flat/7mlamswjp81p.fsf@e18c07352.et15sqa

Andres Taylor

andres@taylor.se

12 days ago

In reply to: David Rowley (#4)

Re: Remove redundant DISTINCT when GROUP BY already guarantees uniqueness

Picking up on David's point about UniqueKeys: that direction makes sense
to me, and I agree that removing DISTINCT only in the GROUP BY case
seems too narrow by itself.

I spent some time getting hands-on with the existing patch series and
trying to understand where the harder parts are. I'd like to help move
this forward if that would be useful.

As a starting point, I rebased Antonín's 2024-04 set (the rebase of
Andy Fan's v3, plus 0002-0004) onto current master. It is in better
shape than I expected:

- The core .c files apply cleanly with a 3-way merge; I only saw two
trivial header conflicts.
- It compiles after two mechanical API-drift fixes:
get_eclass_for_sort_expr() has gained a Relids argument, and direct
access to TupleDescData.attrs[] needs to become TupleDescAttr().
- In the cases I checked, it behaves as intended: DISTINCT over a
primary key collapses to a plain scan, and

SELECT DISTINCT u.id
FROM u JOIN t ON u.t_id = t.id

drops the DISTINCT via join propagation, since the join to t's
primary key cannot duplicate rows from u. The corresponding
duplicating-join case still keeps the DISTINCT.
- The bundled uniquekey regression test passes. The only diffs I saw
against the 2024 expected output is plan/annotation churn from
intervening core work, such as self-join-elimination "Replaces:"
lines and incremental-sort costing. I did not see result-data
changes.

One useful thing that has changed since that patch was written is that
RelOptInfo.notnullattnums now exists in core. That seems to make the
patch's own notnullattrs mostly redundant. It is not a blind
substitution, since the encodings differ and the patch also augments
notnullattrs with subquery-derived non-null facts in the 0004 path, but
I think this part can probably be slimmed down by seeding from
notnullattnums while keeping it augmentable.

Before spending much more time on the multi-join scaling problem, I
looked a bit on how other optimizers deal with similar issues. The main
thing I took away is that derived-key enumeration can become expensive
surprisingly quickly:

- Calcite appears to have hit this with unique-key enumeration over
composite keys; the fix there was to cap the result and fall back
to "not known unique" on overflow.
- CockroachDB's FuncDepSet takes a different approach and avoids
enumerating all candidate keys, instead keeping a single minimal
candidate key.
- The NULL issue seems closely related to the strict-vs-lax
distinction from the FD literature. PostgreSQL already has both
ideas in nearby code: the NOT NULL / NULLS NOT DISTINCT gate in
distinct_is_redundant(), and the different requirements in
relation_has_unique_index_for().

So my current thought is to stay fairly close to the patch's existing
keys-over-EquivalenceClasses model. EC-index sets are canonical, which
is attractive because it gives order-independence across join search
without introducing a larger FD framework.

The shape I was considering is:

1. tag each key as strict or lax, derived at the base relation from
attnotnull and indnullsnotdistinct, so the same facility can serve
both index-uniqueness and DISTINCT/GROUP BY-style consumers;

2. track only "interesting" keys, for example keys relevant to
DISTINCT, GROUP BY, ORDER BY, or merge clauses, reduce those to
minimal keys, and use a hard cap with fallback to "unknown" to
avoid unbounded growth;

3. keep the representation extensible enough that a strict/lax flag
and constants, e.g. () -> A, can be added later, without trying to
build a full FD closure engine now.

A few questions before I sink more time into this:

- Is reviving Antonín's rebase a reasonable path, or is there newer
WIP I should start from instead?
- Does the keys-only + strict/lax + interesting-keys-with-cap approach
match what people had in mind, or is the preference to move toward a
fuller FD model?
- I pushed the rebased branch here, in case it is useful as a starting
point:

https://github.com/systay/postgres/tree/uk-rebase

I also have more detailed notes on the design tradeoffs, but I did not
want to make this mail longer unless that would be useful.

Show quoted text

On Fri, Jun 12, 2026 at 5:19 AM David Rowley <dgrowleyml@gmail.com> wrote:

On Mon, 8 Jun 2026 at 23:12, Ilia Evdokimov
<ilya.evdokimov@tantorlabs.com> wrote:

When a query contains both DISTINCT and GROUP BY, planner currently
always creates a separate plan node to eliminate duplicate output rows.
However, if GROUP BY is present, each output row already corresponds to
exactly one group - meaning the rows produced by GROUP BY are already
unique on their group keys. Any DISTINCT clause that covers all those
group keys therefore adds no filtering and the de-duplication step is
pure overhead.

I think only doing something just for that case is likely too narrow a scope.

In [1], I wrote:

Sometime in the future, I'd like to see some sort of UniqueKeys List
in RelOptInfo which is initially populated by using looking at the
unique indexes during build_simple_rel(). The idea is that these
UniqueKeys would get propagated into RELOPT_JOINRELs when the join
condition matches a set of unique keys on the other side of the join.
e.g. If the outer side of the join had a UniqueKey matching the join
condition then we could take all of the UniqueKeys from the RelOptInfo
for the inner side of the join (and vice versa). This infrastructure
would allow us to no-op a DISTINCT when the RelOptInfo has targetlist
items which completely cover at least one UniqueKey set, or not bother
performing a sort for a GROUP BY when the GROUP BY clause is covered
by a UniqueKey.

There was some work on that by Andy Fan after I wrote a prototype for
it. I didn't have much spare bandwidth to take a serious look at that
patch at the time. However, I still think it's the best way to solve
this as it allows the relevant UniqueKeys to be propagated up each
join level and we'll know exactly what the results are unique on at
each join level. Doing this would allow many more optimisations than
just skipping redundant DISTINCT columns or operations.

One example is: SELECT t1.id,sum(t1.col) FROM t1 INNER JOIN t2 ON
t1.t2_id = t2.unique_column GROUP BY t1.id; This would be able to do
a simple Group Aggregate without sorting as there's guaranteed to be 1
row per group.

It also allows the unique join optimisation to be based on the unique
keys rather than calculating that from looking at unique indexes or
the properties of a subquery.

I believe the latest work on UniqueKeys is in [2].

David

[1] /messages/by-id/CAKJS1f-uSUr9dw6j48S5g3zMS=w-vp2oCNZzfriW2yVLAXj9iw@mail.gmail.com
[2] /messages/by-id/flat/7mlamswjp81p.fsf@e18c07352.et15sqa

Andres Taylor

andres@taylor.se

2 days ago

In reply to: Andres Taylor (#5)

Re: Remove redundant DISTINCT when GROUP BY already guarantees uniqueness

I’ve signed up as reviewer for the CF entry, so I wanted to summarize
where I think this stands.

After reviewing v1 and the discussion, I think the optimization is
worthwhile, but the current patch needs a small correctness fix before
it can move forward.

The target-list SRF case discussed earlier in the thread is a
correctness issue: GROUP BY may produce one row, but SRF expansion
happens before DISTINCT, so removing DISTINCT can change the query
result.

At minimum, I think v1 needs a guard for that case and a regression
test. That seems like a fairly contained revision.

There is also a broader design question about whether this should
eventually be handled through planner-level uniqueness infrastructure,
as David raised earlier in the thread. I don’t think the current patch
needs to answer that entire question before v2, but it does seem
useful context for where this optimization might fit longer term.

So my review conclusion for the current CF entry is:

* v1 needs revision before it can proceed
* I’ll mark the entry Waiting on Author for now, unless others think a
different status is more appropriate
* there is a broader planner-uniqueness question here, but the
immediate issue is the target-list SRF correctness case

I already have Antonín’s UniqueKeys series rebased onto master and
passing its tests, as linked earlier in the thread. I’m happy to keep
pushing on that direction if that’s the most useful next step.

Andres Taylor

andres@taylor.se

2 days ago

In reply to: Andres Taylor (#6)

Re: Remove redundant DISTINCT when GROUP BY already guarantees uniqueness

The following review has been posted through the commitfest application:
make installcheck-world: not tested
Implements feature: tested, passed
Spec compliant: tested, failed
Documentation: not tested

Reviewed v1. The optimization looks worthwhile, but the current patch needs a correctness fix for target-list SRFs, as discussed on-list, plus a regression test. This seems like a contained revision. There is also a broader design question about whether this should eventually be handled through planner-level uniqueness infrastructure, as raised earlier on the thread. Marking Waiting on Author for now.

The new status of this patch is: Waiting on Author

Remove redundant DISTINCT when GROUP BY already guarantees uniqueness

Attachments: