Inefficient use of index scan on 2nd column of composite index during concurrent activity

Started by Durgamahesh Manneover 1 year ago7 messagesgeneral
Jump to latest
#1Durgamahesh Manne
maheshpostgres9@gmail.com

Hi team

Second column of composite index not in use effectively with index scan
when using second column at where clause

I have composite index on (placedon,id) of test
When quering select * from test where id = '4234';
Value of id changes and during concurrent activity and cpu utilization
increased toomuch that i have observed which means query plan changed why

I could see index scan with explain for it

Is there any way to keep index scan for it during even on concurrency
rather than seperate index on second column of composite index ?

Hope everyone understand this

Regards,
Durga Mahesh

#2Durgamahesh Manne
maheshpostgres9@gmail.com
In reply to: Durgamahesh Manne (#1)
Fwd: Inefficient use of index scan on 2nd column of composite index during concurrent activity

---------- Forwarded message ---------
From: Durgamahesh Manne <maheshpostgres9@gmail.com>
Date: Mon, Oct 7, 2024 at 10:01 AM
Subject: Inefficient use of index scan on 2nd column of composite index
during concurrent activity
To: <pgsql-in-general@postgresql.org>

Hi team

Second column of composite index not in use effectively with index scan
when using second column at where clause

I have composite index on (placedon,id) of test
When quering select * from test where id = '4234';
Value of id changes and during concurrent activity and cpu utilization
increased toomuch that i have observed which means query plan changed why

I could see index scan with explain for it

Is there any way to keep index scan for it during even on concurrency
rather than seperate index on second column of composite index ?

Hope everyone understand this

Regards,
Durga Mahesh

Hi Greg

you mentioned that below

(please start a new thread in the future rather than replying to an
existing one)

You cannot query on b and use an index on (a,b) as you observed. However,
you can have two indexes:

index1(a)
index2(b)

Postgres will be able to combine those when needed in the case where your
WHERE clause needs to filter by both columns. So then you no longer need
the two-column index.

Hi Greg ,

Here not using composite index on ordinary table.
Composite index that i use on partitioned table is mandatory for use to
replicate data to target using pglogical (sorry this is not mentioned
earlier)

composite key (placedon,id)
In concurrent mode if i use id at where clause then query plan for that id
column changes

How to mitigate it rather than use seperate index for id to continue
without change in query plan (index scan) during concurrent activity

I hope you understand this

Regards,
Durga Mahesh

Cheers,
Greg

#3Greg Sabino Mullane
greg@turnstep.com
In reply to: Durgamahesh Manne (#2)
Re: Inefficient use of index scan on 2nd column of composite index during concurrent activity

On Fri, Oct 11, 2024 at 9:28 AM Durgamahesh Manne <maheshpostgres9@gmail.com>
wrote:

composite key (placedon,id)
In concurrent mode if i use id at where clause then query plan for that id
column changes

How to mitigate it rather than use seperate index for id to continue
without change in query plan (index scan) during concurrent activity

Why the focus on "concurrent mode"? Perhaps explain what you mean by that.

Speaking of explain, it might help if you show us the explain plans and how
they are not coming out how you want. Also the table definitions, but feel
free to not show columns unrelated to the problem.

Cheers,
Greg

#4Durgamahesh Manne
maheshpostgres9@gmail.com
In reply to: Greg Sabino Mullane (#3)
Re: Inefficient use of index scan on 2nd column of composite index during concurrent activity

On Fri, Oct 11, 2024 at 9:57 PM Greg Sabino Mullane <htamfids@gmail.com>
wrote:

On Fri, Oct 11, 2024 at 9:28 AM Durgamahesh Manne <
maheshpostgres9@gmail.com> wrote:

composite key (placedon,id)
In concurrent mode if i use id at where clause then query plan for that
id column changes

How to mitigate it rather than use seperate index for id to continue
without change in query plan (index scan) during concurrent activity

Why the focus on "concurrent mode"? Perhaps explain what you mean by that.

Speaking of explain, it might help if you show us the explain plans and
how they are not coming out how you want. Also the table definitions, but
feel free to not show columns unrelated to the problem.

Cheers,
Greg

Hi Greg

Thanks for your quick response

Partitioned table "test"
Column | Type | Collation | Nullable |
Default | Storage | Compression | Stats target | Description
---------------------+--------------------------+-----------+----------+---------+----------+-------------+--------------+-------------
id | bigint | | not null |
| plain | | |
externalbetid | text | | |
| extended | | |
externalsystem | text | | |
| extended | | |
placedon | timestamp with time zone | | not null |
| plain | | |
txnstep | integer | | |
| plain | | |
txnstage | text | | |
| extended | | |
txnstatus | text | | |
| extended | | |
"pmk_test" PRIMARY KEY, btree (id, placedon) REPLICA IDENTITY
if use this (id,placedon) when running select query then no issues bez
select picks up first column of composite key
select * from test where id = '4234';
Append (cost=0.14..42.14 rows=19 width=1355) (actual time=0.177..0.186
rows=1 loops=1)
-> Index Scan using test_p2023_07_id_idx on test_p2023_07 test_1
(cost=0.14..2.38 rows=1 width=1874) (actual time=0.009..0.009 rows=0
loops=1)
Index Cond: (id = '4234'::text)
-> Index Scan using test_p2023_08_id_idx on test_p2023_08 test_2
(cost=0.14..2.38 rows=1 width=1848) (actual time=0.005..0.005 rows=0
loops=1)
Index Cond: (id = '4234'::text)
Planning Time: 0.100 ms
Execution Time: 0.40 ms

if i change constraint order (placedon,id) then in this case

I could see same index scan with explain analyze for 1 call or 2 calls

Here concurrent mode means you are already aware (no of calls increases
concurrently)
Sudden cpu spike i have observed which is unusual(more than needed) when
no of calls increased concurrently on that query

Based on that info i suspected that query plan changed hence raised
question here this is what i faced with mentioned columns order related to
problem

Example for better understanding to you
in oracle
CREATE INDEX idx_orders_customer_date ON orders (customer_id, order_date);
SELECT /*+ INDEX(orders idx_orders_customer_date) */ *
FROM orders
WHERE order_date = '2024-01-01';
I am not sure how this works . this is the example gathered for you

I hope you can understand . Sorry i can't explain more than this much

Regards,
Durga Mahesh

#5Durgamahesh Manne
maheshpostgres9@gmail.com
In reply to: Durgamahesh Manne (#4)
Re: Inefficient use of index scan on 2nd column of composite index during concurrent activity

On Fri, 11 Oct, 2024, 23:33 Durgamahesh Manne, <maheshpostgres9@gmail.com>
wrote:

On Fri, Oct 11, 2024 at 9:57 PM Greg Sabino Mullane <htamfids@gmail.com>
wrote:

On Fri, Oct 11, 2024 at 9:28 AM Durgamahesh Manne <
maheshpostgres9@gmail.com> wrote:

composite key (placedon,id)
In concurrent mode if i use id at where clause then query plan for that
id column changes

How to mitigate it rather than use seperate index for id to continue
without change in query plan (index scan) during concurrent activity

Why the focus on "concurrent mode"? Perhaps explain what you mean by that.

Speaking of explain, it might help if you show us the explain plans and
how they are not coming out how you want. Also the table definitions, but
feel free to not show columns unrelated to the problem.

Cheers,
Greg

Hi Greg

Thanks for your quick response

Partitioned table "test"
Column | Type | Collation | Nullable |
Default | Storage | Compression | Stats target | Description

---------------------+--------------------------+-----------+----------+---------+----------+-------------+--------------+-------------
id | bigint | | not null |
| plain | | |
externalbetid | text | | |
| extended | | |
externalsystem | text | | |
| extended | | |
placedon | timestamp with time zone | | not null |
| plain | | |
txnstep | integer | | |
| plain | | |
txnstage | text | | |
| extended | | |
txnstatus | text | | |
| extended | | |
"pmk_test" PRIMARY KEY, btree (id, placedon) REPLICA IDENTITY
if use this (id,placedon) when running select query then no issues bez
select picks up first column of composite key
select * from test where id = '4234';
Append (cost=0.14..42.14 rows=19 width=1355) (actual time=0.177..0.186
rows=1 loops=1)
-> Index Scan using test_p2023_07_id_idx on test_p2023_07 test_1
(cost=0.14..2.38 rows=1 width=1874) (actual time=0.009..0.009 rows=0
loops=1)
Index Cond: (id = '4234'::text)
-> Index Scan using test_p2023_08_id_idx on test_p2023_08 test_2
(cost=0.14..2.38 rows=1 width=1848) (actual time=0.005..0.005 rows=0
loops=1)
Index Cond: (id = '4234'::text)
Planning Time: 0.100 ms
Execution Time: 0.40 ms

if i change constraint order (placedon,id) then in this case

I could see same index scan with explain analyze for 1 call or 2 calls

Here concurrent mode means you are already aware (no of calls increases
concurrently)
Sudden cpu spike i have observed which is unusual(more than needed) when
no of calls increased concurrently on that query

Based on that info i suspected that query plan changed hence raised
question here this is what i faced with mentioned columns order related to
problem

Example for better understanding to you
in oracle
CREATE INDEX idx_orders_customer_date ON orders (customer_id, order_date);
SELECT /*+ INDEX(orders idx_orders_customer_date) */ *
FROM orders
WHERE order_date = '2024-01-01';
I am not sure how this works . this is the example gathered for you

I hope you can understand . Sorry i can't explain more than this much

Regards,
Durga Mahesh

Hi PGDG

In oracle

Example for better understanding to you

CREATE INDEX idx_orders_customer_date ON orders (customer_id, order_date);

SELECT /*+ INDEX(orders idx_orders_customer_date) */ *
FROM orders
WHERE order_date = '2024-01-01';

I am not sure how this works in oracle . this is the example gathered for
reference

In the similar way

Do we have anything in postgres like oracle ?

Regards,
Durga Mahesh

Show quoted text
#6David Rowley
dgrowleyml@gmail.com
In reply to: Durgamahesh Manne (#2)
Re: Inefficient use of index scan on 2nd column of composite index during concurrent activity

On Sat, 12 Oct 2024 at 02:28, Durgamahesh Manne
<maheshpostgres9@gmail.com> wrote:

Second column of composite index not in use effectively with index scan when using second column at where clause

I have composite index on (placedon,id) of test
When quering select * from test where id = '4234';
Value of id changes and during concurrent activity and cpu utilization increased toomuch that i have observed which means query plan changed why

I could see index scan with explain for it

Is there any way to keep index scan for it during even on concurrency rather than seperate index on second column of composite index ?

It sounds like you might be asking about something we call "index skip
scans". Currently, PostgreSQL does not support these, however there is
work being done to add support and that might arrive in PG18.

There is some information about a possible workaround in [1]https://wiki.postgresql.org/wiki/Loose_indexscan which may
be of use to you.

David

[1]: https://wiki.postgresql.org/wiki/Loose_indexscan

#7Durgamahesh Manne
maheshpostgres9@gmail.com
In reply to: David Rowley (#6)
Re: Inefficient use of index scan on 2nd column of composite index during concurrent activity

On Tue, 15 Oct, 2024, 15:15 David Rowley, <dgrowleyml@gmail.com> wrote:

On Sat, 12 Oct 2024 at 02:28, Durgamahesh Manne
<maheshpostgres9@gmail.com> wrote:

Second column of composite index not in use effectively with index scan

when using second column at where clause

I have composite index on (placedon,id) of test
When quering select * from test where id = '4234';
Value of id changes and during concurrent activity and cpu utilization

increased toomuch that i have observed which means query plan changed why

I could see index scan with explain for it

Is there any way to keep index scan for it during even on concurrency

rather than seperate index on second column of composite index ?

It sounds like you might be asking about something we call "index skip
scans". Currently, PostgreSQL does not support these, however there is
work being done to add support and that might arrive in PG18.

There is some information about a possible workaround in [1] which may
be of use to you.

David

[1] https://wiki.postgresql.org/wiki/Loose_indexscan

Hi David

Thanks you for valuable info

Regards,
Durga Mahesh

Show quoted text