Telegram-канал pg_sql - PostgreSQL: Unsorted - каталог телеграмм

pg_sql | Unsorted

Subscribe to a channel

Telegram-канал pg_sql - PostgreSQL

2830

English speaking PostgreSQL public chat. This group is for discussions on PostgreSQL-related topics and strives to provide best-effort support as well.

Subscribe to a channel

PostgreSQL

01 November 2025 11:20

The query itself is fine. It's straightforward, idiomatic and reflects the db structure.

Optimizer failed a bit, especially with condition result estimation. It could be corrected by different methods: changing (increasing samples and adding some complex sampling) statistics, manipulating planner cost settings, adding indexes....

However, the database structure failed miserably. I see no reason to do something before getting it to a sane state.

Читать полностью…

PostgreSQL

01 November 2025 11:03

Try reducing random_page_cost (the default values are still tailored for spinning rust)

Читать полностью…

PostgreSQL

01 November 2025 10:36

Query is not well .... What?

Читать полностью…

PostgreSQL

01 November 2025 02:09

Shouldn't.

Well, 10k nested loops should be comparable to seq scan actually, even somewhat worse may be. Or not worse — considering nearly all segment access in a nested loop would result in just a few comparisions on the root index page.

However, that's just 16 nested loops necessary — and it should be much faster than seq scans needed for hash join.

So the planner estimation fail does contribute a lot to the slowness.

Читать полностью…

PostgreSQL

31 October 2025 22:41

@tzirechnoy , If that table has been partitioned by ID only as Anba mentioned, so forcing use of index would make things even worst, isn't it?

Читать полностью…

PostgreSQL

31 October 2025 20:18

2. Extra table partitioning is also useless and damaging, but not that much.

The planner, however, missed badly on your filter — which is not good. Thing may be fixed a bit by some analyze factors changes or index changes.

Consider adding deleted_at and date to that index at first. Maybe partial index on deleted_at is null would also be a good idea.

Читать полностью…

PostgreSQL

31 October 2025 18:28

Not a good approach, I have to mention. Refactoring seams to be the best action to take this far.

Читать полностью…

PostgreSQL

31 October 2025 18:22

If they are equal, the planner will match partition-by-partition, and with indexes this will generally be very fast. But imagine the following tables partitioned with the following ranges:

Imagine joining t1 to t2. Even if you limit your joined IDs from 200 to 300 with a where clause (which equals to only one partition on t1), this still corresponds to two (bigger) partitions on t2.

Since pre-computing overlaps in different ranges with different partitions can be expensive the planner resorts to scanning every intersection between every partition, basically creating (p(t1)+p(t2))! joins*

(*this is inaccurate, it depend on how you design your WHERE query, but you get the gist)

Читать полностью…

PostgreSQL

31 October 2025 18:11

You mean range partitioning should be done exactly in the same way for both the tables ?

Читать полностью…

PostgreSQL

31 October 2025 18:06

You have a partitioned table, and the partition key is not being used in your filter. That's the reason why you have so many Seq Scans in the lower level. Those scans represent 34,342.741 of the total cost of 37,771.858. You may try some approaches to optimize it.

The client application may run your query in a Loop, including the partition key in the filter on each iteration. The results shall be accumulated in a result list, on client side.

Читать полностью…

PostgreSQL

31 October 2025 07:52

@unfoxo , plan for the impacted query is here
https://explain.depesz.com/s/mUPT (It's typically during the time when query gets suceeded within expected latency)

Читать полностью…

PostgreSQL

30 October 2025 13:39

Oh damn. My bad. I'm not using any AWS services. :D

Читать полностью…

PostgreSQL

30 October 2025 13:33

before we go too far. what kind of storage does your on prem use ? ssd, nfs, raid ?

Читать полностью…

PostgreSQL

30 October 2025 12:33

Thanks @unfoxo , will get that..

Читать полностью…

PostgreSQL

30 October 2025 12:29

But no other go except to run with that. Not too many users. So not worthy for even more additional compute

Читать полностью…

PostgreSQL

01 November 2025 11:08

Hi @NickBluth , apologies . Can you bit more explain how this change could avoid 10k loops and target only 16 necessary loops

Читать полностью…

PostgreSQL

01 November 2025 10:58

Is there a right approach which could be followed to have just necessary 16 nested loops?

Читать полностью…

PostgreSQL

01 November 2025 06:01

It means query is not well framed?

Читать полностью…

PostgreSQL

31 October 2025 22:42

I got your point now. Yes, I totally agree with you.

Читать полностью…

PostgreSQL

31 October 2025 20:20

You may try to explicitly disable seq scans to provoke PK scan on screenshots....
But, really, getting rid of partitions should go first.

Читать полностью…

PostgreSQL

31 October 2025 20:07

0. You partition of screenshots is a useless crap.

Just stop it. It's far beyound insanity.
(Most actual partition schemes of relational databases are crap, in fact — people naturally tend to think that working with smaller chunks of data would be easier to DBMS. No, relational databases are written to work with relatively large relations, partitions usually make the database to work worse, slower and to be harder to manage properly.

Your thousand partitions below 10k narrow values is something exceptionally bad even comparing to indusrry average, however.)

1. Show \d+ of every table somewhere. Even with that partitioning — this should fall to nested loop on pk, that would be not fast — but generally acceptable.

Читать полностью…

PostgreSQL

31 October 2025 18:24

Oh also, check this out

https://postgresqlco.nf/doc/en/param/enable_partitionwise_join/

Читать полностью…

PostgreSQL

31 October 2025 18:12

Yeah that's the reason of having 1900+ parallel seq scans on to the screenshot partition. That table has been range partitioned with id column only

Читать полностью…

PostgreSQL

31 October 2025 18:07

"screenshot_capture"."screenshot_id" = "screenshot"."id"

Make sure both screenshot_capture and screenshot are partitioned in the same exact way (same ids), even having slightly different ranges will prevent postgresql from optimizing that search

Читать полностью…

PostgreSQL

31 October 2025 08:36

Have you got index in every ID column( extra.id, screenshot.id.. ? And an index with ID and partition key in every partition table ?

Читать полностью…

PostgreSQL

30 October 2025 13:43

Noone knows yet.

Performance tuning is about thorough checking execution plans, timings, perf sometimes — generally, about problem isolation, algorithmical optimisation and numerical stress test results.

There is no room for random parameter guessing by strangers here.

Читать полностью…

PostgreSQL

30 October 2025 13:34

It's not on prem it's still a AWS vm (m6g large)

Читать полностью…

PostgreSQL

30 October 2025 12:42

I can talk only based on my own experience. Setting max_parallel_workers to equal or higher than CPU cores will allow PostgreSQL to use all available CPU when needed. A good practice may be you leave some resources to the OS, or else oom killer may be triggered and shut down your cluster.

Читать полностью…

PostgreSQL

30 October 2025 12:31

Anyway, try to drop your caches, then run the query cold and warm (aka fast and slow) using EXPLAIN ANALYZE with BUFFERS, and share both plans on https://explain.dalibo.com so we can look further into what's slowing down between the runs

Читать полностью…

PostgreSQL

30 October 2025 12:28

I assume RDS runs bare metal or containerized, while you have the extra performance drop of virtualization

Читать полностью…

Subscribe to a channel