May 2019
Intermediate to advanced
600 pages
20h 46m
English
If you have multi-column indexes (or joins), the optimizer will assume that the column values are independent of each other, which can lead to misestimation in cases where there is a correlation between the values.
If you have two dependent columns, such as state and area_code, then you can define additional statistics that will be collected when you next ANALYZE the table:
CREATE STATISTICS cust_s1 (ndistinct, dependencies) ON state, area_code FROM cust;
The execution time of ANALYZE will increase to collect the additional stats information, plus there is a small increase in query planning time, so use this sparingly when you can confirm this will make a difference. You don't need to have both distinct and dependencies in all ...
Read now
Unlock full access