analyze use MaxUint64 ts to read data #35233

xuyifangreeneyes · 2022-06-08T09:33:14Z

Enhancement

#24575 makes analyze read data on snapshot. Combined with incremental update of modify_count and count at the end of analyze, we can get a more accurate modify_count especially when lots of updates happen during the long time analyze. However, long-time snapshot analyze can throw error GC life time is shorter than transaction duration(#29862) or block GC(#35062). Considering analyze doesn't require strong data consistency, we hope to change back to use MaxUint64 ts to read data in analyze.

The text was updated successfully, but these errors were encountered:

xhebox · 2022-06-22T10:10:19Z

But the original issue of #24575 has concerns on overestimation of modify_count, maybe add another TiDBAnalyzeVersion?

xuyifangreeneyes · 2022-06-22T11:36:59Z

#24575

Yes. We switch back to read the latest data rather than certain snapshot because analyze does't require strong data consistency and analyze on snapshot may bring some problems(auto analyze blocks gc or long-time auto analyze fails) more severe than inaccurate modify_count. When we use MaxUint64 ts to read data in analyze, there are two ways to update modify_count when analyze is finished. The first way is to update it incrementally(see #24720), which causes overestimation of modify_count. The second way is to just set it 0, which is the original implementation and causes underestimation of modify_count. Underestimation makes auto analyze fail to trigger when it should be triggered and has more risk than overestimation on cardinality/cost estimation. Thus we choose to use MaxUint64 ts in analyze and update modify_count incrementally. The doc has more details.

close #35233

xuyifangreeneyes added the type/enhancement The issue or PR belongs to an enhancement. label Jun 8, 2022

xuyifangreeneyes mentioned this issue Jun 8, 2022

executor, statistics: analyze use MaxUint64 ts to read data #35232

Merged

12 tasks

ti-chi-bot closed this as completed in #35232 Jul 20, 2022

ti-chi-bot pushed a commit that referenced this issue Jul 20, 2022

executor, statistics: analyze use MaxUint64 ts to read data (#35232)

d00b984

close #35233

VelocityLight added the affects-6.1 label Jan 10, 2023

ti-chi-bot mentioned this issue Jan 10, 2023

executor, statistics: analyze use MaxUint64 ts to read data (#35232) #40466

Closed

12 tasks

jebter added the sig/planner SIG: Planner label Jan 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

analyze use MaxUint64 ts to read data #35233

analyze use MaxUint64 ts to read data #35233

xuyifangreeneyes commented Jun 8, 2022

xhebox commented Jun 22, 2022

xuyifangreeneyes commented Jun 22, 2022 •

edited

Loading

analyze use MaxUint64 ts to read data #35233

analyze use MaxUint64 ts to read data #35233

Comments

xuyifangreeneyes commented Jun 8, 2022

Enhancement

xhebox commented Jun 22, 2022

xuyifangreeneyes commented Jun 22, 2022 • edited Loading

xuyifangreeneyes commented Jun 22, 2022 •

edited

Loading