[Bug] Star tree indexing - kahan summation fix#18462
Conversation
|
❌ Gradle check result for 60144ea: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
4595246 to
a151898
Compare
Signed-off-by: bharath-techie <bharath78910@gmail.com>
|
❌ Gradle check result for 5165fe5: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
|
❌ Gradle check result for 5165fe5: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #18462 +/- ##
=========================================
Coverage 72.66% 72.67%
- Complexity 67858 67893 +35
=========================================
Files 5521 5522 +1
Lines 312541 312574 +33
Branches 45364 45377 +13
=========================================
+ Hits 227113 227148 +35
- Misses 66903 66921 +18
+ Partials 18525 18505 -20 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
I did one round of benchmarks on nyc_taxis. Indexing has very slight to no regression. Search - Overall sum : |
asimmahmood1
left a comment
There was a problem hiding this comment.
Any thoughts about adding this minor improvement to account for values that larger than current sum? https://en.wikipedia.org/wiki/Kahan_summation_algorithm#Further_enhancements
Signed-off-by: bharath-techie <bharath78910@gmail.com>
Signed-off-by: bharath-techie <bharath78910@gmail.com>
Since we're targeting this for 3.1 and code freeze is today , will create an issue to track this enhancement. |

Description
Star tree's SumValueAggregator today stores kahan summation state which gets reused and we found an edge case where the children node's value gets set in the kahan summation that gets used in the parent nodes which results in incorrect results.
As part of this PR removing the state from the SumValueAggregator and using
CompensatedSumas theSumAggregatortype.Related Issues
#18463
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.