-
Notifications
You must be signed in to change notification settings - Fork 28.6k
[SPARK-2887] fix bug of countApproxDistinct() when have more than one partition #1812
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
cc @mateiz |
Hah, pretty crazy! Glad you caught that. |
Jenkins, add to whitelist and test this please |
Jenkins, test this please. |
QA tests have started for PR 1812. This patch merges cleanly. |
QA results for PR 1812: |
Thanks, going to merge this. |
… partition fix bug of countApproxDistinct() when have more than one partition Author: Davies Liu <davies.liu@gmail.com> Closes #1812 from davies/approx and squashes the following commits: bf757ce [Davies Liu] fix bug of countApproxDistinct() when have more than one partition (cherry picked from commit ffd1f59) Signed-off-by: Patrick Wendell <pwendell@gmail.com>
Merged into master and 1.1. @davies mind submitting a PR for 1.0 and 0.9? This doesn't merge cleanly into those ATM. |
@@ -1004,7 +1004,7 @@ abstract class RDD[T: ClassTag]( | |||
}, | |||
(h1: HyperLogLogPlus, h2: HyperLogLogPlus) => { | |||
h1.addAll(h2) | |||
h2 | |||
h1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah can't believe I missed that
… partition fix bug of countApproxDistinct() when have more than one partition Author: Davies Liu <davies.liu@gmail.com> Closes apache#1812 from davies/approx and squashes the following commits: bf757ce [Davies Liu] fix bug of countApproxDistinct() when have more than one partition
fix bug of countApproxDistinct() when have more than one partition