-
Notifications
You must be signed in to change notification settings - Fork 28.6k
[SPARK-14634][ML] Add BisectingKMeansSummary #12394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Test build #55820 has finished for PR 12394 at commit
|
cc @jkbradley |
This looks good. |
cc @jkbradley @mengxr Summary for BisectingKMeans is still missing now |
Test build #58702 has finished for PR 12394 at commit
|
cc @mengxr |
7b8f308
to
64ce1a7
Compare
Test build #66156 has finished for PR 12394 at commit
|
@yanboliang Could you please have a review? |
Jenkins, test this please. |
Test build #66958 has finished for PR 12394 at commit
|
LGTM, merged into master. Thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just saw this. Sorry to come late to the discussion.
This looks good, but I realized I messed up when reviewing the similar KMeansSummary for 2.0: I should have separated a Summary from a TrainingSummary. Maybe it's not important for clustering, so no need to change this PR. But in the future, for other models, let's try to separate the 2. It's really important for linear models, and it might become more important for others as well. Thanks!
@zhengruifeng @yanboliang does that sound good?
val model = copyValues(new BisectingKMeansModel(uid, parentModel).setParent(this)) | ||
val summary = new BisectingKMeansSummary( | ||
model.transform(dataset), $(predictionCol), $(featuresCol), $(k)) | ||
model.setSummary(summary) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks like a superfluous call.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, I will del this line ASAP. Thanks for your comment!
@jkbradley Sounds good to me! |
@jkbradley +1 Sounds good. |
## What changes were proposed in this pull request? As commented by jkbradley in apache#12394, `model.setSummary(summary)` is superfluous ## How was this patch tested? existing tests Author: Zheng RuiFeng <ruifengz@foxmail.com> Closes apache#15619 from zhengruifeng/del_superfluous.
## What changes were proposed in this pull request? Add BisectingKMeansSummary ## How was this patch tested? unit test Author: Zheng RuiFeng <ruifengz@foxmail.com> Closes apache#12394 from zhengruifeng/biKMSummary.
## What changes were proposed in this pull request? As commented by jkbradley in apache#12394, `model.setSummary(summary)` is superfluous ## How was this patch tested? existing tests Author: Zheng RuiFeng <ruifengz@foxmail.com> Closes apache#15619 from zhengruifeng/del_superfluous.
## What changes were proposed in this pull request? Add BisectingKMeansSummary ## How was this patch tested? unit test Author: Zheng RuiFeng <ruifengz@foxmail.com> Closes apache#12394 from zhengruifeng/biKMSummary.
## What changes were proposed in this pull request? As commented by jkbradley in apache#12394, `model.setSummary(summary)` is superfluous ## How was this patch tested? existing tests Author: Zheng RuiFeng <ruifengz@foxmail.com> Closes apache#15619 from zhengruifeng/del_superfluous.
What changes were proposed in this pull request?
Add BisectingKMeansSummary
How was this patch tested?
unit test