[SPARK-6793][SPARK-5567][SPARK-8936][MLlib] Perplexity, prediction, and hyperparameter optimization in OnlineLDAOptimizer #7507
Test build #37764 has finished for PR 7507 at commit
* master: (133 commits)
  [SPARK-8536] [MLLIB] Generalize OnlineLDAOptimizer to asymmetric document-topic Dirichlet priors
  [SPARK-4366] [SQL] [Follow-up] Fix SqlParser compiling warning.
  [SPARK-9224] [MLLIB] OnlineLDA Performance Improvements
  [SPARK-9024] Unsafe HashJoin/HashOuterJoin/HashSemiJoin
  [SPARK-9165] [SQL] codegen for CreateArray, CreateStruct and CreateNamedStruct
  [SPARK-9082] [SQL] Filter using non-deterministic expressions should not be pushed down
  [SPARK-9254] [BUILD] [HOTFIX] sbt-launch-lib.bash should support HTTP/HTTPS redirection
  [SPARK-4233] [SPARK-4367] [SPARK-3947] [SPARK-3056] [SQL] Aggregation Improvement
  [SPARK-9232] [SQL] Duplicate code in JSONRelation
  [SPARK-9121] [SPARKR] Get rid of the warnings about `no visible global function definition` in SparkR
  [SPARK-9154][SQL] Rename formatString to format_string.
  [SPARK-9154] [SQL] codegen StringFormat
  [SPARK-9206] [SQL] Fix HiveContext classloading for GCS connector.
  [SPARK-8906][SQL] Move all internal data source classes into execution.datasources.
  [SPARK-8357] Fix unsafe memory leak on empty inputs in GeneratedAggregate
  Revert "[SPARK-9154] [SQL] codegen StringFormat"
  [SPARK-5989] [MLLIB] Model save/load for LDA
  [SPARK-9154] [SQL] codegen StringFormat
  [SPARK-5423] [CORE] Register a TaskCompletionListener to make sure release all resources
  [SPARK-4598] [WEBUI] Task table pagination for the Stage page
  ...
Test build #38143 has finished for PR 7507 at commit
Are these several JIRAs tied together, or would it be easy to separate out the PRs?
Can probably separate them out; both perplexity and predict depend on
On Fri, Jul 24, 2015 at 3:00 PM jkbradley notifications@github.com wrote:
Test build #38583 has finished for PR 7507 at commit
Test build #38601 has finished for PR 7507 at commit
Closing this and splitting into smaller PRs.
Adds `logPerplexity` and `topicDistribution` methods to LocalLDAModel as well as `{get,set}OptimizeAlpha` to OnlineLDAOptimizer. DRY out `variationalTopicInference` and `bound` code.
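
For readers unfamiliar with the quantity behind `logPerplexity`, here is a minimal, self-contained sketch of per-token log perplexity for a topic model. It is not the code from this PR and does not use Spark: the function name `log_perplexity` and its arguments (`docs`, `thetas`, `beta`) are hypothetical, and it assumes the per-document topic mixtures are already known, so it uses the exact word likelihood rather than a variational bound such as the one the `bound` code refers to.

```python
import math


def log_perplexity(docs, thetas, beta):
    """Per-token negative log-likelihood of a bag-of-words corpus.

    All names here are illustrative assumptions, not Spark APIs.
    docs:   list of {word_id: count} dictionaries, one per document
    thetas: list of topic mixtures, thetas[d][k] = P(topic k | doc d)
    beta:   beta[k][w] = P(word w | topic k)
    """
    total_log_lik = 0.0
    total_tokens = 0
    for doc, theta in zip(docs, thetas):
        for word, count in doc.items():
            # Marginalize over topics to get P(word | doc).
            p_word = sum(theta[k] * beta[k][word] for k in range(len(theta)))
            total_log_lik += count * math.log(p_word)
            total_tokens += count
    # Lower is better; exp() of this value is the usual perplexity.
    return -total_log_lik / total_tokens


# Two topics that are both uniform over a 4-word vocabulary.
docs = [{0: 2, 3: 1}]
thetas = [[0.3, 0.7]]
beta = [[0.25] * 4, [0.25] * 4]
print(math.exp(log_perplexity(docs, thetas, beta)))
```

With every topic uniform over four words, each token has probability 0.25 regardless of the mixture, so the printed perplexity is approximately 4, the vocabulary size.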