Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

word2vec效果 #1

Open
cc-cb opened this issue Oct 19, 2020 · 3 comments
Open

word2vec效果 #1

cc-cb opened this issue Oct 19, 2020 · 3 comments

Comments

@cc-cb
Copy link

cc-cb commented Oct 19, 2020

使用您提供的框架训练跟spark版本,python gensim版本对比,效果差别比较大,没有找出原因所在

@DSXiangLi
Copy link
Owner

@cc-cb 是嘛?之前在item2vec上拿这套框架跑过比较大的数据,感觉效果还比较符合预期,但确实没和gensim仔细对比有。有代码和case可以看下么?我看下能不能复现下找找问题在哪里~

@cc-cb
Copy link
Author

cc-cb commented Oct 20, 2020

你训练大量数据需要多长时间,我是用分布式跑的,很快就结束了

@DSXiangLi
Copy link
Owner

@cc-cb 这个当时用的和这里给的还不太一样,参数和dataset的部分都没用这里的。印象中100万左右数据,100epochs,batch =1000跑了几个小时吧

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants