Issues: piskvorky/gensim

Issues list

Label legend:
bug: Issue described a bug
difficulty easy: Easy issue: required small fix
difficulty medium: Medium issue: required good gensim understanding & python skills
difficulty hard: Hard issue: required deep gensim understanding & high python/cython skills
documentation: Current issue related to documentation
feature: Issue described a new feature
good first issue: Issue for new contributors (not required gensim understanding + very simple)
Hacktoberfest: Issues marked for hacktoberfest
impact MEDIUM: Big annoyance for affected users
performance: Issue related to performance (in HW meaning)
reach LOW: Affects only niche use-case users
wishlist: Feature request

Update keyedvectors.py documentation
Labels: documentation, feature
#3311 opened Mar 23, 2022 by annagiabelli

Wheel support for linux aarch64
Labels: feature, impact MEDIUM, reach LOW
#2994 opened Nov 2, 2020 by odidev

Support multiple most_similar() queries in one call
Labels: feature, performance, wishlist
#2987 opened Oct 20, 2020 by gojomo

preprocessing.strip_punctuation does not handle Unicode
Labels: feature
#2962 opened Sep 28, 2020 by sciatro

Really remove the 10000-token limit in [Word2Vec, FastText, Doc2Vec]
Labels: difficulty hard, feature, impact MEDIUM, reach LOW
#2880 opened Jul 12, 2020 by gojomo

Implement position-dependent weighting to fastText
Labels: feature
#2840 opened May 14, 2020 by Witiko

Warning when batch_words > MAX_WORDS_IN_BATCH in word2vec
Labels: documentation, feature
#2801 opened Apr 21, 2020 by louisabraham

Doc2VecKeyedVectors doesn't effectively support __setitem__()/add()
Labels: bug, feature
#2683 opened Nov 21, 2019 by gojomo

potential Doc2Vec feature: reverse inference, to synthesize doc/summary words
Labels: difficulty medium, feature, good first issue, Hacktoberfest, wishlist
#2459 opened Apr 21, 2019 by gojomo

Feature proposal: model trimming
Labels: feature
#2413 opened Mar 13, 2019 by menshikh-iv

there is no log when i use word2vec by corpus_file
Labels: difficulty medium, feature
#2342 opened Jan 20, 2019 by luzhongqiu

SECURITY: api.load() recklessly downloads & runs arbitrary python code
Labels: difficulty medium, feature
#2283 opened Dec 3, 2018 by gojomo

Allow reserved IDs for token2id in Dictionary
Labels: feature
#2190 opened Sep 20, 2018 by Froskekongen

Allow asymmetrical windows for word2vec.
Labels: difficulty medium, feature
#2172 opened Sep 3, 2018 by generall

Adding Word-to-Context Prediction in Word2Vec (inverse of predict_output_word())
Labels: difficulty easy, feature
#2152 opened Aug 9, 2018 by elliottash

Add ELMo (Deep contextualized word representations)
Labels: difficulty hard, feature
#2134 opened Jul 19, 2018 by chengrufeng
2 tasks

Ability to weight context words by distance from target word for *2vec models
Labels: difficulty medium, feature
#2114 opened Jun 29, 2018 by zkurtz

Adding quadratic regularization proves to boost the performance of word2vec?
Labels: feature, wishlist
#2023 opened Apr 8, 2018 by mucun1988

Addition of entity embedding model
Labels: difficulty hard, feature
#2006 opened Mar 28, 2018 by shubham0704

LdaMulticore and OpenMP BLAS
Labels: difficulty medium, documentation, feature, Hacktoberfest
#1988 opened Mar 20, 2018 by rmalouf

Faster SVD using sampling & GPU
Labels: difficulty hard, feature, wishlist
#1965 opened Mar 8, 2018 by piskvorky

Support for OOV words using only word-vectors
Labels: difficulty medium, feature
#1953 opened Mar 5, 2018 by menshikh-iv

Make it possible for transform methods in sklearn_api to take a sparse matrix as an argument
Labels: difficulty medium, feature
#1929 opened Feb 24, 2018 by altescy

Optimize sparse * random dense matrix multiply in LsiModel
Labels: difficulty medium, feature, performance
#1888 opened Feb 8, 2018 by ghost
9 of 13 tasks

Add methods for mixture of word-vectors
Labels: difficulty medium, feature
#1879 opened Feb 7, 2018 by menshikh-iv