Issues: piskvorky/gensim
add functions to reproduce preprocessing matching GoogleNews, GLoVe, etc pretrained word-vectors
#3485
opened Jul 19, 2023 by
gojomo
potential 'alias method' negative-sampling optimization from 'Koan' paper
#3292
opened Feb 22, 2022 by
gojomo
FastText models .save()d from 4.0+ slower to load; gain less benefit from mmap
bug
Issue described a bug
performance
Issue related to performance (in HW meaning)
#3192
opened Jul 13, 2021 by
gojomo
make remove_stopwords() behavior more consistent
difficulty easy
Easy issue: required small fix
good first issue
Issue for new contributors (not required gensim understanding + very simple)
#3171
by gojomo
was closed Aug 12, 2021
Possible unexplainable segfault after save/load cycles of KeyedVectors or Word2Vec
#3046
opened Feb 17, 2021 by
gojomo
Add convenience get_sentence_vector()-like methods for FastText, other models
difficulty easy
Easy issue: required small fix
impact MEDIUM
Big annoyance for affected users
reach MEDIUM
Affects a significant number of users
wishlist
Feature request
#3015
by gojomo
was closed Mar 22, 2022
Support multiple most_similar() queries in one call
feature
Issue described a new feature
performance
Issue related to performance (in HW meaning)
wishlist
Feature request
#2987
opened Oct 20, 2020 by
gojomo
Improve/prune docs/tutorial of TranslationMatrix functionality
bug
Issue described a bug
documentation
Current issue related to documentation
testing
Issue related with testing (code, documentation, etc)
#2977
opened Oct 8, 2020 by
gojomo
Restore/improve/streamline hooks for controlling/reusing build_vocab() steps
#2975
opened Oct 6, 2020 by
gojomo
Adopting a (narrow) backward-compatibility standard; implications for 4.0.0
#2967
opened Sep 30, 2020 by
gojomo
Ensure 2Vec classes (KeyedVectors, Word2Vec, Doc2Vec, FastText) support desired level of mmap support
#2955
opened Sep 23, 2020 by
gojomo
calculation of downsampling .sample_int after vocab-updates looks wrong
#2951
opened Sep 16, 2020 by
gojomo
Add 32/64-bit reporting to issue template
housekeeping
internal tasks and processes
#2906
by gojomo
was closed Jul 29, 2020
Really remove the 10000-token limit in [Word2Vec, FastText, Doc2Vec]
difficulty hard
Hard issue: required deep gensim understanding & high python/cython skills
feature
Issue described a new feature
impact MEDIUM
Big annoyance for affected users
reach LOW
Affects only niche use-case users
#2880
opened Jul 12, 2020 by
gojomo
Proposal: Replace word2vec-specific implementation w/ constrained subclass of FastText
housekeeping
internal tasks and processes
#2879
opened Jul 12, 2020 by
gojomo
Consider: dropping custom SaveLoad in favor of Pickle v.5 or joblib.dump?
breaks backward-compatibility
Change breaks backward compatibility
difficulty hard
Hard issue: required deep gensim understanding & high python/cython skills
housekeeping
internal tasks and processes
#2848
opened May 26, 2020 by
gojomo
CHANGELOG.md vs PyPI?
bug
Issue described a bug
documentation
Current issue related to documentation
impact MEDIUM
Big annoyance for affected users
reach LOW
Affects only niche use-case users
#2828
by gojomo
was closed May 9, 2020
Sigmoid-table behavior in FastText, etc code is fishy
fasttext
Issues related to the FastText model
#2725
opened Jan 8, 2020 by
gojomo
Proposal: drop support for Python 3.5; add Python 3.8 to builds
#2713
by gojomo
was closed Jan 28, 2020
Doc2VecKeyedVectors doesn't effectively support __setitem__()/add()
bug
Issue described a bug
feature
Issue described a new feature
#2683
opened Nov 21, 2019 by
gojomo