-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
Issues: piskvorky/gensim
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Investigate Bolt to speed up Gensim and save memory
difficulty medium
Medium issue: required good gensim understanding & python skills
performance
Issue related to performance (in HW meaning)
reach HIGH
Affects most or all Gensim users
wishlist
Feature request
#3356
opened Jun 18, 2022 by
piskvorky
FastText models Issue described a bug
performance
Issue related to performance (in HW meaning)
.save()
d from 4.0+ slower to load; gain less benefit from mmap
bug
#3192
opened Jul 13, 2021 by
gojomo
Using corpus_file does not speed up while the CPU utilization seems full.
performance
Issue related to performance (in HW meaning)
#3089
opened Mar 25, 2021 by
Donghyun-Kang-Soc
SECURITY: bad regex pattern in 'gensim/corpora/wikicorpus.py' maybe cause 'ReDos' security problem.
difficulty easy
Easy issue: required small fix
impact LOW
Low impact on affected users
performance
Issue related to performance (in HW meaning)
reach LOW
Affects only niche use-case users
#3029
opened Jan 19, 2021 by
leveryd
Support multiple Issue described a new feature
performance
Issue related to performance (in HW meaning)
wishlist
Feature request
most_similar()
queries in one call
feature
#2987
opened Oct 20, 2020 by
gojomo
random.RandomState with different versions of numpy has vastly different performance
help wanted
impact MEDIUM
Big annoyance for affected users
performance
Issue related to performance (in HW meaning)
reach HIGH
Affects most or all Gensim users
#2782
opened Apr 3, 2020 by
alexcarterkarsus
Allow file-based *2vec training from compressed files
difficulty medium
Medium issue: required good gensim understanding & python skills
performance
Issue related to performance (in HW meaning)
wishlist
Feature request
#2159
opened Aug 23, 2018 by
piskvorky
AuthorTopicModel memory issue
bug
Issue described a bug
difficulty medium
Medium issue: required good gensim understanding & python skills
performance
Issue related to performance (in HW meaning)
#1947
opened Mar 2, 2018 by
menshikh-iv
Optimize sparse * random dense matrix multiply in LsiModel
difficulty medium
Medium issue: required good gensim understanding & python skills
feature
Issue described a new feature
performance
Issue related to performance (in HW meaning)
#1888
opened Feb 8, 2018 by
ghost
9 of 13 tasks
Use Bounter for approx frequency counting
difficulty medium
Medium issue: required good gensim understanding & python skills
feature
Issue described a new feature
performance
Issue related to performance (in HW meaning)
#1654
opened Oct 25, 2017 by
piskvorky
lda multicore not scaling into large number of cores
bug
Issue described a bug
difficulty hard
Hard issue: required deep gensim understanding & high python/cython skills
performance
Issue related to performance (in HW meaning)
#1592
opened Sep 18, 2017 by
abhishekbuyt
LDA multicore stuck after a few passes
bug
Issue described a bug
difficulty hard
Hard issue: required deep gensim understanding & high python/cython skills
performance
Issue related to performance (in HW meaning)
#1588
opened Sep 16, 2017 by
abhishekbuyt
Check what's the reason to use double-precision in topic models
bug
Issue described a bug
difficulty easy
Easy issue: required small fix
good first issue
Issue for new contributors (not required gensim understanding + very simple)
Hacktoberfest
Issues marked for hacktoberfest
performance
Issue related to performance (in HW meaning)
#1576
opened Sep 8, 2017 by
menshikh-iv
Improving computational time in LDASeqModel
difficulty medium
Medium issue: required good gensim understanding & python skills
feature
Issue described a new feature
good first issue
Issue for new contributors (not required gensim understanding + very simple)
Hacktoberfest
Issues marked for hacktoberfest
performance
Issue related to performance (in HW meaning)
#1545
opened Aug 22, 2017 by
Diego999
Virtual Core Distribution Issue Error When Importing/Assigning Very Large Word2Vec Encodings
difficulty medium
Medium issue: required good gensim understanding & python skills
feature
Issue described a new feature
performance
Issue related to performance (in HW meaning)
#1486
opened Jul 17, 2017 by
jkterry1
Word Mover's Distance slow for fasttext
difficulty easy
Easy issue: required small fix
feature
Issue described a new feature
performance
Issue related to performance (in HW meaning)
#1292
opened Apr 26, 2017 by
arashsa
Enabling multiprocessing in the author-topic model
difficulty hard
Hard issue: required deep gensim understanding & high python/cython skills
feature
Issue described a new feature
performance
Issue related to performance (in HW meaning)
#1169
opened Feb 24, 2017 by
olavurmortensen
Enable distributed computation in the author-topic model
difficulty hard
Hard issue: required deep gensim understanding & high python/cython skills
feature
Issue described a new feature
performance
Issue related to performance (in HW meaning)
#1168
opened Feb 24, 2017 by
olavurmortensen
Scaling up Doc2Vec using Spark
difficulty hard
Hard issue: required deep gensim understanding & high python/cython skills
feature
Issue described a new feature
performance
Issue related to performance (in HW meaning)
#713
opened May 30, 2016 by
ankit-cliqz
word2vec vocab building is not multi-threaded
difficulty medium
Medium issue: required good gensim understanding & python skills
performance
Issue related to performance (in HW meaning)
#400
opened Jul 14, 2015 by
khogeland
ProTip!
Exclude everything labeled
bug
with -label:bug.