Issues: piskvorky/gensim

Issues list

Investigate Bolt to speed up Gensim and save memory
Labels: difficulty medium, performance, reach HIGH, wishlist
#3356 opened Jun 18, 2022 by piskvorky

FastText models .save()d from 4.0+ slower to load; gain less benefit from mmap
Labels: bug, performance
#3192 opened Jul 13, 2021 by gojomo

Using corpus_file does not speed up while the CPU utilization seems full.
Labels: performance
#3089 opened Mar 25, 2021 by Donghyun-Kang-Soc

SECURITY: bad regex pattern in 'gensim/corpora/wikicorpus.py' maybe cause 'ReDos' security problem.
Labels: difficulty easy, impact LOW, performance, reach LOW
#3029 opened Jan 19, 2021 by leveryd

Support multiple most_similar() queries in one call
Labels: feature, performance, wishlist
#2987 opened Oct 20, 2020 by gojomo

random.RandomState with different versions of numpy has vastly different performance
Labels: help wanted, impact MEDIUM, performance, reach HIGH
#2782 opened Apr 3, 2020 by alexcarterkarsus

Allow file-based *2vec training from compressed files
Labels: difficulty medium, performance, wishlist
#2159 opened Aug 23, 2018 by piskvorky

AuthorTopicModel memory issue
Labels: bug, difficulty medium, performance
#1947 opened Mar 2, 2018 by menshikh-iv

Optimize sparse * random dense matrix multiply in LsiModel
Labels: difficulty medium, feature, performance
#1888 opened Feb 8, 2018 by ghost
9 of 13 tasks

Use Bounter for approx frequency counting
Labels: difficulty medium, feature, performance
#1654 opened Oct 25, 2017 by piskvorky

lda multicore not scaling into large number of cores
Labels: bug, difficulty hard, performance
#1592 opened Sep 18, 2017 by abhishekbuyt

LDA multicore stuck after a few passes
Labels: bug, difficulty hard, performance
#1588 opened Sep 16, 2017 by abhishekbuyt

Check what's the reason to use double-precision in topic models
Labels: bug, difficulty easy, good first issue, Hacktoberfest, performance
#1576 opened Sep 8, 2017 by menshikh-iv

Improving computational time in LDASeqModel
Labels: difficulty medium, feature, good first issue, Hacktoberfest, performance
#1545 opened Aug 22, 2017 by Diego999

Virtual Core Distribution Issue Error When Importing/Assigning Very Large Word2Vec Encodings
Labels: difficulty medium, feature, performance
#1486 opened Jul 17, 2017 by jkterry1

Word Mover's Distance slow for fasttext
Labels: difficulty easy, feature, performance
#1292 opened Apr 26, 2017 by arashsa

Enabling multiprocessing in the author-topic model
Labels: difficulty hard, feature, performance
#1169 opened Feb 24, 2017 by olavurmortensen

Enable distributed computation in the author-topic model
Labels: difficulty hard, feature, performance
#1168 opened Feb 24, 2017 by olavurmortensen

Scaling up Doc2Vec using Spark
Labels: difficulty hard, feature, performance
#713 opened May 30, 2016 by ankit-cliqz

word2vec vocab building is not multi-threaded
Labels: difficulty medium, performance
#400 opened Jul 14, 2015 by khogeland

Label descriptions

bug: Issue described a bug
difficulty easy: Easy issue: required small fix
difficulty medium: Medium issue: required good gensim understanding & python skills
difficulty hard: Hard issue: required deep gensim understanding & high python/cython skills
feature: Issue described a new feature
good first issue: Issue for new contributors (not required gensim understanding + very simple)
Hacktoberfest: Issues marked for hacktoberfest
impact LOW: Low impact on affected users
impact MEDIUM: Big annoyance for affected users
performance: Issue related to performance (in HW meaning)
reach HIGH: Affects most or all Gensim users
reach LOW: Affects only niche use-case users
wishlist: Feature request