-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
217 changed files
with
12,191 additions
and
6,878 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,63 +1,2 @@ | ||
.. gensim documentation master file, created by | ||
sphinx-quickstart on Tue Mar 16 19:45:41 2010. | ||
You can adapt this file completely to your liking, but it should at least | ||
contain the root `toctree` directive. | ||
|
||
Gensim -- Topic Modelling for Humans | ||
===================================================== | ||
|
||
|
||
.. raw:: html | ||
:file: _static/tagcloud.html | ||
|
||
|
||
.. raw:: html | ||
|
||
<table class="links" style="font-size: 200%;"> | ||
<tr> | ||
<td><a href='intro.html'>What is this?</a></td> | ||
<td><a href='install.html'>Installation</a></td> | ||
</tr> | ||
<tr> | ||
<td><a href='tutorial.html'>Get started!</a></td> | ||
<td><a href='about.html'>About</a></td> | ||
</tr> | ||
</table> | ||
|
||
|
||
Quick Reference Example | ||
------------------------ | ||
|
||
>>> from gensim import corpora, models, similarities | ||
>>> | ||
>>> # Load corpus iterator from a Matrix Market file on disk. | ||
>>> corpus = corpora.MmCorpus('/path/to/corpus.mm') | ||
>>> | ||
>>> # Initialize a transformation (Latent Semantic Indexing with 200 latent dimensions). | ||
>>> lsi = models.LsiModel(corpus, num_topics=200) | ||
>>> | ||
>>> # Convert another corpus to the latent space and index it. | ||
>>> index = similarities.MatrixSimilarity(lsi[another_corpus]) | ||
>>> | ||
>>> # determine similarity of a query document against each document in the index | ||
>>> sims = index[query] | ||
|
||
|
||
.. admonition:: What's new? | ||
|
||
* 15 Sep 2012: release 0.8.6 : added the `hashing trick <http://en.wikipedia.org/wiki/Hashing-Trick>`_ to allow online changes to the vocabulary; fixed parallel lemmatization + `other minor improvements <https://github.com/piskvorky/gensim/blob/develop/CHANGELOG.txt>`_ | ||
* 22 Jul 2012: release 0.8.5 : better Wikipedia parsing, faster similarity queries, maintenance fixes | ||
* 30 Apr 2012: William Bert's `interview with me <http://williamjohnbert.com/2012/04/interview-with-radim-rehurek-creator-of-gensim/>`_ | ||
* 9 Mar 2012: release 0.8.4: new model `Hierarchical Dirichlet Process <http://radimrehurek.com/gensim/models/hdpmodel.html>`_ (full `CHANGELOG <https://github.com/piskvorky/gensim/blob/develop/CHANGELOG.txt>`_) | ||
|
||
|
||
.. toctree:: | ||
:hidden: | ||
:maxdepth: 1 | ||
|
||
intro | ||
install | ||
tutorial | ||
distributed | ||
wiki | ||
apiref | ||
:file: _static/indexcontent.html |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.