StringAnalysis Release Notes

v0.4.2

  • Bugfixes

v0.4.1

  • New snowball stemmer

v0.4.0

  • A vector of strings is treated as equivalent to a corpus for the DTM, inverse index, and lexicon

v0.3.9

  • Lexicon, inverse index creation functions
  • ngram complexity specification support
  • Sparse/Frequent terms stripping fixes

v0.3.8

  • Tokenization fixes
  • Stemmer fixes

v0.3.7

  • Bugfix release

v0.3.6

  • Bugfix release

v0.3.5

  • Improved LSA embedding performance
  • AbstractMetadata support

v0.3.4

  • All forms of DTVs are sparse
  • DTMs, COOMs are immutable

v0.3.3

  • Performance improvements

v0.3.2

  • DTM document vectors are columns
  • Tokenizer can be specified in some methods
  • Regex-based DTVs
  • Additional documentation
  • svd fallback in LSA
  • COOM performance improvement
  • Bugfixes

v0.3.1

  • Added :count option to LSA, RP models
  • No projection hack for RP models
  • More documentation

v0.3.0

  • Added Co-occurrence matrix
  • Refined LSA, RP models
  • More embedding methods
  • Small bugfixes, improvements

v0.2.4

  • Added sparse random projections
  • Bugfixes

v0.2.3

  • Preprocessing improvements
  • Additional documentation

v0.2.2

  • LSA models can be saved/loaded
  • Small additions

v0.2.1

  • Improved LSA
  • Expanded online documentation

v0.2.0

  • Improved latent semantic analysis (LSA)
  • Online documentation with Documenter.jl

v0.1.1

  • Typing improvements
  • Added support for Vector element type in DTV iteration
  • Made AbstractDocument a parametric type
  • Extended test coverage
  • Bugfixes

v0.1.0

  • Fixed many bugs and inconsistencies
  • Added bm25 ranking, tweaked tf-idf
  • Extended tokenization and stemming methods
  • Extended pre-processing API
  • Extended document metadata
  • Extended test coverage
  • Simplified API, i.e. removed sentiment analysis and many dependencies

v0.0.0

  • Initial version, very similar to TextAnalysis, commit:8517fe2
  • Not released