Skip to content

A lightweight TF-IDF implementation in Java 8. Text indexing for a collection of source files.

Notifications You must be signed in to change notification settings

Meghdut-Mandal/TextIndexer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Text Indexer

A CLI app for indexing Text Files with tf-idf.

Requirements

  • Java 11.0.5+
  • Maven for building

Building

Make sure maven is installed by mvn -v. Then you can build the jar by

  1. mvn compile
  2. mvn package

Features

  • Cosine Similarity algorithm by Tf-IDF
  • Multi Threaded index creation and searching
  • Full file tree scanning.

About

A lightweight TF-IDF implementation in Java 8. Text indexing for a collection of source files.

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages