Implementation of a text clustering algorithm using Kmeans clustering in order to derive quick insights from unstructured text. Please check the below links for details -
- https://machinelearningblogs.com/2017/01/26/text-clustering-get-quick-insights-from-unstructured-data/
- https://machinelearningblogs.com/2017/06/23/text-clustering-get-quick-insights-unstructured-data-2/
- Install Docker
- Run
git clone https://github.com/vivekkalyanarangan30/Text-Clustering-API
- Open docker terminal and navigate to
/path/to/Text-Clustering-API
- Run
docker build -t clustering-api .
- Run
docker run -p 8180:8180 clustering-api
- Access http://192.168.99.100:8180/apidocs/index.html from your browser [assuming you are on windows and docker-machine has that IP. Otherwise just use localhost]
- Anaconda distribution of python 2.7
pip install -r requirements.txt
- Some dependencies from nltk (
nltk.download()
from python console and download averaged perceptron tagger)
- Place the script in any folder
- Open command prompt and navigate to that folder
- Type "python CLAAS.py"and hit enter
- Go over to http://localhost:8180/apidocs/index.html in your browser (preferably Chrome) and start using.