project uder progress, may not work as expected.
prediksi is a web app, hosted using django server. It provides general purpose online text/markdown editor with context aware word autocompletion.
most of the online and offline text/markdown text editors do not provide general english autocompletion and if they provide, those are not content aware i.e. autocompetions are not very much relevant. prediksi tries to solve this problem by using users text to find the context/ topic of text and suggesting autocompletions according to the topic. for example in an article about 'Squirrel' initiating with squ we are most likely to type 'Squirrel' not 'square'.
for analyzing the context this text edtor uses tfidf. to calculate tfidf, document frequecy is used i.e. document frequency of a word is number of documents in which that word is present., this document frequency is precalculated on 3600000 wikipedia articles and stored in a file named doc_freq_36lac.txt inside app/ folder. simmilar result can be produced by using app/doc_freq.ipynb file
to set up your own text editor,
git clone https://github.com/agusmakmun/django-markdown-editorgit clone https://github.com/sandeshpatel/prediksi.gitgit clone https://github.com/MarioVilas/googlesearch.gitmv prediksi/martor.js django-markdown-editor/static/martor/js/martor.jspip install django-markdown-editorpip install googlesearchpip install numpypip install djangocd prediksipython manage.py runserver