- To simplify comparison, Porter's Stemmer is used.
- For the comparison itself, Cosine Similarity is used.
- The user id, the first and second text, and the status are added to the database. The texts are deleted immediately after calculating the cosine distance. DBMS - Postgres.
- The bot is deployed on free hosting Heroku, so there may be some delays in responses.
- You can test the bot here @maxon_verygood_first_bot.
P.S. The comparison of texts is carried out by words, therefore the two texts "Hi" and "Hello" will be completely different. The best work results are shown by comparing texts in Russian.