Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
- 
            Updated
            Apr 9, 2025 
- Python
Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Samples on how to use Azure SQL database with Azure OpenAI
A Clojure library for querying large data-sets on similarity
String similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...
Spark functions to run popular phonetic and string matching algorithms
Implementation of TextRank with the option of using pre-trained Word2Vec embeddings as the similarity metric
Sentential Semantic Similarity measurement library using BERT Embeddings for spatial distance evaluation.
It is a replication of google image search engine for finding similar images in our database using artificial intelligence. It is a project which uses cosine distance or finding the similarity which is an amazing application of cosine similarity.
Developed a book recommendation system for Amazon customers using memory and model based collaborative filtering by utilizing the description of book consumed and user interests.
This repository contains a web application that integrates with a music recommendation system, which leverages a dataset of 3,415 audio files, each lasting thirty seconds, utilising a Locality-Sensitive Hashing (LSH) implementation to determine rhythmic similarity, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.
In this repository, we have implemented the CNN based recommendation system for finding similar products.
Реализация система извлечения изображений по текстовому описанию и поиск похожих фотографий
Movie recommendation system based on popularity and also using KNN and Cosine similarity. 🎥🍿
An academic project to find the most similar image to the given input image, based on Image Processing, Cosine Similarity Model, StreamLit, written primarily in Python using Visual Studio Code and Jupyter Notebook
Given a directed social graph, have to predict missing links to recommend users.
Practical experiments on Machine Learning in Python. Processing of sentences and finding relevant ones, approximation of function with polynomials, function optimization
Locality sensitive hashing based plagiarism checker
String distances in rust
Computes Pairwise Semantic Distance Between Tokens (ngrams, words, turns) in Ordered and Unordered Text
Implementation of DTW algorithm between audio and midi files, plots of results and saving the path as JSON. A Sakoe-chiba band is the current optimization.
Add a description, image, and links to the cosine-distance topic page so that developers can more easily learn about it.
To associate your repository with the cosine-distance topic, visit your repo's landing page and select "manage topics."