An experimental project designed for embedding-based text similarity search in web pages.
-
Updated
Mar 6, 2025 - JavaScript
An experimental project designed for embedding-based text similarity search in web pages.
This project extends the v2-0 Hybrid Model (statistical features + TTM embeddings) by adding a third modality: text embeddings derived from equipment master data. The three feature vectors — statistical x ∈ ℝ²⁸, TTM embedding y ∈ ℝ⁶⁴, and text embedding z ∈ ℝ¹⁰²⁴ — are concatenated into a 1,116-dimensional triplet feature h and fed into a LightGBM.
Add a description, image, and links to the multilingual-e5 topic page so that developers can more easily learn about it.
To associate your repository with the multilingual-e5 topic, visit your repo's landing page and select "manage topics."