Superlinked Inference Engine is an open-source inference server and production cluster for embeddings, reranking, and extraction.
Updated May 25, 2026 - Python
SPLADE: sparse neural search (SIGIR21, SIGIR22)
Neural Search
An easy-to-use Python toolkit for flexibly adapting various neural ranking models to a target domain.
Fast search index for SPLADE sparse retrieval models, implemented in Python using NumPy and Numba
Lightweight wrapper for the independent implementation of SPLADE++ models for search and retrieval pipelines. Models and library created by Prithivi Da; for PRs and collaboration, check out the README.
Provides a minimal PyTorch implementation of SPLADE
Optimized RAG Retrieval with Indexing, Quantization, Hybrid Search and Caching
Local-first RAG system with hybrid (dense + BM25) and SPLADE retrieval, hierarchical conversational memory, and document-aware reasoning.
A production-grade Multi-Agent RAG (Retrieval-Augmented Generation) system designed for scalable, low-latency, and reliable AI-powered retrieval. Built with hybrid search, cross-encoder reranking, intelligent query decomposition, semantic caching, adaptive LLM routing, and ONNX-optimized inference using Qdrant, Groq, Gemini, and BGE embeddings.
Learning-to-Rank on MS MARCO Passages: candidate generation from prebuilt indexes and re-ranking for QA search
Training Data Generator for SPLADE Model Fine-tuning
Official code and camera-ready analyses for PFW Task 8 at SemEval-2026 Task 8 (MTRAGEval).
QLoRA fine-tuning pipeline: Llama 3.1 & Qwen 2.5 for structured extraction — SPLADE sparse embeddings, vLLM FP8 inference, end-to-end data pipeline
A comprehensive toolkit for training, evaluating, and deploying SPLADE models
Memsplora - An in-memory SPLADE (SParse Lexical AnD Expansion) content server with FAISS integration
RAG system for Claude Code — hybrid retrieval (semantic + SPLADE + RRF), cross-encoder reranking (Qwen3), pgvector, PDF/web indexing pipelines, subagent evaluation