- Chile
-
22:47
(UTC -03:00) - https://medium.com/@prudant
Stars
Make websites accessible for AI agents
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Positron, a next-generation data science IDE
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO and PaddlePaddle.
WPP_Whatsapp aim of exporting functions from WhatsApp Web to the python, which can be used to support the creation of any interaction, such as customer service, media sending, intelligence recognit…
Juego de Tic Tac Toe en la Terminal - V2 (Imbatible, usando Minimax)
Turbocharge your user intent classification NLU pipeline with efficient machine learning and fast embeddings for spanish language (no GPU required)
Send WhatsApp messages from the command-line 📯
This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function calling using the OpenAI protocol. It provides a way to extend t…
Large-scale LLM inference engine
An OAI compatible exllamav2 API that's both lightweight and fast
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
A high-throughput and memory-efficient inference and serving engine for LLMs
Large Language Model Text Generation Inference
A guidance language for controlling large language models.
This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking tasks using the BAAI M3 multilingual model.
A fast inference library for running LLMs locally on modern consumer-class GPUs
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.