
Stars
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
This repository contains LLM (large language model) interview questions asked at top companies such as Google, Nvidia, Meta, Microsoft, and other Fortune 500 companies.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Python tool for converting files and office documents to Markdown.
lovefirst02 / tix_bot
Forked from Gilg4mesh/tixcraft_bot
Max ticket-grabbing bot (maxbot) helps you quickly buy your tickets
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Enforce the output format (JSON Schema, regex, etc.) of a language model
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
AI chat and search for text, news, images and videos using the DuckDuckGo.com search engine.
Instruct-tune LLaMA on consumer hardware
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
PyTorch implementation of VALL-E (zero-shot text-to-speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
SoftVC VITS Singing Voice Conversion
Core Engine of Singing Voice Conversion & Singing Voice Clone
Best-practice TTS based on BERT and VITS, with some natural speech features from Microsoft; supports ONNX streaming output.
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Official Code for DragGAN (SIGGRAPH 2023)
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities