Stars
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…
Official inference repo for FLUX.1 models
2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,思想类,数学类,人物传记书籍
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
🔊 Text-Prompted Generative Audio Model
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Foundational Models for State-of-the-Art Speech and Text Translation
The new Windows Terminal and the original Windows console host, all in the same place!
A series of large language models trained from scratch by developers @01-ai
A multi-voice TTS system trained with an emphasis on quality
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Llama2开源模型中文版-全方位测评,基于SuperCLUE的OPEN基准 | Llama2 Chinese evaluation with SuperCLUE
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Transformer related optimization, including BERT, GPT
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)