![:dependabot: :dependabot:](https://github.githubassets.com/images/icons/emoji/dependabot.png)
-
UIT, VNU-HCM
- Ho Chi Minh City, VietNam
- in/lynguyenminh
- nguyenminhly.fb
- channel/UClRiDnX_OmJAT16VuFOoFZg
Stars
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Official inference repo for FLUX.1 models
Real-time monitor and web admin for Celery distributed task queue
🔊 Text-Prompted Generative Audio Model
Faster Whisper transcription with CTranslate2
Document to Markdown OCR library with Llama 3.2 vision
Underthesea - Vietnamese NLP Toolkit
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
AI powered speech denoising and enhancement
Question generation using state-of-the-art Natural Language Processing algorithms
This repository provides a comprehensive step-by-step guide to building AI projects using the Raspberry Pi AI Kit.
React app for inspecting, building and debugging with the Realtime API
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
An open-source RAG-based tool for chatting with your documents.
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI …
A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
Gemini-based translation API that integrates with the "Immersive Translate", 基于 Gemini 的翻译 API,可与沉浸式翻译插件集成
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Supporting code from my related video
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy