- Hangzhou, China
- UTC +08:00
- https://www.huiyadan.com
🤖 AI
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
ChatGPT site directory: a curated list of currently available free websites for trying ChatGPT online. Updated daily by a scheduled job.
Official TensorFlow implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection" (AAAI 2022 Oral)
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Understand Human Behavior to Align True Needs
[ECCV 2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
🐥 A code review bot powered by ChatGPT
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone