Robust Speech Recognition via Large-Scale Weak Supervision
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
FastGPT is a knowledge-based platform built on LLMs, offering a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
Chat with any character you like: ChatGLM2+SadTalker+Voice Cloning | Immersive conversations with your favorite characters: ChatGLM2 + voice cloning + video chat
InsightFace REST API for easy deployment of face recognition services with TensorRT in Docker.
ESP32S2 native USB library. Implements a few common classes, such as MIDI, CDC, HID, and DFU (update).
🔥🔥🔥🔥 (Formerly named YOLOv7; not the official one) YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
fanhuafeng / YOLOv7
Forked from DataXujing/YOLOv7
🔥🔥🔥 Official YOLOv7: train on your own dataset and run end-to-end TensorRT-accelerated inference
Arduino Shield for running Gimbal BLDC motors with FOC algorithm
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
YOLO5Face: Why Reinventing a Face Detector (ECCV Workshops 2022)
fanhuafeng / face_detection_and_recognition_yolov5
Forked from duckzhao/face_detection_and_recognition_yolov5
Builds a face detection model with yolov5 and uses a pretrained Arcface model for face feature extraction and recognition
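Pedestrian counting with yolov5 + deepsort: counts the total number of people who have appeared in the camera view, and counts pedestrians who cross a user-defined yellow line. The results are as follows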
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. At the …