Stars
We Speech Transcript based on LLM, in 300 lines of code.
Port of Funasr's Sense-voice model in C/C++
Jitsi Meet - Secure, Simple and Scalable Video Conferences that you use as a standalone app or embed in your web application.
Tracking the progress in end-to-end speech translation
Foundational Models for State-of-the-Art Speech and Text Translation
Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
Repository containing the open source code of works published at the FBK MT unit.
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
ScrollViewProxy for SwiftUI on iOS 13 and up
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)