Stars
- All languages
- Assembly
- C
- C#
- C++
- CSS
- Clojure
- CoffeeScript
- Cuda
- Dart
- Elixir
- Erlang
- F#
- Go
- HTML
- Java
- JavaScript
- Jinja
- Julia
- Jupyter Notebook
- Kotlin
- LLVM
- Lua
- MATLAB
- MDX
- Makefile
- OCaml
- Objective-C
- Objective-C++
- PHP
- PLpgSQL
- Perl
- PostScript
- Python
- Ruby
- Rust
- SCSS
- Scala
- Scheme
- Shell
- Svelte
- Swift
- SystemVerilog
- TeX
- Thrift
- TypeScript
- Vue
📄 A curated list of awesome .cursorrules files
A 4-hour coding workshop to understand how LLMs are implemented and used
High Performance ServiceMesh Data Plane Based on Programmable Kernel
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Unified management of projects with large model APIs, unified conversion to OpenAI format, calling multiple backend services, OpenAI, Anthropic, Gemini, Vertex, Cloudflare, DeepBricks, OpenRouter, …
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
face detection face recognition包含人脸检测(retinaface,yolov5face,yolov7face,yolov8face),人脸检测跟踪(ByteTracker),人脸角度计算(Face_Angle)人脸矫正(Face_Aligner),人脸识别(Arcface),口罩检测(MaskRecognitiion),年龄性别检测(Gender_age),静…
The official code of "RWKV-CLIP: A Robust Vision-Language Representation Learner"
This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
High-resolution models for human tasks.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
An open-source RAG-based tool for chatting with your documents.
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
Find the best cursor rules for your framework and language
Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema
🚀 基于大语言模型和 RAG 的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统。
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
A toolkit for making real world machine learning and data analysis applications in C++
This is the official repository of "eDifFIQA: Towards Efficient Face Image Quality Assessment based on Denoising Diffusion Probabilistic Models" accepted in IEEE TBIOM (Transactions on Biometrics, …
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
Make bilingual epub books Using AI translate