
Bring portraits to life!

Python 13,708 1,466 Updated Jan 1, 2025

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

318 9 Updated Jan 17, 2025

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 305 7 Updated Nov 17, 2024
Python 125 5 Updated Dec 17, 2024
MATLAB 2 1 Updated Dec 20, 2017

Nankai University College of Software, Compiler Principles assignment: a simple C compiler

C++ 44 6 Updated Dec 23, 2020

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 59,762 8,878 Updated Jan 23, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,511 65 Updated Jan 19, 2025

A PyTorch native library for large model training

Python 3,167 252 Updated Jan 23, 2025

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,590 210 Updated Dec 5, 2024

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 3,853 272 Updated Dec 28, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,163 164 Updated Jan 22, 2025

An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.

Python 3,098 270 Updated Nov 5, 2024
Python 7,184 564 Updated Jan 14, 2025

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Python 64,435 6,903 Updated Jan 23, 2025

Vector (and Scalar) Quantization, in PyTorch

Python 2,852 231 Updated Jan 10, 2025

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,765 186 Updated Nov 14, 2024

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Python 925 96 Updated Jan 2, 2025

A series of math-specific large language models of our Qwen2 series.

Python 717 74 Updated Jan 11, 2025

Ongoing research training transformer models at scale

Python 11,165 2,493 Updated Jan 22, 2025

[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models

Python 53 Updated Jul 23, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,943 2,641 Updated Jan 23, 2025

A feature-rich command-line audio/video downloader

Python 97,573 7,644 Updated Jan 23, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 5,782 484 Updated Jan 17, 2025

Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"

Python 71 2 Updated Jun 7, 2024

The official Meta Llama 3 GitHub site

Python 28,023 3,214 Updated Aug 12, 2024

OpenAI compatible API for TensorRT LLM triton backend

Rust 187 27 Updated Aug 1, 2024

LLM training in simple, raw C/CUDA

Cuda 25,110 2,868 Updated Oct 2, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,901 234 Updated Jan 20, 2025