Skip to content
View 631068264's full-sized avatar
🤪
I may be slow to respond.
🤪
I may be slow to respond.

Block or report 631068264

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

先进编译实验室的个人主页

C++ 34 3 Updated Dec 27, 2024

Development repository for the Triton language and compiler

C++ 13,887 1,693 Updated Jan 3, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 330 39 Updated Jan 3, 2025
Python 7,085 552 Updated Dec 20, 2024

Heterogeneous AI Computing Virtualization Middleware

Go 1,156 237 Updated Jan 3, 2025

AIFoundation 主要是指AI系统遇到大模型,从底层到上层如何系统级地支持大模型训练和推理,全栈的核心技术。

Python 543 75 Updated Dec 26, 2024

微信机器人底层框架,可接入Gemini、ChatGPT、ChatGLM、讯飞星火、Tigerbot等大模型。WeChat Robot Hook.

C++ 4,640 819 Updated Dec 30, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 9,916 1,031 Updated Jan 2, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 7,032 647 Updated Jan 3, 2025

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

C++ 7,562 2,365 Updated Jan 3, 2025

The Memory layer for your AI apps

Python 23,717 2,195 Updated Jan 3, 2025

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,597 491 Updated Dec 15, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 21,318 2,088 Updated Jan 3, 2025

TensorRT Extension for Stable Diffusion Web UI

Python 1,934 150 Updated Jun 14, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 26,955 5,534 Updated Jan 4, 2025
C++ 301 30 Updated Dec 26, 2024

SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.

Java 2,587 451 Updated Jan 3, 2025

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.

Python 607 49 Updated Dec 29, 2024

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

Go 105,654 8,445 Updated Jan 4, 2025

🔥🔥hooker是一个基于frida实现的逆向工具包。为逆向开发人员提供统一化的脚本包管理方式、通杀脚本、自动化生成hook脚本、内存漫游探测activity和service、firda版JustTrustMe、disable ssl pinning

JavaScript 3,817 950 Updated Dec 23, 2024

搜索引擎原理

1,545 130 Updated Apr 19, 2024

Set of tools to assess and improve LLM security.

Python 2,813 464 Updated Dec 20, 2024

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

C++ 2,881 281 Updated Jan 3, 2025

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Python 3,271 254 Updated Oct 18, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,631 809 Updated Jan 4, 2025

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,358 92 Updated Aug 20, 2024

This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available foundational models

Jupyter Notebook 669 322 Updated Dec 27, 2024

Improved file parsing for LLM’s

Python 2,611 100 Updated Nov 13, 2024
Jupyter Notebook 262 63 Updated Dec 24, 2024
Next