Personal homepage of the Advanced Compiler Lab

C++ 34 3 Updated Dec 27, 2024

Development repository for the Triton language and compiler

C++ 13,861 1,690 Updated Jan 1, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 327 38 Updated Dec 31, 2024
Python 7,076 551 Updated Dec 20, 2024

Heterogeneous AI Computing Virtualization Middleware

Go 1,141 235 Updated Dec 31, 2024

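AIFoundation mainly covers what happens when AI systems meet large models: the full-stack core technologies for supporting large-model training and inference at the system level, from the lowest layers to the top.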

Python 523 69 Updated Dec 26, 2024

A low-level framework for WeChat bots that can connect to large models such as Gemini, ChatGPT, ChatGLM, iFlytek Spark, and Tigerbot. WeChat Robot Hook.

C++ 4,615 818 Updated Dec 30, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 9,841 1,020 Updated Dec 25, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 6,951 638 Updated Jan 1, 2025

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

C++ 7,546 2,364 Updated Jan 1, 2025

The Memory layer for your AI apps

Python 23,669 2,189 Updated Dec 31, 2024

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 4,588 490 Updated Dec 15, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 21,216 2,076 Updated Dec 31, 2024

TensorRT Extension for Stable Diffusion Web UI

Python 1,930 150 Updated Jun 14, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 26,920 5,524 Updated Jan 1, 2025
C++ 299 30 Updated Dec 26, 2024

SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.

Java 2,575 446 Updated Dec 31, 2024

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.

Python 592 49 Updated Dec 29, 2024

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

Go 105,150 8,410 Updated Jan 1, 2025

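🔥🔥hooker is a reverse-engineering toolkit built on frida. It provides reverse engineers with unified script package management, universal scripts, automatic hook-script generation, in-memory roaming to detect activities and services, a frida version of JustTrustMe, and disabling of SSL pinning.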

JavaScript 3,816 951 Updated Dec 23, 2024

Principles of search engines

1,544 129 Updated Apr 19, 2024

Set of tools to assess and improve LLM security.

Python 2,809 464 Updated Dec 20, 2024

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

C++ 2,873 278 Updated Dec 31, 2024

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Python 3,271 254 Updated Oct 18, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,603 810 Updated Dec 27, 2024

A collection of awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,349 92 Updated Aug 20, 2024

This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available foundational models

Jupyter Notebook 667 319 Updated Dec 27, 2024

Improved file parsing for LLMs

Python 2,603 100 Updated Nov 13, 2024
Jupyter Notebook 262 63 Updated Dec 24, 2024