Skip to content
View MaoXianXin's full-sized avatar

Block or report MaoXianXin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Make websites accessible for AI agents

Python 8,919 646 Updated Dec 28, 2024

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,489 1,734 Updated Jan 2, 2025

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,047 244 Updated Dec 4, 2024

YOLO v5 Object Detection on Triton Inference Server

Python 15 5 Updated Mar 30, 2023

Building AI agents, atomically

Python 1,659 115 Updated Dec 31, 2024

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Python 724 46 Updated Sep 12, 2024

Python scraper based on AI

Python 16,921 1,406 Updated Jan 3, 2025

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 17,887 1,687 Updated Dec 26, 2024

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Go 26,363 2,802 Updated Jan 3, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,066 1,044 Updated Jan 3, 2025

Large Language Model Text Generation Inference

Python 9,554 1,110 Updated Jan 3, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,063 453 Updated Jan 3, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 21,314 2,088 Updated Jan 3, 2025

structured outputs for llms

Python 8,754 691 Updated Jan 2, 2025

A curated list of awesome synthetic data for text location and recognition

330 63 Updated Jun 16, 2021

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 2,044 623 Updated Aug 9, 2023

A synthetic data generator for text recognition

Python 3,356 988 Updated Jul 18, 2024

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,940 479 Updated Jul 11, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,132 158 Updated Jan 2, 2025

experiments with microsoft phi3 vision language model. Image captioning, OCR, data extraction

Jupyter Notebook 7 4 Updated Jun 26, 2024

Quick exploration into fine tuning florence 2

Jupyter Notebook 285 26 Updated Sep 19, 2024

An annotated implementation of the Transformer paper.

Jupyter Notebook 5,852 1,251 Updated Apr 7, 2024

Implementation for MatMul-free LM.

Python 2,942 187 Updated Nov 5, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,969 2,305 Updated Aug 12, 2024

Run Mixtral-8x7B models in Colab or consumer desktops

Python 2,296 226 Updated Apr 8, 2024

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 1,963 170 Updated Dec 15, 2024

This project presents a RAG chat app for the Speckle Developer Documentation.

Jupyter Notebook 30 11 Updated Jun 30, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,993 910 Updated Oct 22, 2024

LLM training in simple, raw C/CUDA

Cuda 24,924 2,831 Updated Oct 2, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,270 400 Updated Aug 7, 2024
Next