Skip to content
View CatxFish's full-sized avatar
💭
Busy
💭
Busy

Block or report CatxFish

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

2D to 3D CAD Conversion Using VLM

Python 111 22 Updated Aug 9, 2025

Open Source Alternative to NotebookLM / Perplexity, connected to external sources such as Search Engines, Slack, Linear, Jira, ClickUp, Confluence, Notion, YouTube, GitHub, Discord and more. Join o…

Python 7,743 588 Updated Sep 13, 2025

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

JavaScript 20,499 3,226 Updated Sep 3, 2025

LightlyTrain is the first PyTorch framework to pretrain computer vision models on unlabeled data for industrial applications

Python 871 28 Updated Sep 15, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 6,967 402 Updated Sep 15, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 14,080 1,046 Updated Sep 15, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 68,741 8,679 Updated Sep 15, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 10,902 1,095 Updated Sep 1, 2025

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 16,789 2,838 Updated Sep 10, 2025
TypeScript 26,995 2,013 Updated Aug 7, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,607 104 Updated Sep 5, 2025

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,332 74 Updated Jun 26, 2025

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 21,872 2,306 Updated Sep 14, 2025

An implementation of iterative deep research using the OpenAI Agents SDK

Python 612 69 Updated Jun 2, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 21,573 2,388 Updated Sep 10, 2025

Scaling Vision Pre-Training to 4K Resolution

Python 204 10 Updated Aug 28, 2025

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,602 124 Updated Sep 12, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,208 198 Updated Sep 2, 2025

In-depth tutorials on LLMs, RAGs and real-world AI agent applications.

Jupyter Notebook 18,048 3,041 Updated Sep 12, 2025

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents

TypeScript 17,915 2,611 Updated Sep 15, 2025

Python tool for converting files and office documents to Markdown.

Python 74,518 4,136 Updated Sep 8, 2025

Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface"

Python 217 11 Updated Jun 11, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,187 93 Updated Jul 22, 2025

🪄 Create rich visualizations with AI

TypeScript 13,669 1,202 Updated Sep 16, 2025

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,212 99 Updated Sep 8, 2025

Multiview matching with deep-learning and hand-crafted local features for COLMAP and other SfM software. Supports high-resolution formats and images with rotations. Both CLI and GUI are supported.

Python 471 60 Updated Sep 12, 2025

[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching

Python 2,084 148 Updated Sep 6, 2025

[CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Python 1,435 80 Updated Jul 29, 2025

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Python 22,076 2,147 Updated Aug 13, 2025

Use your locally running AI models to assist you in your web browsing

TypeScript 7,105 637 Updated Sep 14, 2025
Next