Stars
🚀 Level up your GitHub profile readme with customizable cards including LOC statistics!
The official code for "Olympus: A Universal Task Router for Computer Vision Tasks"
A 15TB Collection of Physics Simulation Datasets
Large-Scale Multimodal Dataset of Astronomical Data
This repository contains the implementation for the Meesho Data Challenge, featuring robust training pipelines with validation, full-dataset training, and inference scripts for deploying state-of-t…
An open-source invisible desktop application to help you pass your technical interviews.
Python tool for converting files and office documents to Markdown.
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Official implementation of the paper "HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting"
Library to interact with the Bitcoin network. Ideal for low-level learning and experimenting.
An Industrial Grade Federated Learning Framework
Interpretability and explainability of data and machine learning models
C++ implementation of the Seam Carving Algorithm using Dynamic Programming. Available to use as a C++ binary
A state-of-the-art-level open visual language model | multimodal pre-trained model
Auto_Jobs_Applier_AI_Agent aims to ease the job hunt by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated an…
Notebooks for fine-tuning PaliGemma
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
Interaction-first method for generating demonstrations for web-agents on any website
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
😎 Awesome lists about Speech Emotion Recognition
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
LLaVA-HR: High-Resolution Large Language-Vision Assistant