-
The University of Tokyo
- http://jeonghunbaek.net/
Stars
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models l…
An easy-to-use federated learning platform
Awesome-LLM: a curated list of Large Language Model
Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
A generative AI extension for JupyterLab
Cross-platform, customizable ML solutions for live and streaming media.
Awesome LLMs on Device: A Comprehensive Survey
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingual text perception and comprehension capabilities across nine…
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Official repo of Respond-and-Respond: data, code, and evaluation
MobiLlama : Small Language Model tailored for edge devices
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
A collection of Google research projects related to Federated Learning and Federated Analytics.
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
pix2tex: Using a ViT to convert images of equations into LaTeX code.
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Universal and Transferable Attacks on Aligned Language Models
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
EMNLP'22 | MedCLIP: Contrastive Learning from Unpaired Medical Images and Texts
This is a data generator of SRNet which is the model of paper Editing Text in the wild.