DeepGlint
Beijing
https://kaicheng-yang0828.github.io
Starred repositories
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to t…
Scaling Sentence Embeddings with Large Language Models
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
Lightweight Python framework that provides a high-level API for creating and rendering scenes with Blender.
GLM-4 series: Open Multilingual Multimodal Chat LMs
jina-embeddings-v2-base-en is an English, monolingual embedding model supporting 8192 sequence length. It is based on a BERT architecture (JinaBERT) that supports the symmetric bidirectional varian…
The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A highly memory-efficient CLIP training scheme.
Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
[ECCV 2024] Official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” (roughly, "ModelBest's little steel cannon") focuses on achieving exceptional performance on edge devices.
A generative speech model for daily dialogue.
Codebase for Aria - an Open Multimodal Native MoE
[NeurIPS 2024] Official Implementation of CLIPAway
[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
A linear estimator on top of CLIP to predict the aesthetic quality of pictures
MoVQGAN: a model for image encoding and reconstruction
The code used to train and run inference with the ColPali architecture.
Header-only C++/Python library for fast approximate nearest neighbor search