- Anyang, Korea
-
sglang-hip12 Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 Updated Jan 3, 2025
-
InfiniteBench-hip Public
Forked from OpenBMB/InfiniteBench
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
Python MIT License Updated Dec 28, 2024
-
EXAONE-3.5 Public
Forked from LG-AI-EXAONE/EXAONE-3.5
Official repository for EXAONE 3.5 built by LG AI Research
Other Updated Dec 10, 2024
-
loft-hip Public
Forked from google-deepmind/loft
LOFT: A 1 Million+ Token Long-Context Benchmark
Python Apache License 2.0 Updated Nov 22, 2024
-
sea-attention Public
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
-
LongBench-hip Public
Forked from THUDM/LongBench
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
-
RULER-hip Public
Forked from NVIDIA/RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
-
triton-fix-autotune Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
C++ MIT License Updated Sep 20, 2024
-
InfiniGen Public
Forked from snu-comparch/InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
-
hip-attention Public
Forked from DeepAuto-AI/hip-attention
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
Python Updated Jun 25, 2024
-
gmlwns2000.github.io Public
Forked from RayeRen/acad-homepage.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
SCSS MIT License Updated Jun 10, 2024
-
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Python MIT License Updated Jun 6, 2024
-
vllm-timber Public archive
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated May 14, 2024
-
streaming-llm-triton Public
OpenAI Triton Implementation of Streaming LLM
-
LongLM Public
Forked from datamllab/LongLM
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Python MIT License Updated Mar 13, 2024
-
pypareto-native Public
Forked from kummahiih/pypareto
Numba optimized version of `pypareto`. Sorting chains for pareto frontier extraction
-
sharkshark-4k Public
Upscale Twitch stream and restream into Twitch or RTMP or File.
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Jun 10, 2023
-
sttabt Public
[ICLR2023] Official code of Sparse Token Transformer with Attention Back-Tracking
-
latextable Public
Forked from JAEarly/latextable
A Python library that adds Latex functionality to the Texttable package.
Python MIT License Updated Aug 16, 2022
-