-
guozhiyu.github.io Public
Forked from RayeRen/acad-homepage.github.ioAcadHomepage: A Modern and Responsive Academic Personal Homepage
SCSS MIT License UpdatedFeb 1, 2025 -
glu_dass Public
[TMLR] Dependency-Aware Semi-Structured Sparsity of GLU Variants in Large Language Models
Python UpdatedJan 20, 2025 -
vatp Public
[EMNLP 2024 (main)] Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters
-
-
-
-
albert_pytorch Public
Forked from lonePatient/albert_pytorchalbert_zh对应的pytorch版本
Python UpdatedOct 8, 2019 -
xlnet Public
Forked from zihangdai/xlnetXLNet: Generalized Autoregressive Pretraining for Language Understanding
Python Apache License 2.0 UpdatedAug 27, 2019 -
examples Public
Forked from pytorch/examplesA set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 23, 2019