kyegomez / attn_res (10 stars)
A clean, single-file PyTorch implementation of Attention Residuals (Kimi Team, MoonshotAI, 2026), integrated with Grouped Query Attention (GQA), SwiGLU feed-forward networks, and Rotary Position Embeddings (RoPE).
Topics: open-source, research, ai, ml, transformers, torch, pytorch, attention, diffusion, residuals, llms, open-implementation, attention-transformer
Updated Mar 16, 2026. Language: Python
kyegomez / Open-NAMM (6 stars)
An open-source implementation of the paper "An Evolved Universal Transformer Memory".
Topics: open-source, ai, deep-learning, ml, transformers, attention, gpt, papers, attention-mechanism, agora, attention-model, chatgpt, arxviv, open-implementation, agoralab
Updated Oct 6, 2025. Language: Python