[Proposal] Add HunYuan dense adapter (HunYuanDenseV1ForCausalLM) #1407
Open
Labels
TransformerBridge, complexity-simple, good first issue, help wanted, new-architecture
Proposal
Add a TransformerBridge adapter for HunYuanDenseV1ForCausalLM (Tencent HunYuan dense family).
Motivation
Tencent ships small dense HunYuan models at 0.5B, 1.8B, 4B, and 7B, all runnable on a single consumer GPU. They fit interpretability compute budgets well, which makes the family high-leverage despite modest downloads.
Gap scan (2026-06-18): 13 models, ~114K downloads.
Pitch
The architecture is close to a standard dense decoder with GQA. Check the normalization and the dual fast/slow ("thinking") mode against the HF config while implementing.
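As a starting point for that check, here is a minimal sketch that summarizes the GQA layout and any normalization-related fields from a config dictionary. It assumes the config uses the usual Hugging Face key names `num_attention_heads` and `num_key_value_heads`. The helper name `summarize_attention_and_norm` and the values in `example_cfg` (including the `use_qk_norm` key) are hypothetical placeholders, not values read from a real HunYuan checkpoint.

```python
from typing import Any, Dict


def summarize_attention_and_norm(cfg: Dict[str, Any]) -> Dict[str, Any]:
    """Report the GQA layout and every norm-related field in a config dict."""
    n_heads = cfg["num_attention_heads"]
    # If the key is absent or None, fall back to plain multi-head attention.
    n_kv_heads = cfg.get("num_key_value_heads") or n_heads
    if n_heads % n_kv_heads != 0:
        raise ValueError(
            f"{n_heads} query heads are not divisible by {n_kv_heads} KV heads"
        )
    # Collect anything that mentions "norm" so it can be compared by hand
    # against the HF modeling code (eps values, extra norm flags, and so on).
    norm_fields = {k: v for k, v in cfg.items() if "norm" in k.lower()}
    return {
        "n_heads": n_heads,
        "n_kv_heads": n_kv_heads,
        "q_per_kv": n_heads // n_kv_heads,
        # GQA proper: more than one KV head, but fewer KV heads than query heads.
        "uses_gqa": 1 < n_kv_heads < n_heads,
        "norm_fields": norm_fields,
    }


# Illustrative values only; NOT taken from a real HunYuan config.
example_cfg = {
    "num_attention_heads": 16,
    "num_key_value_heads": 4,
    "rms_norm_eps": 1e-5,
    "use_qk_norm": True,  # hypothetical key, shown to exercise the norm filter
}

summary = summarize_attention_and_norm(example_cfg)
print(summary)
```

With `transformers` installed and a version that recognizes the architecture, the real dictionary could be obtained with `AutoConfig.from_pretrained("tencent/Hunyuan-0.5B").to_dict()` and passed in unchanged. The thinking-mode behaviour would not show up in this summary and still needs a manual look at the config and chat template.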
/add-model-support tencent/Hunyuan-1.8B-Instruct
tencent/Hunyuan-0.5B, then tencent/Hunyuan-1.8B-Instruct.
Additional context
hf_scraper architecture-gaps pass (2026-06-18).
Checklist