Skip to content

[Proposal] Add HunYuan dense adapter (HunYuanDenseV1ForCausalLM) #1407

@jlarson4

Description

@jlarson4

Proposal

Add a TransformerBridge adapter for HunYuanDenseV1ForCausalLM (Tencent HunYuan dense family).

Motivation

Tencent ships small dense HunYuan models at 0.5B, 1.8B, 4B, and 7B, all runnable on a single consumer GPU. They fit interpretability compute budgets well, which makes the family high-leverage despite modest downloads.

Gap scan (2026-06-18): 13 models, ~114K downloads.

Pitch

The architecture is close to a standard dense decoder with GQA. Check the normalization and the dual fast/slow ("thinking") mode against the HF config while implementing.

  • Claude Code users can scaffold with /add-model-support tencent/Hunyuan-1.8B-Instruct.
  • Register at the four sites listed in contributing.md.
  • Verify smallest-first: tencent/Hunyuan-0.5B, then tencent/Hunyuan-1.8B-Instruct.

Additional context

Checklist

  • I have checked that there is no similar issue in the repo (required)

Metadata

Metadata

Labels

TransformerBridgeBug specific to the new TransformerBridge systemcomplexity-simpleSimple issues, which may be good for beginnersgood first issueGood for newcomershelp wantedExtra attention is needednew-architectureThis card involves adding a new architecture .

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions