@roycho96 (Contributor)

Summary

Add Liger kernel support for EXAONE4 models (LG AI Research's EXAONE 4.0 series).

Changes

  • Add src/liger_kernel/transformers/model/exaone4.py with fused linear cross entropy forward
  • Add apply_liger_kernel_to_exaone4() function in monkey_patch.py (see the sketch after this list)
  • Register in __init__.py
  • Add revert_liger_kernel_to_exaone4() in test/utils.py
  • Add convergence tests in test/convergence/bf16/ and test/convergence/fp32/
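
For orientation, here is a minimal sketch of the monkey-patch pattern the new entry point follows, modeled on the existing apply_liger_kernel_to_* helpers. It is an illustration only: the Exaone4RMSNorm/Exaone4MLP names, the module path, and the keyword handling are assumptions, not the merged implementation.

from functools import partial

from liger_kernel.transformers.rms_norm import LigerRMSNorm
from liger_kernel.transformers.swiglu import LigerSwiGLUMLP
from transformers.models.exaone4 import modeling_exaone4  # module path assumed


def apply_liger_kernel_to_exaone4_sketch(rms_norm: bool = True, swiglu: bool = True) -> None:
    """Illustrative only: swap HF EXAONE4 modules for Liger fused equivalents."""
    if rms_norm:
        # in_place=False is required for EXAONE4; see the RMSNorm note below.
        modeling_exaone4.Exaone4RMSNorm = partial(LigerRMSNorm, in_place=False)
    if swiglu:
        modeling_exaone4.Exaone4MLP = LigerSwiGLUMLP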

Supported Kernels

  • RMSNorm (including QK-Norm in attention)
  • SwiGLU MLP
  • RoPE
  • Fused Linear Cross Entropy
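
A minimal usage sketch, assuming the new function is exported from liger_kernel.transformers (per the __init__.py registration above) and, like the other apply_liger_kernel_to_* helpers, is called before the model is instantiated; the checkpoint name is only an example:

import torch
from transformers import AutoModelForCausalLM

from liger_kernel.transformers import apply_liger_kernel_to_exaone4

# Patch the HF EXAONE4 modeling code with the Liger kernels listed above,
# then load the model as usual.
apply_liger_kernel_to_exaone4()

model = AutoModelForCausalLM.from_pretrained(
    "LGAI-EXAONE/EXAONE-4.0-32B",  # example EXAONE 4.0 checkpoint
    torch_dtype=torch.bfloat16,
)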

Note on in_place=False for RMSNorm

EXAONE4 requires in_place=False for RMSNorm due to its attention implementation pattern:

# EXAONE4 pattern - separate assignment
query_states = self.q_proj(hidden_states).view(...).transpose(1, 2)
query_states = self.q_norm(query_states)  # reassignment to same variable

# vs Qwen3 pattern - chained
query_states = self.q_norm(self.q_proj(hidden_states).view(...)).transpose(1, 2)

The view/transpose operations return tensors that share storage with the projection output. An in-place RMSNorm therefore overwrites values that autograd still needs for the backward pass, corrupting the autograd graph and producing NaN gradients.
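
As a PyTorch-only sketch of the general hazard (not the Liger kernel path): when the in-place write goes through ops autograd can see, the version-counter check catches it; a fused kernel writing into the same storage bypasses that check, which is consistent with the silent NaN gradients described above.

import torch

x = torch.randn(2, 4, 4, requires_grad=True)
y = torch.sigmoid(x)                     # sigmoid saves its output for backward
q = y.view(2, 2, 2, 4).transpose(1, 2)   # view/transpose share y's storage
q.mul_(2.0)                              # stand-in for an in-place normalization

# RuntimeError: a variable needed for gradient computation has been modified
# by an inplace operation -- the saved sigmoid output was clobbered via `q`.
y.sum().backward()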

Testing Done

  • Hardware Type: H100
  • run make test to ensure correctness
  • run make checkstyle to ensure code style
  • run make test-convergence to ensure convergence

@shimizust (Collaborator) left a comment:

Thanks for the contribution!

@shimizust merged commit 13e1bbe into linkedin:main on Jan 7, 2026 (3 of 7 checks passed).