Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update supported models for Liger Kernel #1875

Merged
merged 3 commits into from
Sep 1, 2024

Conversation

DocShotgun
Copy link
Contributor

@DocShotgun DocShotgun commented Aug 27, 2024

Description

Integrates the following updates from the main branch of Liger Kernel:

Mistral LCE (linkedin/Liger-Kernel#93)
Gemma LCE (linkedin/Liger-Kernel#111)
Gemma 2 without LCE (softcapping is not yet implemented for Gemma in Liger Kernel LCE forward at the moment, see: linkedin/Liger-Kernel#121 (comment))
Phi3 LCE (linkedin/Liger-Kernel#76) (linkedin/Liger-Kernel#103)

Motivation and Context

How has this been tested?

Screenshots (if appropriate)

Types of changes

Social Handles (Optional)

@@ -24,6 +24,8 @@
from liger_kernel.transformers.geglu import LigerGEGLUMLP
from liger_kernel.transformers.model.llama import lce_forward
from liger_kernel.transformers.model.qwen2 import lce_forward as qwen2_lce_forward
from liger_kernel.transformers.model.mistral import lce_forward as mistral_lce_forward
from liger_kernel.transformers.model.gemma import lce_forward as gemma_lce_forward
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please move these down into the various elif clauses below

otherwise the plugin will fail to load on older versions of the liger-kernel dependency, even if the user isn't necessarily dealing with mistral or gemma models themselves

cc #1876

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I went ahead and moved these.

DocShotgun and others added 3 commits August 30, 2024 17:51
Add Mistral LCE, Gemma LCE, Gemma 2 without LCE (softcapping is not yet implemented for Gemma in Liger Kernel LCE forward), Phi3 without LCE
@winglian
Copy link
Collaborator

Updated liger to 0.2.1 in main, so rebased to make sure everything is kosher.

@winglian winglian merged commit 15408d0 into axolotl-ai-cloud:main Sep 1, 2024
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants