mudler (Owner) commented May 1, 2025

Description

This pull request adds a new model entry, "Fast-Math-Qwen3-14B", to the gallery/index.yaml file, together with metadata describing the model and its inference-speed improvements.

Additions to the model gallery:

  • Adds the "Fast-Math-Qwen3-14B" model to the gallery. It is an efficiency-optimized version of Qwen3-14B that offers approximately 65% faster inference on average with minimal performance loss.
  • Metadata includes:
    • URLs for accessing the model on Hugging Face.
    • A detailed description of the model's development, performance enhancements, and technical specifics.
    • File information with a SHA-256 checksum and URI for downloading the model.
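The SHA-256 checksum in the file entry allows a downloaded model file to be checked against the gallery metadata. The following is a minimal sketch of such a check in Python, not LocalAI's own implementation; the helper names `sha256_of_file` and `matches_checksum` are hypothetical and chosen only for illustration.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks.

    Chunked reads keep memory use flat even for multi-gigabyte model files.
    """
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_sha256):
    """Compare a file's digest with an expected value (hypothetical helper).

    The expected value is normalized so that surrounding whitespace and
    uppercase hex digits do not cause a false mismatch.
    """
    return sha256_of_file(path) == expected_sha256.strip().lower()
```

A caller would pass the path of the downloaded file and the `sha256` value copied from the gallery entry; a `False` result suggests a corrupted or incomplete download.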

Notes for Reviewers

Signed commits

  • Yes, I signed my commits.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
mudler merged commit 9271956 into master May 1, 2025
19 of 20 checks passed
mudler deleted the models/fast-math-qwen3-14b branch May 1, 2025 08:14

netlify bot commented May 1, 2025

Deploy Preview for localai ready!

🔨 Latest commit: d046c67
🔍 Latest deploy log: https://app.netlify.com/sites/localai/deploys/68132d4d7ae1000009e8c668
😎 Deploy Preview: https://deploy-preview-5287--localai.netlify.app
