
@alay2shah

- Transformers: text generation, streaming, vision (minimal sketch below)
- vLLM: text generation, batching, vision
- llama.cpp: CLI and server API, vision
- MLX: text generation, streaming, vision (Apple Silicon note)
- Ollama: Python/curl API examples, vision

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
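For reference, a minimal sketch of the Transformers text-generation and streaming path listed above. The model ID is a placeholder for the repo this PR documents, and `device_map="auto"` assumes accelerate is installed:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer

MODEL_ID = "org/model"  # placeholder: substitute the model this PR documents

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID, dtype="bfloat16", device_map="auto"  # dtype=, not deprecated torch_dtype=
)

messages = [{"role": "user", "content": "Explain KV caching in one paragraph."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt", return_dict=True
).to(model.device)

# Stream decoded tokens to stdout as they are generated
streamer = TextStreamer(tokenizer, skip_prompt=True)
model.generate(**inputs, max_new_tokens=256, streamer=streamer)
```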
Use return_dict=True with apply_chat_template and access
input_ids properly to avoid AttributeError on shape access.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
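What this fix looks like in practice, continuing the setup above (a sketch; it assumes the AttributeError came from calling `.shape` on the object returned by `apply_chat_template`):

```python
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=True,
    return_tensors="pt", return_dict=True,
).to(model.device)

# `inputs` is a dict-like BatchEncoding, not a tensor, so `inputs.shape`
# raises AttributeError; index the tensor explicitly instead:
prompt_len = inputs["input_ids"].shape[1]

output_ids = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][prompt_len:], skip_special_tokens=True))
```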
- Use dtype="bfloat16" (not torch_dtype)
- Use batch_decode as in docs
- Keep tokenize=True as per docs

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
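A short sketch of the decode step as the docs show it, reusing `model`, `tokenizer`, and `inputs` from the snippets above:

```python
# Decode every generated sequence in the batch, as in the docs:
output_ids = model.generate(**inputs, max_new_tokens=128)
texts = tokenizer.batch_decode(output_ids, skip_special_tokens=True)
for text in texts:
    print(text)
```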
IMPORTANT: tie lm_head to input embeddings to fix missing
weights issue in transformers v5.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
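One way the tying can be expressed (a sketch, not necessarily the exact change in this commit; it assumes the checkpoint ships no standalone lm_head, so `tie_weights()` points the output head at the input embeddings):

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(MODEL_ID, dtype="bfloat16")

# Without a separate lm_head in the checkpoint, tying it to the input
# embeddings stops transformers v5 from reporting missing weights.
model.config.tie_word_embeddings = True
model.tie_weights()
```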
- Pin torch==2.9.0 to prevent errors
- Use dtype instead of torch_dtype (deprecated)
- Remove fallback note

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Use pip with --extra-index-url for CUDA support
  (--torch-backend is a uv flag, not pip)
- Fix chat() API to use message format instead of raw strings

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
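Assuming this commit refers to the vLLM example, a sketch of both fixes: installing with plain pip (since `--torch-backend` is a uv flag) and calling `chat()` with message dicts rather than raw strings. The CUDA tag in the index URL is illustrative; adjust it to your setup:

```python
# pip install vllm --extra-index-url https://download.pytorch.org/whl/cu128
from vllm import LLM, SamplingParams

llm = LLM(model=MODEL_ID)  # MODEL_ID as in the sketches above
params = SamplingParams(max_tokens=256)

# chat() takes chat messages, not raw prompt strings:
messages = [{"role": "user", "content": "Summarize the attention mechanism."}]
outputs = llm.chat(messages, params)
print(outputs[0].outputs[0].text)
```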