Steer agent to HF kernels instead of pip install flash-attn by DarshanCode2005 · Pull Request #204 · huggingface/ml-intern

DarshanCode2005 · 2026-05-01T15:19:12Z

Resolves #202

The agent kept trying to pip install flash-attn in jobs, which often takes
ages to compile or fails outright on the job's CUDA/torch combo. The HF
kernels library lets you pull a prebuilt flash-attn (and friends) straight
from the Hub via attn_implementation="kernels-community/flash-attn2".

Rewrote the HARDCODED UNAVAILABLE PACKAGES bullet in system_prompt_v3.yaml
to recommend kernels first and only fall back to pip install when no Hub
kernel covers the need. Listed the common kernel ids (flash-attn2,
vllm-flash-attn3, paged-attention) so the agent doesn't have to guess.
Fixed the kernels entry in explore_hf_docs. The old description
("Lightweight execution environments and notebook-style workflows") was
describing something else entirely and would have sent the agent the wrong
way when it actually went looking.

Refs: https://huggingface.co/docs/kernels/index, https://huggingface.co/docs/trl/kernels_hub

DarshanCode2005 added 2 commits May 1, 2026 20:37

chore: update the agent system prompt

ab21f84

chore: update the tool documentation

680e489

akseljoonas merged commit 7599843 into huggingface:main May 1, 2026
1 check failed

Suvradippaul mentioned this pull request May 7, 2026

feat(reliability): expand pre-flight static checks for hf_jobs approval #238

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Steer agent to HF kernels instead of pip install flash-attn#204

Steer agent to HF kernels instead of pip install flash-attn#204
akseljoonas merged 2 commits into
huggingface:mainfrom
DarshanCode2005:feat-kernels

DarshanCode2005 commented May 1, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

DarshanCode2005 commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

DarshanCode2005 commented May 1, 2026 •

edited

Loading