[docs] Model docs #36469
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Force-pushed from f353e9e to 8416e96
Very nice, thanks! The image should be on the Hub I think, but LGTM.

Force-pushed from a172045 to fcc9f21
Force-pushed from 9a3e0ca to 3d2a172
Force-pushed from 3d2a172 to 7c93f33
> *Vision Transformer (ViT) attains excellent results compared to state-of-the-art convolutional networks while requiring substantially fewer computational resources to train.*
>
> <img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/vit_architecture.jpg"
I noticed this PR removed the architecture image; could we please keep images in the docs?
They are stored at https://huggingface.co/datasets/huggingface/documentation-images.
cc @stevhliu
Hi! While the image is a nice extra for users I'm not sure if it adds that much more value to the doc?
In general, I think it's better to move away from the older model card format which mirrored the model papers quite a bit (abstract + architecture image) to give it less of an academic/researchy vibe and something more immediately practical for developers. Users can click on the linked paper, which contains all that info, if they really want to learn more.
I think most users are more interested in just knowing how to use a model for inference and don't care too much about the details of the architecture. I also feel like some of the images are a bit overwhelming for new users because there's a lot going on visually.
I'm not completely against the idea of removing the architecture image though and I'm open to adding it back 🙂
* initial
* fix
* fix
* update
* fix
* fixes
* quantization
* attention mask visualizer
* multimodal
* small changes
* fix code samples
🚧 WIP 🚧
Explores a new design for several of the most commonly visited model docs. It is pretty stripped down at the moment, but we can add back the essentials as needed (or not). The main idea is to only show developers what they need to know to use the model.
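To illustrate the usage-first style this redesign aims for, here is a minimal sketch (not taken from the PR itself) of the kind of inference snippet a stripped-down model doc would lead with, using the standard `pipeline` API and the `google/vit-base-patch16-224` checkpoint; the image URL is the example commonly used in the transformers docs.

```python
from transformers import pipeline

# A usage-first example: one call classifies an image,
# no architecture background required up front.
classifier = pipeline("image-classification", model="google/vit-base-patch16-224")

# Canonical example image from the transformers docs (two cats on a couch).
preds = classifier("http://images.cocodataset.org/val2017/000000039769.jpg")

# Each prediction is a dict with a "label" and a "score".
print(preds[0]["label"])
```

The point of leading with a snippet like this is that a developer can copy-paste it and have a working result before deciding whether to read further about the architecture.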