Docs: script to auto-generate ggml operations docs #14598

am17an · 2025-07-09T15:29:40Z

Maintaining parity between backends is desirable, and developers have to dig in the code to understand which operations/backends need implementation. But adding these ops is good for newcomers as it's straightforward to test.

This PR adds basic documentation for ops support using test-backend-ops

Also I added a github action to this PR, but I'm not sure how to test it out

tests/test-backend-ops.cpp

docs/ops/rtx3090.csv

slaren · 2025-07-09T18:03:04Z

.github/workflows/update-ops-docs.yml

You can test this in your own fork, but adding commits will be trickier than this. It may be preferable to only check if the document is out of date, and fail the job if it is not, but not actually update it. server.yml does this for the WebUI bundle file.

IMO we don't necessarily need to commit the result to the repo. Managing CI permission can be tricky.

Maybe a simple command to print this list to console is enough. Something like: test-backend-ops --support --output table which outputs markdown table directly to the console.

Some extra things to concern about:

~~I feel like the python script is not necessary. It's kinda a hack on top of existing cpp code.~~ Ok maybe unfair to say this because the python code actually takes multiple CSV from multiple cpp runs. But still, I feel like there is an easier way to do so..

Having a "checking" pipeline like @slaren suggest can lead to extra works whenever someone wants to implement a trivial kernel. For example, when adding ggml_gelu_erf, having to regenerate the CSV, then regenerate the table - that's 2 extra steps.

What about the case where we have rtx3090.csv but I only have an RTX 4060? Which file will be taken for generating the table?

Just to be clear, I do think this feature is useful, but I just want to do it without too much over-engineering

The workflow only runs if the csv files changes, so adding a new op alone won't make it fail.

3. What about the case where we have rtx3090.csv but I only have an RTX 4060? Which file will be taken for generating the table?

Yeah we should only have one for each backend. Multiple runs with different devices can be merged into a single file if necessary and to add more details in the future, but not really very important.

am17an · 2025-07-10T09:55:38Z

The action seems to run fine https://github.com/ggml-org/llama.cpp/actions/runs/16191720675/job/45708814939?pr=14598

scripts/create_ops_docs.py

* origin/master: Smoldocling support (ggml-org#14597) Docs: script to auto-generate ggml operations docs (ggml-org#14598)

am17an requested a review from slaren July 9, 2025 15:29

github-actions bot added documentation Improvements or additions to documentation script Script related testing Everything test related python python script changes devops improvements to build systems and github actions labels Jul 9, 2025

am17an force-pushed the add_docs branch from 6fb16f6 to b012fe5 Compare July 9, 2025 15:35

Docs: script to auto-generate ggml operations docs

0f644e9

am17an force-pushed the add_docs branch from b012fe5 to 0f644e9 Compare July 9, 2025 15:38

slaren reviewed Jul 9, 2025

View reviewed changes

am17an force-pushed the add_docs branch from 819dddb to 546d627 Compare July 10, 2025 09:41

Review: formatting changes + change github action

5270390

am17an force-pushed the add_docs branch from 546d627 to 5270390 Compare July 10, 2025 09:43

slaren approved these changes Jul 10, 2025

View reviewed changes

CISC reviewed Jul 10, 2025

View reviewed changes

scripts/create_ops_docs.py Outdated Show resolved Hide resolved

Use built-in types instead of typing

b8a6ff4

ggerganov approved these changes Jul 10, 2025

View reviewed changes

CISC approved these changes Jul 10, 2025

View reviewed changes

docs : add BLAS and Metal ops

5b17e72

am17an merged commit 11ee0fe into ggml-org:master Jul 10, 2025
51 checks passed

am17an deleted the add_docs branch July 10, 2025 15:29

gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Jul 10, 2025

Merge remote-tracking branch 'origin/master' into GraniteFour

d7d5b01

* origin/master: Smoldocling support (ggml-org#14597) Docs: script to auto-generate ggml operations docs (ggml-org#14598)

am17an mentioned this pull request Jul 12, 2025

Add CUDA non-contiguous Unary Ops support #14639

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Docs: script to auto-generate ggml operations docs #14598

Docs: script to auto-generate ggml operations docs #14598

Uh oh!

am17an commented Jul 9, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

slaren Jul 9, 2025 •

edited

Loading

Uh oh!

ngxson Jul 9, 2025 •

edited

Loading

Uh oh!

slaren Jul 9, 2025

Uh oh!

am17an commented Jul 10, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Docs: script to auto-generate ggml operations docs #14598

Docs: script to auto-generate ggml operations docs #14598

Uh oh!

Conversation

am17an commented Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

slaren Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ngxson Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

slaren Jul 9, 2025

Choose a reason for hiding this comment

Uh oh!

am17an commented Jul 10, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

am17an commented Jul 9, 2025 •

edited

Loading

slaren Jul 9, 2025 •

edited

Loading

ngxson Jul 9, 2025 •

edited

Loading