[v4] Improve download progress tracking (model cache registry and define which files will be loaded for pipelines)#1511

Open

nico-martin wants to merge 34 commits into main from v4-cache-handler

Conversation

@nico-martin
Collaborator

Improved Download Progress Tracking

Problem

Transformers.js couldn't reliably track total download progress because:

  • File lists weren't known before downloads started
  • File sizes were inconsistent (compressed vs uncompressed)
  • No cache awareness before initiating downloads

Solution

New Exported Functions

  • get_files(): Determines required files before downloading
  • get_model_files() / get_tokenizer_files() / get_processor_files(): Helper functions to identify files for each component
  • get_file_metadata(): Fetches file metadata using Range requests without downloading full content
    • Returns fromCache boolean to identify cached files
    • Ensures consistent uncompressed file sizes
  • is_cached(): Checks if all files from a model are already in cache
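
The Range-request trick behind get_file_metadata can be sketched as follows: request a single byte and read the file's true size out of the Content-Range response header, which reports the uncompressed total even when Content-Length reflects a compressed transfer. The parsing helper below is illustrative, not the PR's actual implementation:

```javascript
// Hypothetical helper: extract the full file size from a Content-Range
// header value, e.g. "bytes 0-0/123456" -> 123456.
function sizeFromContentRange(header) {
  const match = /\/(\d+)$/.exec(header);
  return match ? Number(match[1]) : null;
}

// Sketch of a metadata fetch, assuming a fetch-capable environment:
// a one-byte Range request exposes the true size without downloading
// the file body.
async function fetchFileSize(url) {
  const res = await fetch(url, { headers: { Range: "bytes=0-0" } });
  return sizeFromContentRange(res.headers.get("content-range") ?? "");
}

console.log(sizeFromContentRange("bytes 0-0/123456")); // 123456
```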

Enhanced Progress Tracking

  • readResponse() with expectedSize: Falls back to the metadata-derived size when the Content-Length header is missing
  • total_progress callback: Provides aggregate progress across all files
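
The aggregation behind a total_progress callback can be sketched as: track per-file loaded bytes against the expected sizes and report one overall percentage. The helper below is an illustrative sketch, not the PR's implementation:

```javascript
// Illustrative aggregator: sums loaded bytes across all expected files
// and divides by the total expected size to get one overall percentage.
function createTotalProgress(expectedSizes) {
  const loaded = new Map();
  const totalBytes = Object.values(expectedSizes).reduce((a, b) => a + b, 0);
  return {
    // Record the latest byte count for a file and return overall progress.
    update(file, loadedBytes) {
      loaded.set(file, loadedBytes);
      let sum = 0;
      for (const bytes of loaded.values()) sum += bytes;
      return (100 * sum) / totalBytes;
    },
  };
}

const tracker = createTotalProgress({ "model.onnx": 800, "tokenizer.json": 200 });
tracker.update("model.onnx", 400); // 40% of the 1000 total bytes
console.log(tracker.update("tokenizer.json", 200)); // 60
```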

Review

One thing I am not super confident about is the get_model_files function. I tried to test it with different model architectures, but maybe I missed some that load files not covered by that function. @xenova, could you smoke-test some models and list the ones that fail?

The easiest way to do that is:

import {
  get_files,
  pipeline,
} from "@huggingface/transformers";

const expectedFiles = await get_files(
  "onnx-community/gemma-3-270m-it-ONNX",
  {
    dtype: "fp32",
    device: "webgpu",
  }
);
const loadedFiles = new Set();
const pipe = await pipeline(
  "text-generation",
  "onnx-community/gemma-3-270m-it-ONNX",
  {
    dtype: "fp32",
    device: "webgpu",
    progress_callback: (e) => {
      if (e.file) loadedFiles.add(e.file);
    },
  }
);

console.log(
  "SAME FILES:",
  expectedFiles.sort().join(",") === Array.from(loadedFiles).sort().join(",")
);

@nico-martin nico-martin requested a review from xenova February 3, 2026 15:24
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@xenova xenova left a comment

Very exciting PR! 🙌 Just a quick review from scanning the PR briefly.

});
/** @typedef {keyof typeof DATA_TYPES} DataType */

export const DEFAULT_DEVICE_DTYPE = DATA_TYPES.fp32;

Currently, we do a bit of a funny thing when loading models:

  1. if wasm, and no dtype set in config, we use q8 (8-bit) model as it's pretty fast on CPU
  2. if node cpu, and no dtype set in config, we use fp32 model
  3. if webgpu, and no dtype set in config, we use fp32 model

The main reason is that many models on WASM can hit an out-of-memory issue when using fp32 on CPU in the browser. Something I think we can do is scan the models on the Hub and specify the default dtype there, especially on a per-device basis. We can check in a separate PR whether we have a good way to support per-device dtypes based on configs.
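
The three device rules above can be sketched as a small lookup. This is a hypothetical helper for illustration, not code from the PR, and it assumes the device names used in the comment:

```javascript
// Hypothetical sketch of the per-device default described above:
// q8 on wasm (fast on CPU and avoids fp32 out-of-memory in the browser),
// fp32 everywhere else; an explicit dtype in the config always wins.
function resolveDtype(device, configDtype) {
  if (configDtype) return configDtype;
  return device === "wasm" ? "q8" : "fp32";
}

console.log(resolveDtype("wasm", undefined));   // "q8"
console.log(resolveDtype("webgpu", undefined)); // "fp32"
console.log(resolveDtype("wasm", "fp16"));      // "fp16"
```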

@xenova xenova changed the base branch from v4 to main February 13, 2026 17:03
@xenova xenova self-requested a review February 18, 2026 17:08

@xenova xenova left a comment

Solid progress! Thanks 🔥

@xenova xenova changed the title V4 cache handler [v4] Improve download progress tracking (model cache registry and define which files will be loaded for pipelines) Feb 19, 2026