Conversation

@copybara-service

reduce log

@abheesht17
Collaborator

/gemini review

@gemini-code-assist (bot) left a comment

Code Review

This pull request aims to reduce logging noise during training. It achieves this by ensuring the training mesh information is logged only once, and the JIT compilation cache size is logged only when it changes to a previously unseen value. These are good changes for improving log readability.

My review includes two points:

  1. A potential issue with the lifecycle of the newly introduced _jit_cache, which could lead to missed log messages about recompilations when the trainer is reused.
  2. A minor correction to a pytype: disable comment.

self._buffered_eval_metrics: MetricsBuffer | None = None
self.training_hooks = None
self.data_hooks = None
self._jit_cache = set()


medium

This cache is used to track JIT compilation cache sizes to avoid verbose logging. However, it appears this set is never cleared during the lifetime of the trainer instance. If the trainer is reused in a way that causes recompilations (e.g., by calling with_loss_fn or with_gen_model_input_fn), this cache will not be reset. This could lead to misleading logs, as new compilations might not be reported if their cache size has been seen before.

Consider clearing this set in the clear_jit_cache method, alongside resetting _jitted_train_step_fn and _jitted_eval_step_fn.
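The deduplicated-logging pattern the review describes can be sketched as below. This is an illustrative reconstruction, not the PR's actual code: the trainer skeleton and `_maybe_log_cache_size` helper are hypothetical; only the names `_jit_cache`, `_jitted_train_step_fn`, and `clear_jit_cache` come from the diff and comment.

```python
import logging

logger = logging.getLogger(__name__)


class Trainer:
    """Minimal sketch of the log-once-per-unseen-cache-size pattern."""

    def __init__(self):
        self._jitted_train_step_fn = None
        # Cache sizes that have already been logged.
        self._jit_cache: set[int] = set()

    def _maybe_log_cache_size(self, cache_size: int) -> None:
        # Log only when the JIT cache reaches a size we have not seen
        # before, i.e. a new compilation happened.
        if cache_size not in self._jit_cache:
            self._jit_cache.add(cache_size)
            logger.info("Compiled train_step cache size: %s", cache_size)

    def clear_jit_cache(self) -> None:
        # Reset the jitted function *and* the logging cache, so that
        # recompilations after trainer reuse are reported again -- the
        # fix this review comment suggests.
        self._jitted_train_step_fn = None
        self._jit_cache.clear()
```

Without the `self._jit_cache.clear()` call, a trainer that is reset and recompiled would silently skip log lines for any cache size it had seen before.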

"Training with mesh: %s. Compiled train_step cache size: %s",
pxla.thread_resources.env.physical_mesh,
train_step.jitted_fn._cache_size(), # pytype: disable=attribute-error,protected-access
cache_size = train_step.jitted_fn._cache_size() # pytype: disable=attribute-error


medium

The pytype: disable comment is missing protected-access. Accessing _cache_size touches a protected member, so the suppression should cover protected-access as well; adding it back keeps the static-analysis suppression accurate.

Suggested change
cache_size = train_step.jitted_fn._cache_size() # pytype: disable=attribute-error
cache_size = train_step.jitted_fn._cache_size() # pytype: disable=attribute-error,protected-access

PiperOrigin-RevId: 863438904