Improved analytics for tracking usage of different fit modes #646

safaricd · 2025-11-25T17:38:56Z

Issue

Previously, we had no knowledge of the different fit_mode params that were used, meaning it was difficult to decide on the exact focus area for improving any of the fit modes.

Public API Changes

No Public API changes

How Has This Been Tested?

Unit tests and manual testing.

Checklist

The changes have been tested locally.
Documentation has been updated (if the public API or usage changes).
A entry has been added to CHANGELOG.md (if relevant for users).
The code follows the project's style guidelines.
I have considered the impact of these changes on the public API.

gemini-code-assist

Code Review

This pull request adds telemetry to track the usage of different fit_mode options in TabPFNClassifier and TabPFNRegressor. This is achieved by bumping the tabpfn-common-utils dependency and using the new set_init_params function.

The implementation is straightforward and correct. I have one suggestion to refactor the duplicated telemetry initialization logic into a helper function to improve maintainability.

Additionally, the documentation in TELEMETRY.md should be updated to reflect that fit_mode is now being collected as part of the anonymous usage data. This is important for transparency with users.

gemini-code-assist · 2025-11-25T17:40:13Z

src/tabpfn/classifier.py

This telemetry initialization logic, including the call to set_init_params, is also present in TabPFNRegressor.__init__. To improve maintainability and reduce code duplication, consider creating a new helper function in src/tabpfn/base.py that encapsulates this logic.

For example, you could create a function in base.py:

from tabpfn_common_utils.telemetry import set_init_params def initialize_telemetry_with_params(**params: Any) -> None: """Initializes telemetry and sets additional anonymous parameters.""" initialize_telemetry() if params: set_init_params(params)

Then you could replace these lines in both TabPFNClassifier and TabPFNRegressor with:

initialize_telemetry_with_params(fit_mode=self.fit_mode)

This would centralize the telemetry setup and make it easier to add more parameters in the future.

oscarkey · 2025-11-25T18:08:05Z

src/tabpfn/classifier.py

        initialize_telemetry()

+        # Only anonymously record `fit_mode` usage
+        set_init_params({"fit_mode": self.fit_mode})


Should we do the same thing as for model_path and validate that it's a known fit mode? To avoid accidentally collecting PII.
We could define FitMode = Literal["low_memory","fit_preprocessors","fit_with_cache","batched"] in inference.py, import it here and in the regressor interface, and then use typing.get_args() to check the provided one is valid?

Improved analytics for tracking usage of different fit modes

7ee9be7

safaricd requested a review from oscarkey November 25, 2025 17:38

safaricd requested a review from a team as a code owner November 25, 2025 17:38

gemini-code-assist bot reviewed Nov 25, 2025

View reviewed changes

oscarkey reviewed Nov 25, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improved analytics for tracking usage of different fit modes #646

Improved analytics for tracking usage of different fit modes #646

Uh oh!

safaricd commented Nov 25, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Nov 25, 2025

Uh oh!

oscarkey Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Improved analytics for tracking usage of different fit modes #646

Are you sure you want to change the base?

Improved analytics for tracking usage of different fit modes #646

Uh oh!

Conversation

safaricd commented Nov 25, 2025

Issue

Public API Changes

How Has This Been Tested?

Checklist

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

oscarkey Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants