
feat: Integrate MLflow for metrics and artifacts #229


Draft
linusseelinger wants to merge 3 commits into main

Conversation

@linusseelinger (Contributor) commented Jun 17, 2025

Relevant issue or PR

N/A

Description of changes

  • The tesseract engine now wraps each evaluation in an MLflow run, so users can simply call mlflow.log_metric and friends in their code (see the sketch after this list).
  • tesseract serve spins up an MLflow server, ensures all tesseracts are on the same network (previously, a shared network only existed within each individual docker compose file), and ensures all tesseracts can find the MLflow server.
  • tesseract run (and the custom docker client under the hood) now passes through environment variables and the network setting. This can be used to point a tesseract to an external MLflow server, e.g. in P4D.
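As an illustration, here is a minimal sketch of what user code inside a tesseract could look like after this change. The schema and field names are assumptions loosely based on the helloworld example and are not part of this PR:

import mlflow
from pydantic import BaseModel

class InputSchema(BaseModel):
    name: str

class OutputSchema(BaseModel):
    greeting: str

def apply(inputs: InputSchema) -> OutputSchema:
    # The engine has already opened an MLflow run around this evaluation,
    # so metrics can be logged directly, without mlflow.start_run().
    mlflow.log_metric("name_length", len(inputs.name))
    return OutputSchema(greeting=f"Hello {inputs.name}!")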
HOW TO
Option A: Serve tesseract

Serving a tesseract as usual via

tesseract serve helloworld

automatically spins up an MLflow server (shared among all served tesseracts), which you can access at http://localhost:5000 to view metrics.
After requesting an apply (or jacobian, ...) from the tesseract, a new entry will show up in the MLflow server.
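As an optional alternative to the web UI, the same metrics can be queried from the host with the standard MLflow Python client. This is only a sketch; it assumes MLflow is installed on the host, the tracking server started by tesseract serve is reachable at http://localhost:5000, and an MLflow version recent enough to support search_all_experiments:

import mlflow

mlflow.set_tracking_uri("http://localhost:5000")

# One pandas row per run; logged metrics appear as "metrics.<name>" columns.
runs = mlflow.search_runs(search_all_experiments=True)
print(runs[[c for c in runs.columns if c.startswith("metrics.")]])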

Option B: Single tesseract execution
tesseract run helloworld apply --network=host --env=MLFLOW_TRACKING_URI="http://localhost:5000" '{"inputs": {"name": "Osborne"}}'

In this case, we can't spin up an MLflow server. Instead, you'll have to point the tesseract to an existing one (e.g. one running on your host system).
If you don't specify MLFLOW_TRACKING_URI, the MLflow client falls back to logging locally within the tesseract, making the logged metrics nearly inaccessible.
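For example, assuming MLflow is installed on the host, a host-side tracking server for the command above can be started with the standard MLflow CLI:

mlflow server --host 127.0.0.1 --port 5000

With --network=host (as in the command above), the container shares the host's network stack on Linux, so http://localhost:5000 inside the tesseract reaches this server.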

TODO
  • Clean up the helloworld example (it now logs a metric in MLflow; this should probably be removed later)
  • Add documentation
  • By default, the MLflow client simply blocks if it can't reach a tracking server. We should introduce a timeout and make MLflow fall back to local logging of metrics (see the sketch after this list).
  • Investigate whether tesseracts triggered thousands of times flood the MLflow server with a noticeable performance impact; maybe ensure that if the user code doesn't log a single metric, we don't submit a run to the tracking server at all.
  • Support and document a custom MLFLOW_TRACKING_URI in tesseract serve
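As a hedged sketch of the timeout/fallback item above (names and defaults are illustrative, not part of this PR), one way to keep the MLflow client from blocking on an unreachable tracking server is a cheap reachability probe before opening the run:

import os
import socket
from urllib.parse import urlparse

import mlflow

def resolve_tracking_uri(timeout_s: float = 1.0) -> str:
    uri = os.environ.get("MLFLOW_TRACKING_URI", "http://localhost:5000")
    parsed = urlparse(uri)
    if parsed.scheme not in ("http", "https"):
        # Already a local (e.g. file-based) URI; nothing to probe.
        return uri
    try:
        # Cheap connectivity check so the MLflow client never blocks indefinitely.
        with socket.create_connection((parsed.hostname, parsed.port or 80), timeout=timeout_s):
            return uri
    except OSError:
        # Tracking server unreachable: fall back to local file-based logging.
        return "file:./mlruns"

mlflow.set_tracking_uri(resolve_tracking_uri())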

Testing done

Local testing for calling via tesseract run and via tesseract serve.


codecov bot commented Jun 17, 2025

Codecov Report

Attention: Patch coverage is 29.41176% with 12 lines in your changes missing coverage. Please review.

Project coverage is 66.82%. Comparing base (5beaafa) to head (000bbfa).
Report is 10 commits behind head on main.

Files with missing lines | Patch % | Lines
tesseract_core/sdk/docker_client.py | 0.00% | 8 Missing and 2 partials ⚠️
tesseract_core/sdk/engine.py | 71.42% | 1 Missing and 1 partial ⚠️

❗ There is a different number of reports uploaded between BASE (5beaafa) and HEAD (000bbfa).

HEAD has 16 fewer uploads than BASE (BASE 5beaafa: 24 uploads; HEAD 000bbfa: 8 uploads).
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #229      +/-   ##
==========================================
- Coverage   75.98%   66.82%   -9.16%     
==========================================
  Files          28       28              
  Lines        3110     3171      +61     
  Branches      489      510      +21     
==========================================
- Hits         2363     2119     -244     
- Misses        529      857     +328     
+ Partials      218      195      -23     

☔ View full report in Codecov by Sentry.

@joglekara

very nice!

How is this exposed for tesseract use via tesseract-jax? Or is that out of scope, and does this implementation require the use of the CLI?

@linusseelinger (Contributor, Author)

how is this exposed for tesseract use via tesseract-jax?

We log metrics to an MLflow server outside of tesseract-core, so that's where you can view metrics (added a HOW TO section above).
This should all work through tesseract-jax as well. Would be great if you could give it a try :) Rebuilding a tesseract with this branch (and adding metrics to it) should do the trick!

@joglekara

Hmm, one more thing. I run a self-hosted mlflow server that I would like to push to rather than to a new one and then figure out how to export/import. Can we read in a MLFLOW_TRACKING_URI from somewhere?

@linusseelinger (Contributor, Author)

Hmm, one more thing. I run a self-hosted mlflow server that I would like to push to rather than to a new one and then figure out how to export/import. Can we read in a MLFLOW_TRACKING_URI from somewhere?

Good point! As it stands, you can't set MLFLOW_TRACKING_URI in the docker-compose setup. I'm making a note of that, though; we should support it!
