Feature: Option to set the tracking URI for MLflowCallback. #29032

seanswyi · 2024-02-15T07:19:23Z

What does this PR do?

Previously, the MLflowCallback was only able to set MLflow experiments or runs. This PR adds the option to also set the tracking URI.

Fixes #28961
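
A minimal sketch of the intended usage, assuming the callback picks up `MLFLOW_TRACKING_URI` from the environment alongside the other `MLFLOW_*` variables. The helper `read_mlflow_env` is a hypothetical name that only mirrors that lookup; it is not the callback's actual code.

```python
import os


def read_mlflow_env(environ=os.environ):
    """Hypothetical helper: collect MLflow-related settings from the environment."""
    return {
        # Defaults to an empty string when the variable is absent.
        "tracking_uri": environ.get("MLFLOW_TRACKING_URI", ""),
        "experiment_name": environ.get("MLFLOW_EXPERIMENT_NAME"),
        "run_id": environ.get("MLFLOW_RUN_ID"),
    }


# Example: point runs at a local SQLite store before training starts.
settings = read_mlflow_env({"MLFLOW_TRACKING_URI": "sqlite:///my.db"})
print(settings["tracking_uri"])
```

In practice the variable would be exported in the shell or set in `os.environ` before the `Trainer` is created, in the same way as `MLFLOW_EXPERIMENT_NAME`.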

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case. Option to set the tracking URI for MLflowCallback. #28961
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

Trainer maintainers: @muellerzr @pacman100
Original author of MLflowCallback: @noise-field (c48b16b)

amyeroberts

Thanks for adding! Changes LGTM

Third-party integrations are maintained by their original contributors, rather than the transformers team. So let's get a review from @noise-field

seanswyi · 2024-02-15T20:45:02Z

Sounds good, thanks for the feedback @amyeroberts! I just made another minor change in the docstring and committed it: the default value of MLFLOW_TRACKING_URI should be an empty string rather than None.

seanswyi · 2024-02-16T05:58:04Z

Is there any way to rerun tests? The failed tests_torch seems to be a timeout-related issue and wasn't there before my most recent commit. If passing all tests is absolutely necessary regardless of whether or not my code is related to it, then I feel like a simple rerun may solve this.

Some tests failed!

============================= FAILURES SHORT STACK =============================
_ LayoutLMTokenizationTest.test_add_tokens [type (microsoft/layoutlm-base-uncased)] _


self = <huggingface_hub.utils._http.UniqueRequestIdAdapter object at 0x7f75b9f8c730>
request = <PreparedRequest [GET]>, stream = True
timeout = Timeout(connect=10, read=10, total=None), verify = True, cert = None
proxies = OrderedDict([('no', '127.0.0.1,localhost,circleci-internal-outer-build-agent')])

        try:
            resp = conn.urlopen(
                method=request.method,
                url=url,
                body=request.body,
                headers=request.headers,
                redirect=False,
                assert_same_host=False,
                preload_content=False,
                decode_content=False,
                retries=self.max_retries,
                timeout=timeout,
                chunked=chunked,
            )
        ...
        except (_SSLError, _HTTPError) as e:
            if isinstance(e, _SSLError):
                # This branch is for urllib3 versions earlier than v1.22
                raise SSLError(e, request=request)
            elif isinstance(e, ReadTimeoutError):
>               raise ReadTimeout(e, request=request)
E               requests.exceptions.ReadTimeout: (ReadTimeoutError("HTTPSConnectionPool(host='huggingface.co', port=443): Read timed out. (read timeout=10)"), '(Request ID: e8e1debd-e8f7-45eb-b23d-fa63400a0c0b)')

../.pyenv/versions/3.8.12/lib/python3.8/site-packages/requests/adapters.py:532: ReadTimeout




Exited with code exit status 255

amyeroberts · 2024-02-16T11:52:56Z

@seanswyi I can re-run tests for you. I've just set the tests_torch run off again.

muellerzr

Thanks for the quick turnaround!

HuggingFaceDocBuilderDev · 2024-02-16T13:23:13Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

amyeroberts · 2024-02-16T14:47:15Z

@seanswyi As everything is passing and both @muellerzr and I approve, let's just merge and if @noise-field has any comments we can do a follow-up PR.

Thanks again for adding!

harupy · 2024-02-19T02:38:47Z

I'm an MLflow maintainer. I found this PR while investigating https://github.com/mlflow-automation/mlflow/actions/runs/7949278234/job/21702921711#step:15:2462. What happens if a user runs code like this?

import mlflow
import os

assert "MLFLOW_TRACKING_URI" not in os.environ
mlflow.set_tracking_uri("sqlite:///my.db")

# train
...

# Since MLFLOW_TRACKING_URI is not set, MLflowCallback sets the tracking URI to an empty string,
# overriding "sqlite:///my.db", which is undesired in this case.
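
The concern can be reproduced without an MLflow installation. The sketch below uses a hypothetical stand-in, `FakeMlflow`, that only records the last URI passed to `set_tracking_uri`; `callback_setup` is an assumed simplification of what the merged callback does with the environment variable.

```python
class FakeMlflow:
    """Hypothetical stand-in for the mlflow module; records the tracking URI."""

    def __init__(self):
        self.tracking_uri = None

    def set_tracking_uri(self, uri):
        self.tracking_uri = uri


def callback_setup(ml_flow, environ):
    # Simplified view of the merged behaviour: fall back to "" and always set it.
    tracking_uri = environ.get("MLFLOW_TRACKING_URI", "")
    ml_flow.set_tracking_uri(tracking_uri)


fake = FakeMlflow()
fake.set_tracking_uri("sqlite:///my.db")  # user code, no environment variable
callback_setup(fake, environ={})          # callback runs at the start of training
print(repr(fake.tracking_uri))            # the user's URI has been replaced by ''
```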

seanswyi · 2024-02-19T03:06:55Z

@harupy Could you elaborate a bit on why you think that would be better? The reason I wrote the initial PR the way I did is because to me it made sense to allow the user to keep MLFLOW_TRACKING_URI as an environment variable along with the other MLflow-related environment variables such as MLFLOW_RUN_NAME. I also left the default value as an empty string because that seemed to be consistent with MLflow's documentation as well (https://mlflow.org/docs/latest/python_api/mlflow.html#mlflow.set_tracking_uri). I feel like the snippet you provided takes away that control from the user, as it actually seems to be enforcing the user keep the tracking URI to a hardcoded value of "sqlite:///my.db".

Please let me know if I'm misunderstanding or missing something. I haven't been able to fully look into the failed tests you provided but it seems like there's an issue between this PR and the autologging module?

harupy · 2024-02-19T03:17:49Z

@seanswyi

> I feel like the snippet you provided takes away that control from the user, as it actually seems to be enforcing the user keep the tracking URI to a hardcoded value of "sqlite:///my.db".

True, but there may be users who use the code like my snippet (we do something similar in our tests, no MLFLOW_TRACKING_URI environment variable, just call mlflow.set_tracking_uri with a temporary sqlite file). Can we call set_tracking_uri only when MLFLOW_TRACKING_URI exists?

seanswyi · 2024-02-19T03:34:16Z

@harupy If you believe that only calling the function when the environment variable exists is better, then how about changing the current code to something like this?

if "MLFLOW_TRACKING_URI" in os.environ:
    self._tracking_uri = os.environ["MLFLOW_TRACKING_URI"]
    logger.debug(f"MLflow tracking URI is set to {self._tracking_uri}")
    self._ml_flow.set_tracking_uri(self._tracking_uri)
else:
    logger.debug("Environment variable `MLFLOW_TRACKING_URI` is not provided and therefore will not be explicitly set.")
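
To check that a guarded call of this kind leaves a URI set in user code untouched, the same idea can be run against a hypothetical stand-in for the `mlflow` module. `FakeMlflow` and `guarded_setup` are illustrative names, not part of either library, and the `http://localhost:5000` address is just an example value.

```python
class FakeMlflow:
    """Hypothetical stand-in for the mlflow module; records the tracking URI."""

    def __init__(self):
        self.tracking_uri = None

    def set_tracking_uri(self, uri):
        self.tracking_uri = uri


def guarded_setup(ml_flow, environ):
    """Call set_tracking_uri only when MLFLOW_TRACKING_URI is present."""
    if "MLFLOW_TRACKING_URI" in environ:
        ml_flow.set_tracking_uri(environ["MLFLOW_TRACKING_URI"])


# Case 1: the user configured MLflow in code and exported no variable.
user_set = FakeMlflow()
user_set.set_tracking_uri("sqlite:///my.db")
guarded_setup(user_set, environ={})

# Case 2: the user relies on the environment variable.
from_env = FakeMlflow()
guarded_setup(from_env, environ={"MLFLOW_TRACKING_URI": "http://localhost:5000"})

print(user_set.tracking_uri, from_env.tracking_uri)
```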

Just curious, is there any reason why a temporary SQLite DB is used? It seems like the documentation would suggest that setting a tracking URI isn't necessary and that the data will just be stored locally at ./mlruns.

harupy · 2024-02-19T03:37:29Z

Yeah that looks good to me.

> is there any reason why a temporary SQLite DB is used?

To clean up runs and experiments after each test invocation.
To run tests faster.
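
A test setup along these lines might look like the following sketch. It only builds a throwaway `sqlite:///` URI with the standard library; the name `temporary_tracking_uri` is hypothetical, and passing the URI to `mlflow.set_tracking_uri` is left to the caller.

```python
import os
import tempfile
from contextlib import contextmanager


@contextmanager
def temporary_tracking_uri():
    """Yield a sqlite:/// URI inside a temporary directory that is deleted on exit."""
    with tempfile.TemporaryDirectory() as tmp_dir:
        db_path = os.path.join(tmp_dir, "mlruns.db")
        yield f"sqlite:///{db_path}"


with temporary_tracking_uri() as uri:
    # A test would call mlflow.set_tracking_uri(uri) here and run its assertions.
    print(uri)

# Once the block exits, the directory and anything stored in it are removed,
# so runs and experiments do not leak from one test into the next.
```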

njbrake · 2024-02-26T17:51:01Z

Hi! I'm here because this PR broke my code :) I think there might be a compatibility break here?

In my code I call mlflow.set_tracking_uri(my_mlflow_tracking_uri), but I never set the MLFLOW_TRACKING_URI env var. Is there a way to change this logic so that it checks if the tracking_uri was already set in the mlflow library? In my case it appears that the code

        self._tracking_uri = os.getenv("MLFLOW_TRACKING_URI", "")
        self._experiment_name = os.getenv("MLFLOW_EXPERIMENT_NAME", None)
        self._flatten_params = os.getenv("MLFLOW_FLATTEN_PARAMS", "FALSE").upper() in ENV_VARS_TRUE_VALUES
        self._run_id = os.getenv("MLFLOW_RUN_ID", None)
        logger.debug(
            f"MLflow experiment_name={self._experiment_name}, run_name={args.run_name}, nested={self._nested_run},"
            f" tags={self._nested_run}, tracking_uri={self._tracking_uri}"
        )

means that the tracking_uri is getting overridden to be "" because I didn't set the env var.

I can change my implementation to be compatible with the change that was merged, but others may have similar issues if their implementation was similar to mine.

seanswyi · 2024-02-26T22:55:32Z

Hi @njbrake! Thanks for the comment, that's something that I missed earlier. The MLflow API seems to provide the functions mlflow.get_tracking_uri() and mlflow.is_tracking_uri_set(). It would probably be better to check whether a tracking URI is already set before initializing self._tracking_uri.

Maybe I could incorporate this in the other PR (#29096) that addresses @harupy's previous comment as well? (cc. @amyeroberts)
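
One possible shape for that check, sketched against a hypothetical stand-in so it runs without MLflow installed. `FakeMlflow` mimics only `set_tracking_uri` and `is_tracking_uri_set`; `resolve_tracking_uri` and its precedence (an explicit environment variable wins, otherwise an already-set URI is left alone) are assumptions, not the logic that was eventually merged.

```python
class FakeMlflow:
    """Hypothetical stand-in for the mlflow module."""

    def __init__(self):
        self._uri = None

    def set_tracking_uri(self, uri):
        self._uri = uri

    def is_tracking_uri_set(self):
        return self._uri is not None


def resolve_tracking_uri(ml_flow, environ):
    """Return where the effective tracking URI comes from."""
    env_uri = environ.get("MLFLOW_TRACKING_URI")
    if env_uri:
        # Assumed precedence: an explicit environment variable wins.
        ml_flow.set_tracking_uri(env_uri)
        return "environment"
    if ml_flow.is_tracking_uri_set():
        # The user already configured MLflow in code; do not touch it.
        return "user code"
    # Nothing was configured; leave MLflow's own default in place.
    return "default"


configured = FakeMlflow()
configured.set_tracking_uri("sqlite:///my.db")

source_user = resolve_tracking_uri(configured, environ={})
source_default = resolve_tracking_uri(FakeMlflow(), environ={})
source_env = resolve_tracking_uri(
    FakeMlflow(), environ={"MLFLOW_TRACKING_URI": "sqlite:///env.db"}
)
print(source_user, source_default, source_env)
```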

amyeroberts · 2024-02-27T09:48:01Z

@seanswyi Yes please!

seanswyi added 3 commits February 15, 2024 15:37

Added option to set tracking URI for MLflowCallback.

59b14ab

Added option to set tracking URI for MLflowCallback.

85c9188

Merge branch 'feature/add-mlflow-uri-to-mlflowcallback' of github.com…

bf202ef

…:seanswyi/transformers into feature/add-mlflow-uri-to-mlflowcallback

amyeroberts approved these changes Feb 15, 2024

Changed to in docstring.

40db40f

muellerzr approved these changes Feb 16, 2024

amyeroberts merged commit 161fe42 into huggingface:main Feb 16, 2024
21 checks passed

seanswyi mentioned this pull request Feb 19, 2024

Fix: Fixed the previous tracking URI setting logic to prevent clashes with original MLflow code. #29096

Merged

5 tasks

harupy mentioned this pull request Feb 22, 2024

Avoid transformers 4.38.1 in cross-version tests mlflow/mlflow#11221

Merged

37 tasks

harupy added a commit to harupy/mlflow that referenced this pull request Mar 14, 2024

Remove workaround for huggingface/transformers#29032

3621a16

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy added a commit to mlflow/mlflow that referenced this pull request Mar 14, 2024

Remove workaround for huggingface/transformers#29032 (#11417)

d6a94b6

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

chenmoneygithub pushed a commit to chenmoneygithub/mlflow that referenced this pull request Mar 18, 2024

Remove workaround for huggingface/transformers#29032 (mlflow#11417)

e65fc04

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

artjen pushed a commit to artjen/mlflow that referenced this pull request Mar 26, 2024

Remove workaround for huggingface/transformers#29032 (mlflow#11417)

b800830

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Signed-off-by: Arthur Jenoudet <arthur.jenoudet@databricks.com>

itazap pushed a commit that referenced this pull request May 14, 2024

Feature: Option to set the tracking URI for MLflowCallback. (#29032)

a9b7250

* Added option to set tracking URI for MLflowCallback. * Added option to set tracking URI for MLflowCallback. * Changed to in docstring.

cecheta mentioned this pull request Oct 21, 2024

Feature: Add MLFLOW_MAX_LOG_PARAMS to MLflowCallback #34279

Merged

5 tasks
