fix: reachability check uses mlflow username/password if provided #416

nbowen3 · 2025-12-09T16:53:25Z

Relevant issue or PR

Description of changes

If auth is present in MLFlow, we need to pass the creds in via environment variables (https://mlflow.org/docs/latest/self-hosting/security/basic-http-auth/#using-environment-variables). While this works for when we use the mlflow package, the reachability check just uses requests. This adds the username/password to the requests.

Note: I thought about using the /health endpoint of mlflow instead (which is un-authed on mlflow) but it is (a) undocumented so might change and (b) doesn't replicate actual usage as well.
We also don't use the traditional tesseract config package (prepand config with TESSERACT) for the same reason (want to replicate the actual process the mlflow package will use when it makes requests).

Testing done

Local testing

PasteurBot · 2025-12-09T16:53:50Z

CLA signatures confirmed

All contributors have signed the Contributor License Agreement.
_{Posted by the CLA Assistant Lite bot.}

xalelax

In principle LGTM, but

fix linting (you can use pre-commit for it)
the comments here are superfluous, I'd delete them

nbowen3 · 2025-12-09T17:00:57Z

@PasteurBot I have read the CLA Document and I hereby sign the CLA

codecov · 2025-12-09T17:05:46Z

Codecov Report

❌ Patch coverage is 71.42857% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 76.80%. Comparing base (2147b14) to head (6440621).

Files with missing lines	Patch %	Lines
tesseract_core/runtime/mpa.py	69.23%	3 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #416      +/-   ##
==========================================
+ Coverage   76.69%   76.80%   +0.10%     
==========================================
  Files          29       29              
  Lines        3420     3427       +7     
  Branches      533      533              
==========================================
+ Hits         2623     2632       +9     
- Misses        565      566       +1     
+ Partials      232      229       -3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

xalelax · 2025-12-09T17:09:41Z

I approved, but maybe let's also document this somewhere in the docs if you find a nice spot 🙂

sakoush · 2025-12-09T17:14:11Z

tesseract_core/runtime/mpa.py

+                username = os.environ.get("MLFLOW_TRACKING_USERNAME")
+                password = os.environ.get("MLFLOW_TRACKING_PASSWORD")


should we integrate these envars with the config, cc @xalelax?

@sakoush The mlflow package expects those specific env variables so we have to set those no matter what. We could add these to config but then either need to set them twice or take the config version and apply it to these variables somewhere within the Tesseract. This felt cleanest.

I think that the reason of why this was done that @nbowen3 wrote in the description of the pr (and here) is satisfying, not all env variables need to be there, only stuff super associated with Tesseracts (like "mount these volumes", "use a gpu", and so on). These ones are associated with mlflow, and with the exact same name that one would use with its client, so it seems to me like the best choice.

Maybe @dionhaefner can also chime in

(also, to add to my comment above, maybe we could also consider renaming TESSERACT_MLFLOW_TRACKING_URI to just MLFLOW_TRACKING_URI, as the former might be a bit confusing)

We used to have MLFLOW_TRACKING_URI but it interfered with people rolling their own MLFlow solution within Tesseracts. The same logic may apply here, but I'm currently not seeing how all the dots connect, so will leave it to @xalelax to decide.

@nbowen3 let's add it to runtime config (which also implies having the TESSERACT_ prefix).

The main reason is to keep the vars very separate when one is using the mpa module of tesseract vs a custom client-defined mlflow client. In principle we could use the same var names, given that likely one would only do one of the two options described above, but this is safer for now.

@xalelax I'm not sure I entirely follow here. We have to have these variables (MLFLOW_TRACKING_USERNAME/MLFLOW_TRACKING_PASSWORD) set for the package to work properly. In your solution, where do those get set? Does the user need to set both environment variables?

We have to have these variables (MLFLOW_TRACKING_USERNAME/MLFLOW_TRACKING_PASSWORD) set for the package to work properly

Just tested it, and they are not necessary -- although mlflow docs suck for this, but the source is relatively clear.

It should just be basic http auth, so we can just get from the environment whatever we want (like TESSERACT_MLFLOW_TRACKING_USERNAME), and use them in the uri as follows:

mlflow.set_tracking_uri("http://user:[email protected]:5000")

(and also in the auth parameters in the get you added). mlflow will parse that uri properly.

To try it locally, one can start a container via

$ docker run -d -p 5000:5000 --name mlflow-server -e MLFLOW_FLASK_SERVER_SECRET_KEY="secret-key" --entrypoint bash ghcr.io/mlflow/mlflow:latest -c "pip install mlflow[auth] && mlflow server --host 0.0.0.0 --port 5000 --app-name basic-auth --backend-store-uri sqlite:///mlflow.db --default-artifact-root ./mlartifacts"

(by default this will create a user called admin with pass password1234).

Then use this python script

import mlflow mlflow.set_tracking_uri("http://admin:password1234@localhost:5000") mlflow.set_experiment("test-creds") with mlflow.start_run(): mlflow.log_param("method", "uri-creds") mlflow.log_metric("test", 1) print("Success!")

tbh, one could also not change anything at all in this pr and just add those creds in the tracking uri (it's an env var anyway), but having this might be convenient.

@xalelax Thanks for the pointer on this! I wasn't aware that worked. I updated the PR to account for this.

jpbrodrick89 · 2025-12-09T22:55:57Z

Note that mlflow also accepts a credentials file, we should support mounting that as an option. Mlflow should automatically read this file if environment variable aren't set.

xalelax · 2025-12-10T15:17:15Z

@nbowen3 feel free to implement @jpbrodrick89 's suggestion if you can -- or leave it open, and I will implement in a follow-up pr 👍

feat: pass mlflow username/password if provided

9ac8f31

nbowen3 requested review from apaleyes, dionhaefner and xalelax as code owners December 9, 2025 16:53

pre commit fixes

5cea258

xalelax reviewed Dec 9, 2025

View reviewed changes

pasteurlabs deleted a comment from codecov bot Dec 9, 2025

delete comments

f65fdde

nbowen3 requested a review from xalelax December 9, 2025 17:02

linting

d0f7d34

xalelax approved these changes Dec 9, 2025

View reviewed changes

nbowen3 enabled auto-merge (squash) December 9, 2025 17:11

sakoush reviewed Dec 9, 2025

View reviewed changes

nbowen3 mentioned this pull request Dec 9, 2025

doc: add mlflow auth information to docs #417

Open

nbowen3 added 2 commits December 11, 2025 18:56

address comments

123e4c1

pre commit

6440621

		username = os.environ.get("MLFLOW_TRACKING_USERNAME")
		password = os.environ.get("MLFLOW_TRACKING_PASSWORD")

fix: reachability check uses mlflow username/password if provided #416

Are you sure you want to change the base?

fix: reachability check uses mlflow username/password if provided #416

Uh oh!

Conversation

nbowen3 commented Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Relevant issue or PR

Description of changes

Testing done

Uh oh!

PasteurBot commented Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CLA signatures confirmed

Uh oh!

xalelax left a comment

Choose a reason for hiding this comment

Uh oh!

nbowen3 commented Dec 9, 2025

Uh oh!

codecov bot commented Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

xalelax commented Dec 9, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xalelax Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jpbrodrick89 commented Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xalelax commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

nbowen3 commented Dec 9, 2025 •

edited

Loading

PasteurBot commented Dec 9, 2025 •

edited

Loading

codecov bot commented Dec 9, 2025 •

edited

Loading

xalelax Dec 11, 2025 •

edited

Loading

jpbrodrick89 commented Dec 9, 2025 •

edited

Loading