- 
          
- 
                Notifications
    You must be signed in to change notification settings 
- Fork 635
fix: fixes to prepare for making bootstrap=script the default for Linux #2760
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: fixes to prepare for making bootstrap=script the default for Linux #2760
Conversation
902ca8a    to
    0fb8f52      
    Compare
  
    | Hrm. CI flagged an issue relating to the runtime-env toolchain: it doesn't respect the virtual env. This is because the  There's another CI failure relating to compile_pip_requirements, but I haven't had a chance to look yet. | 
| note to self: 
 | 
| Ok, so what I've figured out is: 
 (1) and (2) mean, in order to use py3.9 with runtime-venv toolchain, the only way to make it even see the venv is to create it at runtime with the typical symlink. This would also solve (3) (symlink lib/python3.11 to python3.10; technically wrong, but matches historical behavior); I can think of some alternatives for (3) that might work right now (PYTHONPATH, addsitedir(), or sys.path setup in stage2), but in order to have a normally functioning venv site-packages dir, we have to create lib/pythonX.Y matching the current runtime version. Anyways, what I'm thinking is to add something to the toolchain definition that says "recreate the venv at runtime", and then the runtime env toolchain sets this. We already have a flag for this due to rules_pkg not handling raw symlinks. I think having a "create venv at runtime" thing is gonna be a fact of life for awhile, at least until 3.11 is the minimum supported version. | 
| To fix the situation when  
 The logic itself wasn't so bad; much of it already exists because of the zip logic and "don't use declare_symlink" logic. | 
| Next failure: //tests/multiple_inputs/... Basically, the compile_pip_requirements() rule breaks if...toml files are used? There's 3 tests, 1 pass, 2 fail: 
 The error seems to stem from a temporary venv (used to run pip compile) being created (by piptools and/or build) from within the py_binary's venv. It's odd that one test passes and others fail, though. Maybe the venv-in-venv thing is a red-herring? Or toml triggers this venv-in-venv thing? 
 Finally figured it out! The missing "home" key in the pyvenv.cfg file causes  Under the hood, compile_pip_requirements uses piptools, which uses the  Fixed by having bazel_site_init fixup sys._base_executable. | 
299f60f    to
    ca41e7d      
    Compare
  
    ca41e7d    to
    dfa4fe9      
    Compare
  
    dfa4fe9    to
    91d1072      
    Compare
  
    | OK, ready for review. I've change scope slightly here: this PR fixes various issues setting bootstrap=script as the default revealed. A separate PR will actually change it to the default. This grew a bit bigger than I anticipated, so quick summary: 
 | 
| fi | ||
|  | ||
| mkdir -p "$venv/bin" | ||
| ln -s "$python_exe_actual" "$python_exe" | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not blocking, just thinking here.
These symlinks are done at runtime, this may be troublesome on read-only environments - for example docker image running with read-only filesystem where only particular directories are writeable, e.g. /tmp.
I think this might be OK, but somebody will definitely come and say that they want the python from the docker container and I am curious how all of this will go.
We cannot create the symlinks at build time though, because they are not known?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We cannot create the symlinks at build time though, because they are not known?
Correct. When the python being run is coming from PATH at runtime, then we can't know it at build-time.
The next closest approximation of this are local toolchains: they can run which python3 during the repo phase, figure out the absolute path, and have that written as the symlink. This can be host-specific, though (e.g. on my machine it'll resolve to /home/richard/pyvenv/3.11/bin/python3, which won't be in a docker container).
Troublesome for read-only environments
Yeah, unfortunately, what options are available depends on the combination of (1) what python version is used, (2) if the build time and runtime versions match, and (3) if a wrapper script is used a runtime.
In order to have the combination of (1) read-only runtime environment, (2) use the build-time generated venv, and (3) use python from the current runtime environment, then...
At the least, the runtime and build time python versions have to match. Without that, things get dicey -- PYTHONPATH is the only other thing i could come up with, but that's going to pollute subprocesses and change sys.path ordering. So then more hacks to try and workaround that (a second envvar to indicate "undo PYTHONPATH hack" ?)
If Python 3.11+ is used, then the PYTHONEXECUTABLE environment variable will handle things. Stated another way: if you're using python 3.11, changing the runtime env toolchain to have supports_build_time_venv=True should Just Work (and no temp venv need be created). I suppose I could add a flag for that? Or a --runtime_version_matches_build_version type of flag (these flags would be specific to the runtime_env toolchain). Or maybe put a select() on the runtime_env toolchain: set True if --python_version >= 3.11, False otherwise?
For earlier versions, I think so long as the $actual value for exec -a $actual $venv_bin_python3 points to the actual interpreter (/usr/bin/python3, or whatever sys._base_executable would tell -- basically a binary that can find its python home), then that also works (I think. Can't remember after so many days of hammering on this).
For this case, making the venv/bin/python3 a wrapper script to handle this (instead of stage1 / runtime_env_interpreter.sh) might work better. We'd might need another setting on the toolchain to know whether to do that? Not entirely sure.
Part of the problem here is switching from non-venv to venv style of execution. Non-venv execution used PYTHONPATH and shoved everything on front, not caring about python version stuff. Venv execution doesn't use PYTHONPATH, but now cares about the python version, since it uses it as part of where it tries to find site-packages.
A saving grace here is using the runtime env toolchain with bootstrap=script didn't work before this PR anyways.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
py3.11, build time venv, runtime-env toolchain Just works
i checked this -- yeah, it just works and doesn't require a writable scratch space. I updated the runtime_env toolchain definition to use a select().
Side note: i had to implement an is_python_at_least helper flag. See config_settings.bzl. Maybe we should factor our a generic "is_python_version_within" flag, and give it args like "gt/gte/lt/lte" ? Seems like that might be useful to the pypi generation stuff.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hmmm, regarding factoring out the python flag, +1, We could use env markers for that as well. Could you please create a ticket for that?
Regarding the rest, thanks for the explanation, it's great to have it here.
Co-authored-by: Ignas Anikevicius <[email protected]>
…ckeylev/rules_python into feat.default.bootstrap.script
…snt support the build time venv
…into feat.default.bootstrap.script
| current = tuple( | ||
| ctx.attr._major_minor[config_common.FeatureFlagInfo].value.split("."), | ||
| ) | ||
| value = "yes" if current >= at_least else "no" | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This version matching is a little bit brittle. Might be better to use evaluate from pep508_evaluate where we use "python_version >= {}".format(ctx.attr.at_least) for the marker and then evaluate by passing in the env.
I think it is fine for now to have your implementation, but at some point it may be nicer to use the standard evaluation.
| current = tuple( | ||
| ctx.attr._major_minor[config_common.FeatureFlagInfo].value.split("."), | ||
| ) | ||
| value = "yes" if current >= at_least else "no" | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM on using yes and `no.
Bumps most dependency versions (except `rules_python` under `WORKSPACE`) to the latest available before releasing `rules_scala` v7.1.0. Also bumps the `rules_scala` version number in `MODULE.bazel` to `7.1.0`. - Go: 1.24.4 => 1.24.5 - Scalafmt: 3.9.7 => 3.9.8 - `bazel_skylib`: 1.7.1 => 1.8.1 - `gazelle`: 0.43.0 => 0.44.0 - `golang.org/x/tools`: v0.34.0 => v0.35.0 - `org.scala-sbt:util-interface`: 1.11.1 => 1.11.3 - `proto-google-common-protos`: 2.58.0 => 2.59.2 - `rules_cc`: 0.1.1 => 0.1.4 - `rules_go`: 0.55.0 => 0.55.1 - `rules_java`: 8.12.0 => 8.14.0 - `rules_jvm_external`: 6.7 => 6.8 - `rules_python`: 1.4.1 => 1.5.1 (Bzlmod only) Also: - Moves the `http_archive` instantiations of `rules_python` and `rules_shell` from individual `WORKSPACE` files to `//scala:latest_deps.bzl`. Removes `load` statements for `http_archive` throughout all `WORKSPACE` files. - Removes the `--incompatible_autoload_externally` flag from `.bazelrc`. - Fixes `_get_compiler_srcjar` from `//scala/private:macros/scala_repositories.bzl` to return empty `compiler_srcjar_objects` instead of returning a valid object. Caught by `test_compiler_srcjar_error 2.12.11` from `dt_patches/dt_patch_test.sh` when testing under `WORKSPACE`. Added `test_fail_if_compiler_srcjar_object_is_empty` to `test/shell/test_compiler_sources_integrity.sh` to catch the specific problem sooner. --- This is to ensure that we're compatible with the latest dependencies available before releasing a new minor version. I moved the `rules_python` instantiation to `//scala:latest_deps.bzl` because it was such a pain to update everywhere. `latest_deps.bzl` is essentially a development file, or a "use at your own risk" file, and the primary consumers are internal test modules. This will make future `rules_python` updates less noisy. The only `rules_shell` instantiations were in the top level `WORKSPACE` and `test_version/WORKSPACE.template`. I moved `rules_shell` into `latest_deps.bzl` to eliminate these direct instantations as well, and to remove the `http_archive` imports from those files. I'd added `--incompatible_autoload_externally` in bazel-contrib#1748 when fixing problems with `WORKSPACE` builds under Bazel 8.2.1. However, it's not really necessary, so I removed it. Updating `rules_python` from 1.4.1 to 1.5.1 produced the following error in Bazel 7.6.1 WORKSPACE builds, even with `--incompatible_autoload_externally=+@rules_python`: ```txt $ bazel build //src/... //test/... WARNING: Couldn't auto load rules or symbols, because no dependency on module/repository 'rules_python' found. This will result in a failure if there's a reference to those rules or symbols. ERROR: test/BUILD:824:10: in py_binary rule //test:py_resource_binary: Traceback (most recent call last): File ".../external/rules_python/python/private/py_binary_rule.bzl", line 24, column 30, in _py_binary_impl return py_executable_impl( File ".../external/rules_python/python/private/py_executable.bzl", line 255, column 35, in py_executable_impl py_executable_base_impl( File ".../external/rules_python/python/private/py_executable.bzl", line 1140, column 46, in py_executable_base_impl exec_result = semantics.create_executable( File ".../external/rules_python/python/private/py_executable.bzl", line 365, column 33, in _create_executable _create_stage1_bootstrap( File ".../external/rules_python/python/private/py_executable.bzl", line 809, column 27, in _create_stage1_bootstrap if runtime and runtime.supports_build_time_venv: Error: 'PyRuntimeInfo' value has no field or method 'supports_build_time_venv' Available attributes: bootstrap_template, coverage_files, coverage_tool, files, interpreter, interpreter_path, python_version, stub_shebang ERROR: test/BUILD:824:10: Analysis of target '//test:py_resource_binary' failed ERROR: Analysis of target '//test:py_resource_binary' failed; build aborted ``` This is due to bazel-contrib/rules_python#2760. In contrast, Bazel 8.3.1 builds work fine, as do Bazel 7.6.1 builds under Bzlmod. Trying `--incompatible_autoload_externally=PyRuntimeInfo` to work around the problem under Bazel 7.6.1 produced: ```txt $ bazel build //src/... //test/... FATAL: bazel crashed due to an internal error. Printing stack trace: java.lang.IllegalStateException: Symbol in 'PyRuntimeInfo' can't be removed, because it's still used by: py_binary, py_test, py_library at com.google.devtools.build.lib.packages.AutoloadSymbols.<init>(AutoloadSymbols.java:192) [...snip...] ``` And with `--incompatible_autoload_externally=@rules_python` produced: ```txt WARNING: Couldn't auto load rules or symbols, because no dependency on module/repository 'rules_python' found. This will result in a failure if there's a reference to those rules or symbols. ERROR: .../external/bazel_tools/tools/python/toolchain.bzl:25:44: name 'PyRuntimeInfo' is not defined ERROR: .../external/bazel_tools/tools/python/toolchain.bzl:33:44: name 'PyRuntimeInfo' is not defined ERROR: .../external/bazel_tools/tools/python/toolchain.bzl:64:26: name 'PyRuntimeInfo' is not defined ERROR: .../external/bazel_tools/tools/python/toolchain.bzl:72:26: name 'PyRuntimeInfo' is not defined WARNING: Target pattern parsing failed. ERROR: Skipping '//test/...': error loading package under directory 'test': error loading package 'test': at .../external/rules_python/python/defs.bzl:21:6: at .../external/rules_python/python/py_runtime_pair.bzl:17:6: compilation of module 'tools/python/toolchain.bzl' failed ERROR: error loading package under directory 'test': error loading package 'test': at .../external/rules_python/python/defs.bzl:21:6: at .../external/rules_python/python/py_runtime_pair.bzl:17:6: compilation of module 'tools/python/toolchain.bzl' failed ``` `tools/python/toolchain.bzl` from Bazel 7.6.1 breaks because it references the builtin `PyRuntimeInfo` instead of `load`ing it from `@rules_python`. Bazel 8 replaces that file's entire implementation with a function that `fail`s with a deprecation error and doesn't use `PyRuntimeInfo`: - https://github.com/bazelbuild/bazel/blob/7.6.1/tools/python/toolchain.bzl - https://github.com/bazelbuild/bazel/blob/8.0.0/tools/python/toolchain.bzl Using `+@rules_python` or `+PyRuntimeInfo` produced the original error. Nothing I could do could avoid using the builtin `PyRuntimeInfo` under Bazel 7.6.1. Since it's only a development dependency for `WORKSPACE` builds, I decided to keep `rules_python` at 1.4.1 in `latest_deps.bzl` for now. I've filed bazel-contrib/rules_python#3119 to get it on the record.
* Bump dependency versions for `rules_scala` v7.1.0 Bumps most dependency versions (except `rules_python` under `WORKSPACE`) to the latest available before releasing `rules_scala` v7.1.0. Also bumps the `rules_scala` version number in `MODULE.bazel` to `7.1.0`. - Go: 1.24.4 => 1.24.5 - Scalafmt: 3.9.7 => 3.9.8 - `bazel_skylib`: 1.7.1 => 1.8.1 - `gazelle`: 0.43.0 => 0.44.0 - `golang.org/x/tools`: v0.34.0 => v0.35.0 - `org.scala-sbt:util-interface`: 1.11.1 => 1.11.3 - `proto-google-common-protos`: 2.58.0 => 2.59.2 - `rules_cc`: 0.1.1 => 0.1.4 - `rules_go`: 0.55.0 => 0.55.1 - `rules_java`: 8.12.0 => 8.14.0 - `rules_jvm_external`: 6.7 => 6.8 - `rules_python`: 1.4.1 => 1.5.1 (Bzlmod only) Also: - Moves the `http_archive` instantiations of `rules_python` and `rules_shell` from individual `WORKSPACE` files to `//scala:latest_deps.bzl`. Removes `load` statements for `http_archive` throughout all `WORKSPACE` files. - Removes the `--incompatible_autoload_externally` flag from `.bazelrc`. - Fixes `_get_compiler_srcjar` from `//scala/private:macros/scala_repositories.bzl` to return empty `compiler_srcjar_objects` instead of returning a valid object. Caught by `test_compiler_srcjar_error 2.12.11` from `dt_patches/dt_patch_test.sh` when testing under `WORKSPACE`. Added `test_fail_if_compiler_srcjar_object_is_empty` to `test/shell/test_compiler_sources_integrity.sh` to catch the specific problem sooner. --- This is to ensure that we're compatible with the latest dependencies available before releasing a new minor version. I moved the `rules_python` instantiation to `//scala:latest_deps.bzl` because it was such a pain to update everywhere. `latest_deps.bzl` is essentially a development file, or a "use at your own risk" file, and the primary consumers are internal test modules. This will make future `rules_python` updates less noisy. The only `rules_shell` instantiations were in the top level `WORKSPACE` and `test_version/WORKSPACE.template`. I moved `rules_shell` into `latest_deps.bzl` to eliminate these direct instantations as well, and to remove the `http_archive` imports from those files. I'd added `--incompatible_autoload_externally` in #1748 when fixing problems with `WORKSPACE` builds under Bazel 8.2.1. However, it's not really necessary, so I removed it. Updating `rules_python` from 1.4.1 to 1.5.1 produced the following error in Bazel 7.6.1 WORKSPACE builds, even with `--incompatible_autoload_externally=+@rules_python`: ```txt $ bazel build //src/... //test/... WARNING: Couldn't auto load rules or symbols, because no dependency on module/repository 'rules_python' found. This will result in a failure if there's a reference to those rules or symbols. ERROR: test/BUILD:824:10: in py_binary rule //test:py_resource_binary: Traceback (most recent call last): File ".../external/rules_python/python/private/py_binary_rule.bzl", line 24, column 30, in _py_binary_impl return py_executable_impl( File ".../external/rules_python/python/private/py_executable.bzl", line 255, column 35, in py_executable_impl py_executable_base_impl( File ".../external/rules_python/python/private/py_executable.bzl", line 1140, column 46, in py_executable_base_impl exec_result = semantics.create_executable( File ".../external/rules_python/python/private/py_executable.bzl", line 365, column 33, in _create_executable _create_stage1_bootstrap( File ".../external/rules_python/python/private/py_executable.bzl", line 809, column 27, in _create_stage1_bootstrap if runtime and runtime.supports_build_time_venv: Error: 'PyRuntimeInfo' value has no field or method 'supports_build_time_venv' Available attributes: bootstrap_template, coverage_files, coverage_tool, files, interpreter, interpreter_path, python_version, stub_shebang ERROR: test/BUILD:824:10: Analysis of target '//test:py_resource_binary' failed ERROR: Analysis of target '//test:py_resource_binary' failed; build aborted ``` This is due to bazel-contrib/rules_python#2760. In contrast, Bazel 8.3.1 builds work fine, as do Bazel 7.6.1 builds under Bzlmod. Trying `--incompatible_autoload_externally=PyRuntimeInfo` to work around the problem under Bazel 7.6.1 produced: ```txt $ bazel build //src/... //test/... FATAL: bazel crashed due to an internal error. Printing stack trace: java.lang.IllegalStateException: Symbol in 'PyRuntimeInfo' can't be removed, because it's still used by: py_binary, py_test, py_library at com.google.devtools.build.lib.packages.AutoloadSymbols.<init>(AutoloadSymbols.java:192) [...snip...] ``` And with `--incompatible_autoload_externally=@rules_python` produced: ```txt WARNING: Couldn't auto load rules or symbols, because no dependency on module/repository 'rules_python' found. This will result in a failure if there's a reference to those rules or symbols. ERROR: .../external/bazel_tools/tools/python/toolchain.bzl:25:44: name 'PyRuntimeInfo' is not defined ERROR: .../external/bazel_tools/tools/python/toolchain.bzl:33:44: name 'PyRuntimeInfo' is not defined ERROR: .../external/bazel_tools/tools/python/toolchain.bzl:64:26: name 'PyRuntimeInfo' is not defined ERROR: .../external/bazel_tools/tools/python/toolchain.bzl:72:26: name 'PyRuntimeInfo' is not defined WARNING: Target pattern parsing failed. ERROR: Skipping '//test/...': error loading package under directory 'test': error loading package 'test': at .../external/rules_python/python/defs.bzl:21:6: at .../external/rules_python/python/py_runtime_pair.bzl:17:6: compilation of module 'tools/python/toolchain.bzl' failed ERROR: error loading package under directory 'test': error loading package 'test': at .../external/rules_python/python/defs.bzl:21:6: at .../external/rules_python/python/py_runtime_pair.bzl:17:6: compilation of module 'tools/python/toolchain.bzl' failed ``` `tools/python/toolchain.bzl` from Bazel 7.6.1 breaks because it references the builtin `PyRuntimeInfo` instead of `load`ing it from `@rules_python`. Bazel 8 replaces that file's entire implementation with a function that `fail`s with a deprecation error and doesn't use `PyRuntimeInfo`: - https://github.com/bazelbuild/bazel/blob/7.6.1/tools/python/toolchain.bzl - https://github.com/bazelbuild/bazel/blob/8.0.0/tools/python/toolchain.bzl Using `+@rules_python` or `+PyRuntimeInfo` produced the original error. Nothing I could do could avoid using the builtin `PyRuntimeInfo` under Bazel 7.6.1. Since it's only a development dependency for `WORKSPACE` builds, I decided to keep `rules_python` at 1.4.1 in `latest_deps.bzl` for now. I've filed bazel-contrib/rules_python#3119 to get it on the record. * Fix test_compiler_sources_integrity.sh on non-Mac The unescaped `{` in the expected failure message caused `test_fail_if_compiler_srcjar_object_is_empty` to fail on Linux and Windows. Rather than escape it, it works just as well to remove it entirely. See also the "Escape '{', '}' in `test/shell/test_bzlmod_macros`" message from #1722 and commit 3f37e26. * Add job to test with Bazel 9.0.0-pre.20250710.1 This is temporary until a fix lands for bazelbuild/bazel#26579, but will ensure that new changes get some degree of prerelease testing.
Various cleanup and prep work to switch bootstrap=script to be the default.
Change
bootstrap_implto always be disabled for windows. This allows setting it totrue in a bazelrc without worrying about the target platform. This is done by using
FeatureFlagInfo to force the value to disabled for windows. This allows any downstream
usages of the flag to Just Work and not have to add selects() for windows themselves.
Switch pip_repository_annotations test to
import python.runfiles. The script bootstrapdoesn't add the runfiles root to sys.path, so
import rules_pythonstops working.Switch gazelle workspace to using the runtime-env toolchain. It was previously
implicitly using the deprecated one built into bazel, which doesn't provide various
necessary provider fields.
Make the local toolchain use
sys._base_executableinstead ofsys.executablewhen finding the interpreter. Otherwise, it might find a venv interpreter or not
properly handle wrapper scripts like pyenv.
Adds a toolchain attribute/field to indicate if the toolchain supports a build-time
created venv. This is due to the runtime_env toolchain. See PR comments for details,
but in short: if we don't know the python interpreter path and version at
build time, the venv may not properly activate or find site-packages.
If it isn't supported, then the stage1 bootstrap creates a temporary venv, similar
to how the zip case is handled. Unfortunately, this requires invoking Python itself
as part of program startup, but I don't see a way around that -- note this is
only triggered by the runtime-env toolchain.
Make the runtime-env toolchain better support virtualenvs. Because it's a wrapper
that re-invokes Python, Python can't automatically detect its in a venv. Two
tricks are used (
exec -aand PYTHONEXECUTABLE) to help address this (but theyaren't guaranteed to work, hence the "recreate at runtime" logic).
Fix a subtle issue where
sys._base_executableisn't set correctly due tohomemissing in the pyvenv.cfg file. This mostly only affected the creation of venvs
from within the bazel-created venv.
Change the bazel site init to always add the build-time created site-packages
(if it exists) as a site directory. This matches the system_python bootstrap
behavior a bit better, which just shoved everything onto sys.path using
PYTHONPATH.
Skip running runtime_env_toolchains tests on RBE. RBE's system python is 3.6,
but the script bootstrap uses 3.9 features. (Running it on RBE is questionable
anyways).
Along the way...
paths. The legacy behavior is disabled in Bazel 8+ by default.
Work towards #2521