
Use a script to populate parts of tox.ini #3920

Closed
wants to merge 79 commits

Conversation

Contributor

@sentrivana sentrivana commented Jan 13, 2025

Right now, our tox.ini has to be kept up-to-date with new releases manually. This is not feasible as we have tens of test suites.

This is the first step towards fully automating the process. This PR adds a script that's capable of figuring out which releases of a package are available, are supported, and should be tested.

How it works

For each package, test the minimum supported version. Which releases count as supported is determined by MIN_VERSION in integrations/__init__.py, by the Python versions supported by both the SDK and the package, and by the release date -- only releases not older than 5 years are considered.

Additionally, if the package has multiple majors, test the last release of each major as well as the first release of the last major.

If the package only has one major, take the first and last supported release and two in between.
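
To make the selection concrete, here is a minimal sketch of that logic. It is a hypothetical helper for illustration, not the actual populate_tox.py code, and it assumes `releases` is already filtered down to supported releases:

```python
from collections import defaultdict

from packaging.version import Version


def pick_releases_to_test(releases: list[Version]) -> list[Version]:
    # `releases` is assumed to be sorted and already filtered for support
    # (MIN_VERSION, Python compatibility, not older than 5 years).
    if len(releases) <= 4:
        return list(releases)

    # Always test the minimum supported release.
    picked = {releases[0]}

    by_major = defaultdict(list)
    for release in releases:
        by_major[release.major].append(release)

    if len(by_major) > 1:
        # Multiple majors: last release of each major, plus the first
        # release of the newest major.
        picked |= {versions[-1] for versions in by_major.values()}
        picked.add(by_major[max(by_major)][0])
    else:
        # Single major: first, last, and two releases spread in between.
        # Duplicate indexes are harmless because `picked` is a set.
        indexes = [0, len(releases) // 3, len(releases) // 3 * 2, -1]
        picked |= {releases[i] for i in indexes}

    return sorted(picked)
```

For a package with releases 1.0, 1.5, 2.0, 2.3, 3.0 and 3.1, this sketch would pick 1.0, 1.5, 2.3, 3.0 and 3.1.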

The future

Currently, the script is only responsible for generating the tox.ini entries for ~28 of our integration test suites.

More will follow in follow-up PRs.

Ideally, at some point, the tox entries for all integrations will be generated by this script. Once the script runs and regenerates tox.ini on an automated basis, we can also get rid of the -latest test suites, since the script will catch new releases and add them to tox.ini on its own.

Relates to #3808


codecov bot commented Jan 13, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 80.15%. Comparing base (4ae94a5) to head (64ced2f).
Report is 1 commit behind head on master.

✅ All tests successful. No failed tests found.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #3920      +/-   ##
==========================================
- Coverage   80.21%   80.15%   -0.07%     
==========================================
  Files         139      139              
  Lines       15394    15394              
  Branches     2596     2596              
==========================================
- Hits        12349    12339      -10     
- Misses       2202     2206       +4     
- Partials      843      849       +6     
Files with missing lines Coverage Δ
sentry_sdk/integrations/__init__.py 77.01% <ø> (ø)

... and 10 files with indirect coverage changes

@sentrivana sentrivana marked this pull request as ready for review January 20, 2025 14:00
Member

@szokeasaurusrex szokeasaurusrex left a comment


I started reviewing this PR – I am fully on board with the idea of automating our tox setup, but this PR is quite large and difficult to review.

Would you be open to splitting this into multiple smaller PRs? For example, the changes to the Python versions in the .github directory could be their own PR, adding the script could also be its own PR, and we might even consider making a PR for each individual integration we move over to the script (that level of division might admittedly be overkill if moving an individual integration over is a trivial change).

@@ -45,7 +45,10 @@ jobs:
python-version: 3.12

- run: |
pip install -e .
Member


General question (not related to this PR): Have we thought about using uv pip here? In my experience, it is much faster than pip

Contributor Author


See #3920 (comment); we can consider this, but not in this PR.

@@ -29,7 +29,7 @@ jobs:
strategy:
fail-fast: false
matrix:
python-version: ["3.7","3.9","3.11","3.12","3.13"]
python-version: ["3.7","3.9","3.11","3.12"]
Member


Why are we removing 3.13 here?

Contributor Author


See #3920 (comment) (also for the other CI YAML changes)

@@ -22,70 +22,6 @@ env:
CACHED_BUILD_PATHS: |
${{ github.workspace }}/dist-serverless
jobs:
test-flags-latest:
Member


Could you please explain why we need to delete these lines (and the lines in the other files)? It is unclear to me from the PR description.

Contributor Author


Explanation here: #3974 (comment)

Basically, the script no longer generates a -latest target, just pinned versions. This is fine since it always makes sure to include the latest available version. So if we run this script often enough (which is the plan), it effectively replaces the -latest category.

@@ -29,7 +29,7 @@ jobs:
strategy:
fail-fast: false
matrix:
python-version: ["3.8","3.9","3.11","3.12","3.13"]
python-version: ["3.9","3.12","3.13"]
Member


Why are we removing 3.8 and 3.11?

@@ -29,7 +29,7 @@ jobs:
strategy:
fail-fast: false
matrix:
python-version: ["3.6","3.7","3.8","3.9","3.11","3.12","3.13"]
python-version: ["3.8","3.9","3.11","3.12","3.13"]
Member


Why do we remove versions here, but add them later in the file?

Comment on lines +12 to +14
pip install -e ..
pip install -r populate_tox/requirements.txt
pip install -r split_tox_gh_actions/requirements.txt
Member


Same general idea of using uv pip instead of pip (just wondering if it is possible)

Contributor Author


It probably is possible but would be a bit of a larger overhaul. I think @antonpirker was already looking into switching to uv and was also happy about the performance. So this is something we can definitely consider.

scripts/populate_tox/README.md
dependencies, optionally gated behind specific conditions; and optionally
the Python versions to test on.

The format is:
Member


Should we consider formalizing these formats as TypedDicts, or by using some other structured data type?

Contributor Author


We can do this in a follow up PR, might be nice.

}


@functools.cache
Member


What does this do?

Contributor Author

@sentrivana sentrivana Jan 20, 2025


Internally it creates a dict mapping args to return values, which it then uses as a cache on subsequent calls. Here it helps us avoid making extra API calls. See https://docs.python.org/3/library/functools.html#functools.cache
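
A quick illustration of the pattern (a hypothetical helper, not the script's actual code; it uses PyPI's public JSON API and needs network access):

```python
import functools

import requests


@functools.cache
def fetch_pypi_metadata(package: str) -> dict:
    # The first call for a given package hits the network; later calls with
    # the same argument return the cached result without a new request.
    response = requests.get(f"https://pypi.org/pypi/{package}/json", timeout=10)
    response.raise_for_status()
    return response.json()
```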

scripts/populate_tox/populate_tox.py
Member

@szokeasaurusrex szokeasaurusrex left a comment


some more feedback

scripts/populate_tox/populate_tox.py
min_supported = Version(".".join(map(str, min_supported)))
else:
print(
f" {integration} doesn't have a minimum version defined in sentry_sdk/integrations/__init__.py. Consider defining one"
Member


Where will this show up? Is it meant to be run locally, or in a GitHub action? Just want to make sure it is not too easy to miss this message.

Contributor Author


The goal would eventually be to run this in a GH action. The log is mostly informative; if we notice it at some point and have the bandwidth, we can act on it. It's not a big deal if we miss it. If we decide at some point that we care more, we can figure out ways to make it more prominent.

else:
expected_python_versions = SpecifierSet(f">={MIN_PYTHON_VERSION}")

def _supports_lowest(release: Version) -> bool:
Member


Why does this need to be an inner function?

Contributor Author


It's only used as the key function for the bisect call a couple of lines below, and nowhere else.
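
For context, a simplified sketch of the pattern (bisect with a key function, available since Python 3.10); the "support" check is reduced to a plain version comparison here as a stand-in for the real check:

```python
import bisect

from packaging.version import Version

releases = sorted(Version(v) for v in ["1.0", "1.4", "2.0", "2.2", "3.0"])
lowest_supported = Version("2.0")


def _supports_lowest(release: Version) -> bool:
    # Over a sorted list this flips from False to True exactly once,
    # so bisect can find the first supported release in O(log n) checks.
    return release >= lowest_supported


# False < True, so bisect_left finds the first release whose key is True.
index = bisect.bisect_left(releases, True, key=_supports_lowest)
print(releases[index])  # 2.0
```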

Comment on lines +225 to +226
len(releases) // 3,
len(releases) // 3 * 2,
Member


A comment might be helpful here, to describe what these are trying to do.

Also, maybe we should use a set for indexes instead of a list, since depending on the value of releases, this could have duplicates

Contributor Author


Will add a comment in #3971

The indexes might contain duplicates, but they're only used for building the filtered_releases set, where any duplicates are ignored. If indexes were a huge list we could optimize, but in this case it doesn't really make a difference.

Comment on lines +1 to +3
jinja2
packaging
requests
Member


Perhaps we should pin these to specific versions to avoid breaking the script in the event of a breaking change in these packages

Contributor Author


I think this is fine to leave as is. We're only using very basic public APIs. If anything changes we can pin the version then.

tests/integrations/httpx/test_httpx.py
@sentrivana
Contributor Author

Closing this in favor of #3971 & friends to avoid confusion, but I will get back to you @szokeasaurusrex regarding the comments.

@sentrivana sentrivana closed this Jan 20, 2025
@sentrivana
Contributor Author

@szokeasaurusrex Regarding the CI YAML changes: these are still generated automatically from tox.ini by the split_tox_gh_actions script. Since tox.ini has changed (for some integrations at least) and is now possibly testing different framework versions on different Python versions, this translates to CI YAML changes.

The Python version changes are, in general, due to the script auto-determining the Python versions to use from the package's metadata on PyPI. For instance, I think we were testing some packages on 3.13 before they officially supported it (i.e., before they had 3.13 in their PyPI classifiers). The script also auto-selects a handful of Python versions to actually test; this is now automated instead of done manually (and possibly inconsistently), which also leads to CI YAML changes.
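
For illustration, a hedged sketch of deriving supported Python versions from a package's PyPI trove classifiers (a hypothetical, simplified helper, not necessarily what the script does; it needs network access):

```python
import requests
from packaging.version import Version


def python_versions_from_classifiers(package: str) -> list[Version]:
    # PyPI's JSON API exposes the trove classifiers of the latest release.
    metadata = requests.get(f"https://pypi.org/pypi/{package}/json", timeout=10).json()
    classifiers = metadata["info"]["classifiers"]

    prefix = "Programming Language :: Python :: "
    versions = []
    for classifier in classifiers:
        value = classifier.removeprefix(prefix)
        # Keep concrete versions like "3.9" or "3.12"; skip bare "3",
        # "3 :: Only", and implementation classifiers.
        if value != classifier and value.replace(".", "").isdigit() and value.count(".") == 1:
            versions.append(Version(value))

    return sorted(versions)


print(python_versions_from_classifiers("requests"))
```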
