-
Notifications
You must be signed in to change notification settings - Fork 2.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Load test classes with runtime classloader #34681
Conversation
f5a756d
to
f661328
Compare
2c4ddb4
to
8f88f8a
Compare
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
aba5d5c
to
68e6194
Compare
This comment has been minimized.
This comment has been minimized.
Still watching this one. I've created a minimal not-working-example to see if this fixes it. :) |
Still going on it ... :) The profile support in normal mode turned out to be a bit thorny, so looking at that now. I haven't checked for a while, but at one point I had reproducers for three of the test-classloading-related defects. One was passing, but two (annoyingly) were failing. If your reproducer is shareable, I'm happy to take it and include it in what I'm checking, or to add it into the test suites, if it's a gap in what we're testing now. (It'd be worth checking what I've added in #35124 to see if one of those covers your scenario, too.) |
4307712
to
181a730
Compare
I'm favor of merging (and dealing with any consequences in |
@holly-cummins could you rebase please? There is a conflict |
4c2be34
to
9c8f547
Compare
@holly-cummins I think you can probably squash now as I think the idea is to merge this ASAP. |
Co-Authored-By: Alexey Loubyansky <[email protected]> Co-Authored-By: Roberto Cortez <[email protected]>
9c8f547
to
ac829ad
Compare
+1 |
Status for workflow
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Checked out the PR and run the OTel related tests for execution time differences. Nothing meaningful was spotted.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Release the KRAAAKEN!
(Assuming after merge of quarkusio/quarkus#34681)
Just an FYI note for anyone else later, I had to update my extensions tests to Surefire 3.x to fix a break which I'm guessing began due to this change (apologies if not!). The extension test run against Quarkus main started failing last night, with:
The 'same tests' (repackaged) worked ok in last nights quarkus-platform run against Quarkus main however. Checking for differences, I noticed that the platform run was using Surefire 3.0.0 whilst the extension builds own runs were still on 2.22.1. Updating to the latest surefire and failsafe (3.5.2) the extension tests now work ok against quarkus main again. |
Thanks for the tip, @gemmellr! I noticed several ecosystem CI runs started failing with this change (which isn't too surprising, given how big the change is). Investigating the failures is today's job, but I think you might have saved me a lot of trouble. I'm surprised the surefire version would make a difference. I wonder what's different in surefire? One possibility is how the classpath gets passed to the FacadeClassLoader. I might be able to make my stuff tolerate Surefire 2, I will investigate. |
Np, I was also surprised so thought it worth a mention. I was also pleased though when I realised that was the case; it meant it was an easy fix, and also doesnt need me to release a new version for the next platform hehe. I had skimmed the description here and couldn't see anything suggesting a break should really be expected in my case, or anything specific I could even change that would fix whatever the issue was. It was only then I went looking for how much broke in the platform build that I was then surprised to find the quarkus-qpid-jms tests had actually worked there, which pretty quickly pointed out the difference/solution. |
Bugs fixed by this PR
@Nested
tests #45349@QuarkusTest
method parameter fails withClassNotFoundException
#42006Bugs created by this PR (doh!)
Outstanding issues/breaking changes (input to release notes)
@TestProfile
on@Nested
tests are not supported anymore (fixes QuarkusTest: consider removing the test profile support for@Nested
tests #45349).What problem is this solving?
We see a lot of problems caused by the fact that we load test classes with the deployment classloader, and then intercept the execution and reload the classes with the runtime classloader. Although the new test is loaded with the runtime classloader, its arguments are still loaded with the system classloader. To work around that we sometimes need to clone the arguments by serializing and de-serializing. This was always brittle and no longer worked at all on Java 17+ (until #40601 fixed that). We also see issues because parts of the test infrastructure see the 'wrong' instance of the class. See, for example, quarkiverse/quarkus-pact#73 and #22611.
We have several feature raised against the JUnit team to allow us more control over classloading. The first of this features was introduced in JUnit 5.10, and allows an interceptor to be registered before any tests are launched. This interceptor can set a thread context classloader, which is then used by JUnit to load tests.
My experiments with this feature were thoroughly disappointing. It turns out, setting a TCCL early in the test lifecycle doesn't really help us, because we overwrite our 'early' TCCL with other TCCLs later in the test lifecycle. The following diagram shows some of the places we set the TCCL.
Source: https://excalidraw.com/#json=HFPHIKx8wv0iiyXgNhAzw,8IlEmPcMRvm9pfCGShdClQ
What if we just used one of the existing interception points to set the 'right' classloader, before tests are loaded? If the tests were loaded with our preferred classloader, we wouldn’t need to intercept the factory. Loading the tests with the runtime classloader needs us to move some of our app initialisation earlier in the lifecycle, but I don't think there's any fundamental barrier to this. (We would have had to do this with a solution based on the new JUnit Launcher Interceptor anyway.)
The logic for starting Quarkus needs to be in the test discovery phase, rather than in the extension. This allows us to create the runtime classloader before the test is loaded. The JUnitTest runner already knows about the Quarkus Extension, so it’s only a small extra bit of knowledge to do some of the startup actions.
This only gets us part of the way, though. @stuartwdouglas raised the point that if we have to set only a single classloader, that's not very flexible, because we have a runtime classloader for each test profile. A Quarkus test run doesn't just use one classloader, it uses several. Every resource/unique profile triggers an app relaunch, which means a new classloader. What I've done to handle this is create a FacadeClassLoader. It takes the classloading requests, and then either routes them on to the quarkus application (for vanilla @QuarkusTests), or, if there's a profile/resource, it makes a new app + classloader and sends the request to that.
What we used to before was load a throwaway copy of the the test, pass it to JUnit discovery, let JUnit launch it, and then intercept the execution, figure out what profiles+resources the test declares, create a quarkus app with that information, start the quarkus app, reload the test with the runtime classloader of the quarkus app (and clone its parameters), and execute the test.
The new model is load a throwaway copy of the the test, figure out what profiles+resources the test declares, create a quarkus app with that information, reload the test with the runtime classloader of the quarkus app, pass the ‘right’ class to JUnit discovery, let JUnit launch it, and then intercept the execution, start the quarkus app, and execute the test.
One fundamental limitation of "load tests with the classloader used to execute them" is that a single test cannot run with multiple classloaders, which means it cannot support multiple profiles. We know some people do use this feature, but we also know there have been suggestions that we drop support for it, since it is complex to support (#45349). There is an easy workaround, which is to use one test per profile.
Thoughts on serialization and cloning
A big initial goal of this PR was to get rid of the xstream serialization, since it didn't work on Java 17+. #40601 fixes this issue by switching to use the JBoss marshaller for serialization. Does that mean this work item isn't needed any more? No, although it does mean its benefits are smaller. Here's why it's still useful:
@TestTemplate
) does not see Quarkus bytecode transformations done by extensionssun.misc.unsafe
, butunsafe
is shrinking. It seems certain the JDK team will have to come up with some solution and API to open up access for serializers, but the final design could have security implications (perhaps opening up access in a blanket way), or performance implications (reflection fun), or user experience implications (a need to manually set flags such as--enable-serialization
?). If we can avoid serialization, we avoid all that.Todo before this merges
Todo after this merges
QuarkusTestExtension