-
Notifications
You must be signed in to change notification settings - Fork 156
petri: lower severity of com3 emitted logs for all vmm_tests #2003
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
petri/src/vm/mod.rs
Outdated
Some(client_ref) => { | ||
tracing::info!("Agent is online now"); | ||
let sh = client_ref.unix_shell(); | ||
pipette_client::cmd!(sh, "dmesg -n 3").run().await?; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We shouldn't need to do any system level filtering like this, instead we can alter the value for OPENVMM_LOG that we pass to the command line during vm construction.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That said, we probably shouldn't be doing this in tests anyways. If a test is too noisy we should lower the levels of individual events from debug to trace.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i disagree - i think we want com3 output, but we want to lower the output we spit out once we have better channels available (like kmsg). com3 is really really slow, so we shouldn't rely on verbose output there of any kind.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
note that this is because early output (kernel/bootshim) we have no other mechanism. we could also perhaps look at virtio serial for both as well in a future change.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In that case we'd need to handle reboots here too.
petri/src/vm/mod.rs
Outdated
self.wait_for_agent().await | ||
let client = self.wait_for_agent().await?; | ||
|
||
match self.openhcl_diag() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will refactor the duplicated code, but I wanted to get two things done first:
- List of test failures
- Feedback on the nature of these changes, to see if I'm on the right track.
#2003 Introduced a change to potentially fix a flaky vmm_test related to validating mnf usage in guests. However, it looks like the test flakiness has not been resolved. I am disabling the test for now to unblock contributors while I have a proper chance to investigate the issue. --------- Co-authored-by: Daman Mulye <[email protected]>
…icrosoft#2003)" This reverts commit a1ac181.
#1937 introduced new vmm_tests which seem to be flaky as they will occasionally timeout. Upon further investigation, the following message can be observed in the petri logs:
hv_vmbus: probe failed for device 766e96f8-2ceb-437e-afe3-a93169e48a7b
. It's likely that openhcl is busy with other tasks (such as writing to com3) so it is unable to handle the vmbus probe request from vtl0. This PR attempts to fix that by only emitting high severity logs once kmsg is available.