fix data race in module reload tests #5433
Conversation
cli/progress_manager.go
Outdated
pm.currentSpinner.Success(" " + prefix + message + elapsed)
pm.currentSpinner = nil
// Give the spinner goroutine time to finish to avoid race conditions
time.Sleep(10 * time.Millisecond)
Do we know what the root cause is? I feel this isn't a fix but rather just a guess at making something less likely.
Oops, I did not mean to commit this. It was originally for troubleshooting. I'll remove this and the other two below.
looks reasonable to me, modulo Dan's comment.
map[string]any{
	moduleFlagPath: manifestPath, generalFlagPartID: "part-123",
	moduleBuildFlagNoBuild: true, moduleFlagLocal: true,
	generalFlagNoProgress: true, // Disable progress spinner to avoid race conditions in tests
It seems the progress spinner is considered harmful. Did we identify what the race is?
Sorry, I missed that part of your comment earlier! It looks like the race condition is internal to the pterm library, specifically in the Success() call. I tried adding a mutex wrapper around all calls to the library, for example the following method, but it still resulted in a data race:
// Success forwards to the wrapped pterm spinner while holding the wrapper's mutex.
func (s *synchronizedSpinner) Success(message ...any) {
s.mu.Lock()
defer s.mu.Unlock()
if s.spinner == nil {
return
}
s.spinner.Success(message...)
}
I found an open GH issue related to this problem: pterm/pterm#482.
When running the CLI outside of tests, I haven't seen any issues while running viam module reload (I tested it a lot over the past two weeks).
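To illustrate why the wrapper above can't help, here is a minimal, self-contained sketch (not pterm's actual code) of the general pattern: when a library starts its own render goroutine that touches shared state without locking, caller-side synchronization never covers the conflicting access.

package main

import (
	"sync"
	"time"
)

// spinner mimics a library whose background render goroutine reads `text`
// without synchronization. Purely illustrative; not pterm's implementation.
type spinner struct {
	text string
	done chan struct{}
}

func startSpinner() *spinner {
	s := &spinner{done: make(chan struct{})}
	go func() { // library-internal render loop
		for {
			select {
			case <-s.done:
				return
			default:
				_ = s.text // unsynchronized read
				time.Sleep(time.Millisecond)
			}
		}
	}()
	return s
}

func (s *spinner) Success(msg string) {
	s.text = msg // unsynchronized write
	close(s.done)
}

// wrapper adds caller-side locking, but the render goroutine above never
// takes this mutex, so the read/write conflict on s.text remains.
type wrapper struct {
	mu sync.Mutex
	s  *spinner
}

func (w *wrapper) Success(msg string) {
	w.mu.Lock()
	defer w.mu.Unlock()
	w.s.Success(msg)
}

func main() {
	w := &wrapper{s: startSpinner()}
	time.Sleep(5 * time.Millisecond) // let the render loop read s.text at least once
	w.Success("done")                // running with -race flags the conflicting access
}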
Will think about it. There aren't really any good libs for this in Go.
Alright, all cleaned up, and I made sure the tests pass in --race mode.
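For reference, the race-mode run here is the standard Go toolchain invocation; assuming the CLI package layout matches the file paths above, it would look like:

go test -race ./cli/...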
I'm bought in on the approach. Now I'm just curious why there are a bunch of other code changes as part of the PR.
cli/module_reload_test.go
Outdated
t.Run("addsServiceWhenMissing", func(t *testing.T) {
	part, _ := vc.getRobotPart("id")
	_, err := addShellService(cCtx, vc, logging.NewTestLogger(t), part.Part, false)
	// Create isolated setup for this subtest to avoid shared state
I buy that the underlying library is the cause and that disabling it serves our goals here.
What's with these other changes? Were there other races not reported by the failure I saw that this fixes?
Yeah, I was too aggressive with trying out different mechanisms to avoid the race. My bad; I will clean up my PR and delete everything except setting the --no-progress flag.
cli/module_reload_test.go
Outdated
)

err = reloadModuleActionInner(cCtx, vc, parseStructFromCtx[reloadModuleArgs](cCtx), logger, false)
// Create isolated logger for this subtest
I see this is done elsewhere -- but do we know what the consequence is of not creating a separate logger? And just using the top-level test logger?
My last PR introduced a data race issue in tests that wasn't seen during development. This PR suppresses the status spinners while tests are running. There are separate tests in progress_manager_test.go that cover the spinner functionality, so I think it's fine not to test it at the higher level.
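As a rough sketch of the approach (the type, field, and method names below are assumptions for illustration, not the CLI's actual progress-manager API): when the no-progress flag is set, spinner operations become no-ops, so tests never start pterm's background goroutine at all.

package cli

import "github.com/pterm/pterm"

// Illustrative only: with --no-progress set, no pterm spinner (and therefore
// no render goroutine) is ever created, so there is nothing for the race
// detector to flag.
type progressManager struct {
	noProgress     bool
	currentSpinner *pterm.SpinnerPrinter
}

func (pm *progressManager) startSpinner(msg string) {
	if pm.noProgress {
		return // nothing rendered in tests
	}
	pm.currentSpinner, _ = pterm.DefaultSpinner.Start(msg)
}

func (pm *progressManager) succeed(msg string) {
	if pm.noProgress || pm.currentSpinner == nil {
		return
	}
	pm.currentSpinner.Success(msg)
	pm.currentSpinner = nil
}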