Worker-Versioning GA: Tests for AutoUpgrade workflows not bouncing back and forth. #8635

Shivs11 · 2025-11-13T21:15:17Z

What changed?

WISOTT

Why?

To be sure of the revision number mechanics we have implemented.

How did you test it?

Potential risks

None until we flip the DC switch :)

Note

Enables and rewrites AutoUpgrade no-bounce tests using revision numbers, adds several new lag/child/CAN/retry tests, and updates idle workflow poller to accept context for cancellable polling.

Tests (worker versioning v3):
- Enable/Rewrite: Implement TestAutoUpgradeWorkflows_NoBouncingBetweenVersions using revision-number mechanics and cancellable idle pollers.
- New/Expanded Scenarios (revision numbers):
  - Workflow/Activity TQ lag transitions: TestWorkflowTQLags_DependentActivityStartsTransition, TestActivityTQLags_DependentActivityCompletesOnTheNewVersion.
  - Child workflow behaviors across TQ/version lag: TestChildStartsWithParentRevision_SameTQ_TQAhead, TestChildStartsWithParentRevision_SameTQ_TQLags, TestChildStartsWithNoInheritedAutoUpgradeInfo_CrossTQ.
  - Continue-as-new and retry stability: TestContinueAsNewOfAutoUpgradeWorkflow_RevisionNumberMechanics, testRetryNoBounceBack (and callers) to avoid bounce-back under rollback.
- Skip gating: Gate several tests on useRevisionNumbers instead of useNewDeploymentData.
Infrastructure:
- idlePollWorkflow now takes context.Context and passes it via taskpoller.WithContext, enabling time-bounded idle pollers; update all call sites accordingly.
- Remove ad-hoc sync.WaitGroup uses in favor of context-based cancellation/timeouts.

^{Written by Cursor Bugbot for commit 3a6fab3. This will update automatically on new commits. Configure here.}

tests/versioning_3_test.go

ShahabT · 2025-11-25T18:24:11Z

tests/versioning_3_test.go

+	}}, []string{}, tqTypeWf, tqTypeAct)
+
+	// Wait until all task queue partitions know that v1 is current.
+	s.waitForDeploymentDataPropagation(tv1, versionStatusCurrent, false, tqTypeWf, tqTypeAct)


btw, later when we clean up all old tests we should maybe change this function to wait for the particular revision number propagation rather than status of the version.

IMHO I think this test should be thinking of "propagation complete" == "the right revision number has been synced to matching partitions. We should have tests in the worker_deployment_suite that should test out if the right revision number is synced to these partitions alongside the statuses of each version.

I think having this differences, given we have two suites, would make test writing simpler and easier to read for a new user.

tests/versioning_3_test.go

ShahabT · 2025-11-25T18:27:50Z

tests/versioning_3_test.go

+
 func (s *Versioning3Suite) TestAutoUpgradeWorkflows_NoBouncingBetweenVersions() {
-	s.T().Skip("This test is flaky right now and shall be fixed in a future PR.") // TODO (Shivam)
+	if !s.useNewDeploymentData {


…llWorkflow function so that it can terminate early

tests/versioning_3_test.go

Shivs11 · 2025-11-27T18:08:26Z

tests/versioning_3_test.go

 }

 func (s *Versioning3Suite) idlePollWorkflow(
+	ctx context.Context,


this function shall now take in a context. The reason this was done is as follows:

There are some tests that kickstart a v0 poller (v0 is just an example) for the sole purpose of checking if a task were to ever go this worker. These tests initialize them in a waitGroup and then towards the end of the test, just before it gets cleaned up, there is a wg.Wait() present which ensures that this poller completes running before the test can be terminated safely.

However, this does mean that the minimum wait time for the test will never be lesser than the minimum poll time of this v0 poller. This makes our tests long to run. Thus, passing this context and cancelling it is a safe and efficient way to terminate the test.

ShahabT · 2025-11-29T02:19:52Z

tests/versioning_3_test.go

 }

 func (s *Versioning3Suite) idlePollWorkflow(
+	ctx context.Context,


Shivs11 added 3 commits November 13, 2025 15:45

[draft]

26f1052

single autoupgrade workflow should not bounce back and forth

8fc77bb

multiple autoupgrade workflows should not bounce

0fb6385

Shivs11 marked this pull request as ready for review November 13, 2025 22:03

Shivs11 requested review from a team as code owners November 13, 2025 22:03

ShahabT reviewed Nov 25, 2025

View reviewed changes

Shivs11 mentioned this pull request Nov 26, 2025

Add revision number mechanics for child and CAN workflows #8632

Merged

5 tasks

Shivs11 added 2 commits November 27, 2025 12:01

Merge branch 'main' into ss/fix-autoupgrade-bouncing-tests

34a2fe7

changed the test to make it use the SDK + pass in a context to idlePo…

7b07cf7

…llWorkflow function so that it can terminate early

cursor bot reviewed Nov 27, 2025

View reviewed changes

tests/versioning_3_test.go Outdated Show resolved Hide resolved

Shivs11 commented Nov 27, 2025

View reviewed changes

Shivs11 added 2 commits November 27, 2025 13:17

remove waitGroup from tests and pass in pollerCtx

81b8c8a

nits

3a6fab3

ShahabT approved these changes Nov 29, 2025

View reviewed changes

tests/versioning_3_test.go

}

func (s *Versioning3Suite) idlePollWorkflow(

ctx context.Context,

Copy link

Contributor

ShahabT Nov 29, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nice!

Shivs11 merged commit f0544f0 into main Dec 1, 2025
59 checks passed

Shivs11 deleted the ss/fix-autoupgrade-bouncing-tests branch December 1, 2025 17:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Worker-Versioning GA: Tests for AutoUpgrade workflows not bouncing back and forth. #8635

Worker-Versioning GA: Tests for AutoUpgrade workflows not bouncing back and forth. #8635

Uh oh!

Shivs11 commented Nov 13, 2025 •

edited by cursor bot

Loading

Uh oh!

Uh oh!

ShahabT Nov 25, 2025

Uh oh!

Shivs11 Nov 27, 2025

Uh oh!

Uh oh!

Uh oh!

ShahabT Nov 25, 2025

Uh oh!

Uh oh!

Shivs11 Nov 27, 2025

Uh oh!

ShahabT Nov 29, 2025

Uh oh!

ShahabT Nov 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Worker-Versioning GA: Tests for AutoUpgrade workflows not bouncing back and forth. #8635

Worker-Versioning GA: Tests for AutoUpgrade workflows not bouncing back and forth. #8635

Uh oh!

Conversation

Shivs11 commented Nov 13, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changed?

Why?

How did you test it?

Potential risks

Uh oh!

Uh oh!

ShahabT Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

Shivs11 Nov 27, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ShahabT Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Shivs11 Nov 27, 2025

Choose a reason for hiding this comment

Uh oh!

ShahabT Nov 29, 2025

Choose a reason for hiding this comment

Uh oh!

ShahabT Nov 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Shivs11 commented Nov 13, 2025 •

edited by cursor bot

Loading