Defer updating workflow completion metrics until completion accepted by server by cretz · Pull Request #2742 · temporalio/sdk-java

cretz · 2025-12-03T13:47:42Z

What was changed

Defer updating workflow completion metrics until the server accepts the completion. The metrics to update are captured in the task handler result, and a post-completion runnable is invoked only if the gRPC call succeeds.

This technically changes behavior: metrics are no longer recorded in situations such as unhandled commands, which is more accurate.
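
A minimal sketch of the deferral pattern described above, not the SDK's actual code. Every name here (`DeferredMetricsSketch`, `Counters`, `TaskResult`, `CompletionRpc`, `complete`) is hypothetical and only illustrates capturing the metric in the handler result and recording it after the RPC succeeds.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch; none of these names are taken from the SDK. */
public class DeferredMetricsSketch {

  /** Stand-in for a metrics scope: counter name -> value. */
  static final class Counters {
    private final Map<String, Long> values = new HashMap<>();

    void inc(String name) {
      values.merge(name, 1L, Long::sum);
    }

    long get(String name) {
      return values.getOrDefault(name, 0L);
    }
  }

  /** Stand-in for the task handler result; it carries the metric to record later. */
  static final class TaskResult {
    final String completionCounter; // null when the task did not complete the workflow

    TaskResult(String completionCounter) {
      this.completionCounter = completionCounter;
    }

    /** Builds the runnable that records the captured metric. */
    Runnable postCompletion(Counters counters) {
      return () -> {
        if (completionCounter != null) {
          counters.inc(completionCounter);
        }
      };
    }
  }

  /** Stand-in for the gRPC call that reports the task completion. */
  interface CompletionRpc {
    void send(TaskResult result) throws Exception;
  }

  /** Sends the result and runs the deferred metric update only if the call succeeded. */
  static boolean complete(TaskResult result, CompletionRpc rpc, Counters counters) {
    Runnable onAccepted = result.postCompletion(counters);
    try {
      rpc.send(result);
    } catch (Exception e) {
      return false; // rejected (for example, unhandled command): nothing is recorded
    }
    onAccepted.run();
    return true;
  }

  public static void main(String[] args) {
    Counters counters = new Counters();
    TaskResult result = new TaskResult("workflow_completed");

    // Simulated rejection: the counter stays untouched.
    complete(result, r -> { throw new IllegalStateException("unhandled command"); }, counters);
    System.out.println(counters.get("workflow_completed")); // prints 0

    // Simulated acceptance: the counter is incremented once.
    complete(result, r -> {}, counters);
    System.out.println(counters.get("workflow_completed")); // prints 1
  }
}
```

The point of the shape is that the handler never touches the counters directly; it only describes what should be recorded, and the caller that owns the RPC decides whether to record it.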

Checklist

Closes #1590 (workflow_completed counter counts successful completions of workflow method instead of workflow executions)

cretz · 2025-12-03T14:14:05Z

temporal-sdk/src/main/java/io/temporal/internal/replay/ReplayWorkflowExecutor.java


    if (context.isCancelRequested()) {
      workflowStateMachines.cancelWorkflow();
-      metricsScope.counter(MetricsType.WORKFLOW_CANCELED_COUNTER).inc(1);


These completion counters are now tracked inside the workflow state machines object.

temporal-sdk/src/test/java/io/temporal/client/functional/MetricsTest.java

Quinn-With-Two-Ns · 2025-12-03T22:55:02Z

@maciejdudko as co-maintainer may want to review as well

maciejdudko

Tests could be more split up but it's not blocking. LGTM

maciejdudko · 2025-12-08T22:42:51Z

temporal-sdk/src/test/java/io/temporal/client/functional/MetricsTest.java

+          assertEquals(
+              2, counts.get(io.temporal.worker.MetricsType.WORKFLOW_COMPLETED_COUNTER).intValue());
+          assertEquals(
+              1, counts.get(io.temporal.worker.MetricsType.WORKFLOW_FAILED_COUNTER).intValue());
+          assertEquals(
+              1,
+              counts
+                  .get(io.temporal.worker.MetricsType.WORKFLOW_CONTINUE_AS_NEW_COUNTER)
+                  .intValue());
+          assertEquals(
+              1, counts.get(io.temporal.worker.MetricsType.WORKFLOW_CANCELED_COUNTER).intValue());


Nit: splitting each workflow run into a separate test case would more strongly verify that the right counter was incremented in each scenario. Could also check for the presence of WORKFLOW_E2E_LATENCY.

I split the tests. I wasn't going to at first and planned to add the e2e check in here, but it turns out we calculate e2e latency as current time minus start event time, and the latter is extremely skewed under time skipping, which causes negative values (which Tally ignores). So it was easier to check in separate tests.
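
To illustrate the arithmetic in the reply above, here is a small sketch of how "current minus start event time" goes negative once the start event timestamp is ahead of the current time. The class name, method name, and the one-hour skew are made up for illustration; this is not the SDK's latency code.

```java
import java.time.Duration;
import java.time.Instant;

/** Hypothetical sketch of the e2e latency arithmetic; not SDK code. */
public class E2eLatencySketch {

  /** End-to-end latency computed as "current time minus start event time". */
  static Duration e2eLatency(Instant startEventTime, Instant now) {
    return Duration.between(startEventTime, now);
  }

  public static void main(String[] args) {
    Instant now = Instant.parse("2025-12-09T14:00:00Z");

    // Assumed skew for illustration: the start event time is one hour ahead of "now".
    Instant skewedStart = now.plus(Duration.ofHours(1));

    Duration latency = e2eLatency(skewedStart, now);
    System.out.println(latency.isNegative()); // prints true
  }
}
```

A timer that drops negative durations would therefore report nothing in such a run, which matches the reason given for checking the latency metric in separate tests.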

Defer updating workflow completion metrics until completion accepted …

8fbd115

…by server Fixes temporalio#1590

cretz requested a review from a team as a code owner December 3, 2025 13:47

cretz commented Dec 3, 2025

Quinn-With-Two-Ns reviewed Dec 3, 2025

Quinn-With-Two-Ns approved these changes Dec 3, 2025

Use assertEventually

e25b588

maciejdudko approved these changes Dec 8, 2025

cretz and others added 3 commits December 9, 2025 14:46

Split tests

7e57085

Merge branch 'master' into metrics-after-rpc

d67f50e

Test fix

e85b5ee

cretz merged commit a39d10c into temporalio:master Dec 10, 2025
23 of 24 checks passed

cretz deleted the metrics-after-rpc branch December 10, 2025 13:20

cretz mentioned this pull request Dec 18, 2025

[Feature Request] Workflow completion metrics should not be updated if recording task response fails temporalio/sdk-core#1084

Closed

cretz merged 5 commits into temporalio:master from cretz:metrics-after-rpc
