Add baseline:detect job for microbenchmarks (dd-octo-sts auth) by p-datadog · Pull Request #5539 · DataDog/dd-trace-rb

p-datadog · 2026-04-01T16:44:33Z

What does this PR do?

Adds a microbenchmarks-baseline-detect job that detects the PR's target branch using dd-octo-sts + gh CLI, then passes it to the microbenchmarks job as MANUAL_BASELINE_BRANCH via a dotenv artifact. This bypasses github-find-merge-into-branch inside bp-runner, which crashes because its GitHub App installation is suspended (incident-51987).

This is the more robust alternative to #5538 (which uses a simpler one-line fallback to the default branch). This PR gets the actual PR target branch from GitHub, so it's correct even for PRs targeting non-default branches.

⚠️ Blocked: Requires dd-octo-sts trust policy gitlab.github-access.read to be provisioned for DataDog/dd-trace-rb. This policy exists for dd-trace-py but not for dd-trace-rb. See failing job log:

Error: http error with status code 404: {"code":5,"message":"unable to find trust policy for \"gitlab.github-access.read\"","details":[]}

Prerequisites before this can merge:

Provision dd-octo-sts trust policy gitlab.github-access.read for scope DataDog/dd-trace-rb (admin action)

Motivation:

All dd-trace-rb PRs that touch code paths are currently failing microbenchmarks because the benchmarking platform's GitHub App is suspended. See #5538 for the full root cause analysis.

This PR models the approach dd-trace-py has used since March 2025: a separate baseline detection job that authenticates via dd-octo-sts (independent of the benchmarking platform's GitHub App) and writes the result to a dotenv artifact consumed by the benchmark jobs.

How it works:

New microbenchmarks-baseline-detect job runs in the package stage (before microbenchmarks):
- Uses dd-octo-sts to get a GitHub token scoped to DataDog/dd-trace-rb
- Calls gh pr view --json baseRefName to find the PR's target branch
- Falls back to master if the lookup fails
- Writes MANUAL_BASELINE_BRANCH=<branch> to a dotenv artifact
The microbenchmarks job now has needs: [{job: microbenchmarks-baseline-detect, optional: true, artifacts: true}] instead of needs: []:
- When the detect job runs, MANUAL_BASELINE_BRANCH is set from the dotenv artifact
- The shared run-microbenchmarks__setup.sh template checks for this variable and skips github-find-merge-into-branch entirely
- When the detect job is skipped (e.g., on master where we skip detection), optional: true ensures microbenchmarks still runs — it falls through to the shared template's default behavior
The detect job has the same rules: as the microbenchmarks job (minus the master rule, since on master we always compare against master anyway), so it only runs when microbenchmarks would run.

Comparison with #5538:

	#5538 (Option A)	This PR (Option C)
Change size	1 line	New job (~35 lines)
Baseline accuracy	Default branch only	Actual PR target branch
Auth dependency	None (detects from git remote)	`dd-octo-sts` (not yet provisioned)
PRs targeting non-master	Wrong baseline	Correct baseline
Complexity	Minimal	Moderate

Recommend merging #5538 first to unblock CI immediately, then this PR once dd-octo-sts is provisioned.

Change log entry

None.

Additional Notes:

Cross-tracer analysis of how every tracer handles baseline detection:

Python: Separate baseline:detect job with dd-octo-sts + gh pr view ... || echo "main" (this PR's model)
Go, Cpp, PHP: Custom run-benchmarks.sh with github-find-merge-into-branch || :
Java: Hardcodes origin/master
Dotnet: Downloads baseline from S3
JS: Uses shared template (same bug as rb, but survives accidentally due to bash set -e quirk with variable assignments — runs without baseline comparison)

How to test the change?

Once dd-octo-sts policy is provisioned:

Push a PR that touches lib/ or ext/ and verify:
- microbenchmarks-baseline-detect job runs and outputs MANUAL_BASELINE_BRANCH=master (or the actual target branch)
- microbenchmarks jobs pick up the variable and skip github-find-merge-into-branch
- Baseline comparison runs correctly

Detect the PR's target branch via dd-octo-sts + gh CLI instead of relying on benchmarking-platform's github-find-merge-into-branch, which fails when the GitHub App is suspended (incident-51987). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

datadog-datadog-prod-us1 · 2026-04-01T17:02:25Z

✅ Tests

🎉 All green!

❄️ No new flaky tests detected
🧪 All tests passed

🎯 Code Coverage (details)
• Patch Coverage: 100.00%
• Overall Coverage: 95.36% (-0.02%)

_{This comment will be updated automatically if new data arrives.

🔗 Commit SHA: 6f80321 | Docs | Datadog PR Page | Was this helpful? React with 👍/👎 or give us feedback!}

p-datadog requested a review from a team as a code owner April 1, 2026 16:44

p-datadog added the AI Generated Largely based on code generated by an AI or LLM. This label is the same across all dd-trace-* repos label Apr 1, 2026

p-datadog mentioned this pull request Apr 1, 2026

Fix microbenchmarks crash caused by suspended GitHub App #5538

Closed

p-datadog marked this pull request as draft April 1, 2026 16:55

p-datadog closed this Apr 1, 2026

p-datadog deleted the fix/microbenchmarks-baseline-detect branch April 1, 2026 18:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add baseline:detect job for microbenchmarks (dd-octo-sts auth)#5539

Add baseline:detect job for microbenchmarks (dd-octo-sts auth)#5539
p-datadog wants to merge 1 commit intomasterfrom
fix/microbenchmarks-baseline-detect

p-datadog commented Apr 1, 2026 •

edited

Loading

Uh oh!

datadog-datadog-prod-us1 bot commented Apr 1, 2026 •

edited by datadog-official bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

p-datadog commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

datadog-datadog-prod-us1 bot commented Apr 1, 2026 • edited by datadog-official bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

p-datadog commented Apr 1, 2026 •

edited

Loading

datadog-datadog-prod-us1 bot commented Apr 1, 2026 •

edited by datadog-official bot

Loading