Add GHA coordinator for performance evaluation task scatter/gather by cjonas9 · Pull Request #791 · stellar/stellar-rpc

cjonas9 · 2026-06-17T20:19:15Z

What

Adds a GitHub Actions coordinator (load-test-coordinator.yml) that, on a push to a release/** branch, launches the release performance-evaluation leg(s) as callable workflows and posts their consolidated results as a sticky comment on the release PR. The apply-load ingest load test (#741) is adapted into a callable leg that reports back (outputs + an S3 result object) instead of commenting itself.

Decomposes much of the existing infrastructure into more modular components for future coordinator-driven load tests. About +870/-721 of the diff is pure refactoring of existing code into a new harness package, so it does not need as thorough a review as the coordinator-heavy sections. Here is the breakdown of that refactoring:

- Generic code lifted out of runner/orchestrate.go and runner/instantiate.go into an importable harness package (mechanical changes only)
  - the package consists of gather.go, s3.go, exec.go, harness.go, and harness_test.go
- Generic code lifted out of run-load-test.sh and placed into bootstrap-common.sh, mostly as helper functions
  - run-load-test.sh is now extremely slim and only serves to actually run the ingestion load test
- Generic code lifted out of load-test.yml and put in ec2-leg.yml, which handles the EC2 lifecycle with no test-specific hardcoding

Load-test results are posted as a sticky comment on the release PR. If more than one push is made, or multiple runs are requested, each previous run's results are folded into dropdown menus like this:

🧪 Performance Evaluation Test #N...
[recent results for run N]

Performance Evaluation Test #N-1

`[results for run N-1]`

Performance Evaluation Test #...

`[results for run ...]`

Performance Evaluation Test #1

`[results for run 1]`

Or, you can just look in the comments below to see how this looks!

The high-level flow of the coordinator is below:

flowchart TB
  trigger["push to release/** "] --> plan["perf-eval-coordinator.yml<br/>- plan<br/>- resolve ref + PR"]
  plan --> lt["load-test.yml<br/>runs synthetic ledger ingestion test"]
  plan -.-> future["future legs"]
  lt --> report["report<br/>aggregate → sticky PR comment"]
  future -.-> report

Why

This is so that the remaining perf-eval tests slot in cleanly: each new test becomes another callable leg the coordinator launches and folds into the same report, with no per-leg reporting or permissions plumbing to duplicate.
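As a rough illustration of why a shared gather step removes per-leg reporting, the sketch below folds any number of leg results into one report and one pass/fail gate. The `LegResult` type, its field names, the `gather` function, the failure icon, and the example bucket and key are all assumptions for illustration. The PR only states that legs report a verdict plus an S3 bucket and key.

```go
package main

import "fmt"

// LegResult is a hypothetical shape for what a callable leg reports back.
// The PR mentions verdict, bucket, and key outputs; the field names are assumed.
type LegResult struct {
	Name    string
	Verdict string // "ok" on success; anything else is treated as a failure here
	Bucket  string
	Key     string
}

// gather folds per-leg results into one report line per leg and a single
// gate that is true only when every leg reported "ok".
func gather(legs []LegResult) (report string, allOK bool) {
	allOK = true
	for _, l := range legs {
		icon := "✅"
		if l.Verdict != "ok" {
			icon = "❌"
			allOK = false
		}
		report += fmt.Sprintf("%s %s: verdict: %s (s3://%s/%s)\n",
			icon, l.Name, l.Verdict, l.Bucket, l.Key)
	}
	return report, allOK
}

func main() {
	report, ok := gather([]LegResult{
		{Name: "Apply-load ingestion", Verdict: "ok", Bucket: "example-bucket", Key: "results/ingest.json"},
	})
	fmt.Print(report)
	fmt.Println("all legs ok:", ok)
}
```

Under this shape, adding a new leg means appending one more `LegResult` to the slice; the reporting and gating code stays unchanged.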

Known limitations

N/A. The remaining work in this task's epic is mainly finishing the other perf-eval tasks; only the ingest load test is complete at this point.

Copilot

Pull request overview

Introduces a GitHub Actions “scatter/gather” coordinator for release performance-evaluation runs, converting the existing ingestion load test into a callable workflow “leg” that reports its results via outputs + an S3 result object, then consolidating leg results into a single sticky PR comment.

Changes:

Adds a new load-test-coordinator.yml workflow that resolves the release PR context, fans out to callable perf-eval legs, and posts an aggregated sticky PR comment.
Refactors the ingestion load test workflow (load-test.yml) into a workflow_call-only leg with structured outputs (bucket/key/verdict/etc.).
Adds a Go-based coordinator comment renderer (coordinator-runner) plus new apply-load scenario configs and supporting runner tweaks.

Reviewed changes

Copilot reviewed 8 out of 13 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| `cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/ingest-load-test/testdata/apply-load-v27-soroswap.cfg` | Adds v27 Soroswap apply-load profile config for generating ingestible meta corpus. |
| `cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/ingest-load-test/testdata/apply-load-v27-sac.cfg` | Adds v27 SAC apply-load profile config (with disjoint classic payment window notes). |
| `cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/ingest-load-test/testdata/apply-load-v27-oz.cfg` | Adds v27 OZ (custom token) apply-load profile config. |
| `cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/ingest-load-test/runner/runner_test.go` | Adds unit tests for result encoding/decoding, S3 “not found” detection, and tail buffer behavior. |
| `cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/ingest-load-test/runner/orchestrate.go` | Extends leg outputs to include verdict/bucket/key; improves timeout reporting metadata. |
| `cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/ingest-load-test/runner/instantiate.go` | Clarifies result-object contract and scenario naming; improves failure-path explanation. |
| `cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/ingest-load-test/run-load-test.sh` | Updates bootstrap/runner handoff and adds a self-terminate ceiling; adds (currently always-on) S3 log upload hook. |
| `cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/coordinator-runner.go` | New tool to render the sticky “Performance Evaluation Test #N” comment by fetching leg results from S3 and folding history. |
| `cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/coordinator-runner_test.go` | Unit tests for numbering/history-folding and leg rendering behavior. |
| `.gitignore` | Ignores generated `.xdr.zstd` corpora and a refresh tool build artifact. |
| `.github/workflows/load-test.yml` | Converts the ingest load test to a callable workflow leg with outputs and artifacts; removes direct PR commenting. |
| `.github/workflows/load-test-coordinator.yml` | New coordinator workflow: plan → leg fan-out → aggregate/report sticky PR comment. |
| `.github/workflows/e2e.yml` | Pins the reusable system-test workflow reference to a specific commit SHA. |

Comments suppressed due to low confidence (1)

cmd/stellar-rpc/internal/integrationtest/infrastructure/perf-eval/ingest-load-test/run-load-test.sh:78

This block is labeled “temporary scaffolding” but is currently always enabled and uploads the full user-data log to S3 on every run. If it’s intended only for debugging, it should be gated behind an opt-in env var (or removed) to avoid unexpected S3 writes and potential log retention concerns.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f1b4473315

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you:

- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-06-26T18:44:30Z

+  push:
+    # temporary scaffolding: before merge to main, replace with line
+    # branches: [release/**]
+    branches: [release/**, gha-coordinator]


Remove the temporary gha-coordinator trigger

With this branch filter left in place, any push to gha-coordinator will run the full release load-test coordinator, including assuming the AWS role and launching the c5.2xlarge leg, even though the workflow is meant to run only for release branches. The adjacent comment says this is temporary before merge; please drop the extra branch before shipping so non-release pushes cannot start expensive perf runs or post PR comments.

github-actions · 2026-06-26T20:54:10Z

🧪 Performance Evaluation Test #5

Commit: 4ced1a8834be (gha-coordinator)
Run: https://github.com/stellar/stellar-rpc/actions/runs/28541434157

✅ Apply-load ingestion — verdict: ok

📈 Ingest load test — `4ced1a8`

| Profile | Ledgers | ms/ledger | p50 / p95 / p99 (ms) | max (ms) |
| --- | --- | --- | --- | --- |
| load-test-ledgers-v27-oz | 1000 | 1197.087 | 1116.440 / 1631.880 / 1908.876 | 2787.808 |
| load-test-ledgers-v27-sac | 1000 | 1098.122 | 1108.570 / 1180.084 / 1235.298 | 1306.027 |
| load-test-ledgers-v27-soroswap | 1000 | 792.490 | 804.127 / 868.934 / 930.758 | 1062.309 |

| Metric | Value |
| --- | --- |
| Ledgers replayed | 3000 |
| Initial DB ledger count | 120960 |
| Throughput | 0.94 ledgers/sec |
| Elapsed wall-clock | 3188.469s |
| Ingest busy-time | 3087.699s (96.8% utilization) |
| Per-ledger p50 / p95 / p99 | 1051.843 / 1415.404 / 1742.786 ms |
| Golden DB fetch+decompress | 2435s |
| stellar-core | `v27.0.0` |
| Workflow run | #28541434157-1 |

Performance Evaluation Test #4

Commit: c91c61f6e113 (gha-coordinator)
Run: https://github.com/stellar/stellar-rpc/actions/runs/28408420248

✅ Apply-load ingestion — verdict: ok

📈 Ingest load test — `c91c61f`

| Profile | Ledgers | ms/ledger | p50 / p95 / p99 (ms) | max (ms) |
| --- | --- | --- | --- | --- |
| load-test-ledgers-v27-oz | 1000 | 1194.497 | 1110.690 / 1625.262 / 1905.351 | 3036.549 |
| load-test-ledgers-v27-sac | 1000 | 1094.507 | 1103.723 / 1177.603 / 1239.945 | 1300.516 |
| load-test-ledgers-v27-soroswap | 1000 | 791.392 | 802.718 / 866.760 / 923.818 | 1037.579 |

| Metric | Value |
| --- | --- |
| Ledgers replayed | 3000 |
| Initial DB ledger count | 120960 |
| Throughput | 0.94 ledgers/sec |
| Elapsed wall-clock | 3189.754s |
| Ingest busy-time | 3080.396s (96.6% utilization) |
| Per-ledger p50 / p95 / p99 | 1047.597 / 1415.372 / 1741.242 ms |
| Golden DB fetch+decompress | 2456s |
| stellar-core | `v27.0.0` |
| Workflow run | #28408420248-1 |

Performance Evaluation Test #3

Commit: d846b52d17c1 (gha-coordinator)
Run: https://github.com/stellar/stellar-rpc/actions/runs/28395013091

✅ Apply-load ingestion — verdict: ok

📈 Ingest load test — `d846b52`

| Profile | Ledgers | ms/ledger | p50 / p95 / p99 (ms) | max (ms) |
| --- | --- | --- | --- | --- |
| load-test-ledgers-v27-oz | 1000 | 1198.571 | 1117.194 / 1624.044 / 1914.508 | 3391.985 |
| load-test-ledgers-v27-sac | 1000 | 1097.772 | 1107.969 / 1180.118 / 1237.781 | 1300.999 |
| load-test-ledgers-v27-soroswap | 1000 | 792.332 | 802.249 / 866.830 / 923.025 | 1033.783 |

| Metric | Value |
| --- | --- |
| Ledgers replayed | 3000 |
| Initial DB ledger count | 120960 |
| Throughput | 0.94 ledgers/sec |
| Elapsed wall-clock | 3188.566s |
| Ingest busy-time | 3088.675s (96.9% utilization) |
| Per-ledger p50 / p95 / p99 | 1051.098 / 1419.916 / 1730.795 ms |
| Golden DB fetch+decompress | 2441s |
| stellar-core | `v27.0.0` |
| Workflow run | #28395013091-1 |

Performance Evaluation Test #2

Commit: c601be50f256 (gha-coordinator)
Run: https://github.com/stellar/stellar-rpc/actions/runs/28265430398

✅ Apply-load ingestion — verdict: ok

📈 Ingest load test — `c601be5`

| Profile | Ledgers | ms/ledger | p50 / p95 / p99 (ms) | max (ms) |
| --- | --- | --- | --- | --- |
| load-test-ledgers-v27-oz | 1000 | 1197.610 | 1114.931 / 1630.206 / 1907.536 | 3407.476 |
| load-test-ledgers-v27-sac | 1000 | 1097.030 | 1106.600 / 1179.565 / 1235.090 | 1304.292 |
| load-test-ledgers-v27-soroswap | 1000 | 791.518 | 802.001 / 867.143 / 922.698 | 1039.465 |

| Metric | Value |
| --- | --- |
| Ledgers replayed | 3000 |
| Initial DB ledger count | 120960 |
| Throughput | 0.94 ledgers/sec |
| Elapsed wall-clock | 3189.277s |
| Ingest busy-time | 3086.158s (96.8% utilization) |
| Per-ledger p50 / p95 / p99 | 1050.350 / 1413.260 / 1733.772 ms |
| Golden DB fetch+decompress | 2410s |
| stellar-core | `v27.0.0` |
| Workflow run | #28265430398-1 |

Performance Evaluation Test #1

Commit: f1b4473315a8 (gha-coordinator)
Run: https://github.com/stellar/stellar-rpc/actions/runs/28257646240

✅ Apply-load ingestion — verdict: ok

📈 Ingest load test — `f1b4473`

| Profile | Ledgers | ms/ledger | p50 / p95 / p99 (ms) | max (ms) |
| --- | --- | --- | --- | --- |
| load-test-ledgers-v27-oz | 1000 | 1198.111 | 1116.909 / 1624.761 / 1918.004 | 2807.759 |
| load-test-ledgers-v27-sac | 1000 | 1098.417 | 1108.963 / 1180.559 / 1237.718 | 1301.012 |
| load-test-ledgers-v27-soroswap | 1000 | 792.573 | 803.499 / 868.011 / 923.528 | 1036.343 |

| Metric | Value |
| --- | --- |
| Ledgers replayed | 3000 |
| Initial DB ledger count | 120960 |
| Throughput | 0.94 ledgers/sec |
| Elapsed wall-clock | 3188.317s |
| Ingest busy-time | 3089.101s (96.9% utilization) |
| Per-ledger p50 / p95 / p99 | 1052.390 / 1416.426 / 1738.711 ms |
| Golden DB fetch+decompress | 2436s |
| stellar-core | `v27.0.0` |
| Workflow run | #28257646240-1 |

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: c601be50f2

chatgpt-codex-connector · 2026-06-26T21:16:09Z

+  report:
+    name: Aggregate + report
+    needs: [plan, load-test]
+    if: always()


Prevent canceled coordinator runs from posting stale reports

In this workflow I checked the top-level concurrency.cancel-in-progress: true block. When a newer push supersedes an in-flight release run while this aggregation job is running, the always() condition lets the canceled run keep executing: GitHub's workflow cancellation reference explicitly notes that jobs whose condition still evaluates to true, such as always(), are not canceled. That stale report can still edit the sticky PR comment for the superseded SHA after a newer run has started or even completed. Keep reporting on failed legs, but exclude canceled runs before posting (for example by adding !cancelled()).

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d846b52d17

chatgpt-codex-connector · 2026-06-29T18:52:11Z

+          fi
+
+          # Render the new body (fetches each leg's S3 result, folds prior runs).
+          printf '%s' "$PREV" | go run "$PERF_EVAL" > /tmp/comment.md


Stop when report rendering fails

If this go run exits non-zero (for example due to a transient Go setup/compile failure or a renderer error), the script does not stop, because the shell only sets -uo pipefail and not -e. Execution continues with /tmp/comment.md (created by the redirection, possibly empty or partial), so the workflow can warn or post bad report content and still let the final gate pass when the leg outputs say success. Keep the gh api calls tolerant if desired, but make renderer failures abort before posting.

chatgpt-codex-connector · 2026-07-01T19:10:49Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

cjonas9 added 30 commits May 8, 2026 21:32

pull initial work from branch load-testing

3392f09

add ledger generation test adapted for RPC

ffe27bc

add apply load config

65da3b5

add generated ledger output to infrastructure/testdata/

34f086d

add basic ingestion of synthetic ledgers phase

80c982d

disable debug logs for load test for timeout reasons

94c69b7

add functions for snapshotting + restoring test DB

f4a16f9

improve ad restructure db restoration helpers/API

1d53b96

finish DB restoration logic flow and wiring

baf2255

skip migrations/fee-stats in load test mode

1647464

ingest test: refactor, minor semantic fixes

2f14765

test.go: add retention window to config, fix fake history archive for real DB test

d7c90a9

minor db restore/trim helper fixes

0390757

rename restore backed-up ledgers function for accuracy

e0a86e7

refactor, add env vars, change DB helpers to take sequences

f151a35

remove db restoration functionality

bffb101

add performance metrics json emission functionality

e04d51d

migrate to polling getHealth, change ingest test limits to 1000 ledgers

bd8c784

remove ledger fixtures

c7bc001

add workflow and script

786423d

fix yaml referencing wrong path for script

1606829

fix yml parsing indentation bug

7d41b1a

use head-object for metadata rather than tags

b701108

refine workflow + instance script

b1cec1d

add apply load cfg

b9ef27e

testing: on-push runs

73df1e7

minor yml syntax fixes

241bdf8

set test e2e.yml + add debugging info from instance to ssm

1161e3f

skip e2e.yml for testing, add retry loop for root volume lookup

47437f4

build-libs over build-stellar-rpc to cut time back

008f327

cjonas9 added 11 commits June 23, 2026 14:45

bump go sdk again, merge SDK main into loadtest-patch

8333974

Merge remote-tracking branch 'origin/main' into apply-load

f3f3d58

# Conflicts: # go.mod # go.sum

update comments in light of changes

a6241f6

merge apply-load recent work into coordinator

855527c

rework handshake into instance->s3 results push, fix minor leaks + OOM risks

8d5f523

replace polling-based ledger timing computation with daemon hook

21b8045

remove trigger-on-push behavior

a8931fe

simplify significantly, refactor into on-release-push job launcher + commenter

b15e2e3

moved ingest load test go programs into separate folder

fbd8b2b

refactored coordinator logic into go

237578c

Base automatically changed from apply-load to main June 26, 2026 17:37

cjonas9 added 2 commits June 26, 2026 14:20

Merge remote-tracking branch 'origin/main' into gha-coordinator

4651084

# Conflicts: # .github/workflows/load-test.yml # cmd/stellar-rpc/internal/ingest/service.go # cmd/stellar-rpc/internal/integrationtest/ingest_loadtest_test.go # go.mod # go.sum

add temporary trigger and log->s3 for testing

f1b4473

cjonas9 marked this pull request as ready for review June 26, 2026 18:37

Copilot AI review requested due to automatic review settings June 26, 2026 18:37

Copilot started reviewing on behalf of cjonas9 June 26, 2026 18:38

Copilot AI reviewed Jun 26, 2026

Comment thread .github/workflows/load-test.yml Outdated

Comment thread .github/workflows/load-test-coordinator.yml

chatgpt-codex-connector Bot reviewed Jun 26, 2026

bring main verbosity cleanups into coordinator

c601be5

chatgpt-codex-connector Bot reviewed Jun 26, 2026

decompose shared ec2/gha programming into reusable modular components

d846b52

chatgpt-codex-connector Bot reviewed Jun 29, 2026

cjonas9 added 2 commits June 29, 2026 15:15

move run-load-test.sh scripting into run_leg helper in common shell script file

f83c4d8

refactor into callable workflow, factor out common main Run() func

c91c61f

cjonas9 linked an issue Jun 30, 2026 that may be closed by this pull request

Release Eval: Add the coordinator GitHub Action #707

Open

cjonas9 added this to the platform sprint 73 milestone Jun 30, 2026

Merge branch 'main' into gha-coordinator

4ced1a8
