benchmark.sh

Runs benchmarks for a framework across one or all test profiles. The script tunes system settings, builds the Docker image, runs load tests at multiple connection counts, and collects results.

./scripts/benchmark.sh <framework> [profile] [--save]

Options

| Parameter | Description |
|-----------|-------------|
| `<framework>` | Name of the framework directory under `frameworks/` |
| `[profile]` | Optional; run only this test profile (e.g. `baseline`, `json`, `compression`) |
| `--save` | Persist results to `results/` and rebuild site data in `site/data/` |

Without --save, results are displayed but not persisted.

What it does

1. System tuning: sets the CPU governor to performance, increases TCP buffer sizes, flushes filesystem caches, and adjusts the loopback MTU for fragmentation tests
2. Docker build: builds the framework image (or runs `build.sh` if present)
3. Sidecar setup: starts a Postgres container for the async-db and mixed profiles
4. Load testing: for each profile the framework is subscribed to:
   - Runs at each connection count defined for the profile
   - Executes 3 runs per configuration and keeps the best result
   - Uses the appropriate load generator: gcannon (HTTP/1.1), h2load (HTTP/2, gRPC), oha (HTTP/3), gcannon --ws (WebSocket)
5. Result collection: captures RPS, latency (avg/p99), CPU, memory, bandwidth, and reconnect counts
6. Save (with `--save`): writes JSON result files to `results/<profile>/<connections>/<framework>.json` and rebuilds aggregated site data
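
The run selection and result path described in these steps can be sketched in shell. This is a minimal illustration, not the script's actual code: `best_of` and `result_path` are hypothetical helper names, the RPS values are made up, and the sketch assumes "best" means the highest RPS, which the text does not specify.

```shell
#!/bin/sh
# Hypothetical helper: print the largest of the integer RPS values passed as arguments.
# Assumes "best" means highest RPS; the real script may rank runs differently.
best_of() {
  best=$1
  for rps in "$@"; do
    if [ "$rps" -gt "$best" ]; then
      best=$rps
    fi
  done
  echo "$best"
}

# Hypothetical helper: build the result file path in the layout described above.
result_path() {
  # $1 = profile, $2 = connection count, $3 = framework
  echo "results/$1/$2/$3.json"
}

# Made-up RPS values standing in for three real load-generator runs.
echo "best: $(best_of 98000 101500 99750)"
echo "file: $(result_path baseline 512 express)"
```

With these sample values the sketch prints `best: 101500` and `file: results/baseline/512/express.json`.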

Example

# Run the baseline profile and display results without saving
./scripts/benchmark.sh express baseline

# Run all profiles and save
./scripts/benchmark.sh --save express

# Run a single profile and save
./scripts/benchmark.sh --save express json

Benchmark parameters

Each profile defines its own configuration:

- Pipeline depth: 1 (sequential) or 16 (pipelined)
- Connection counts: varies by profile (e.g. 512/4096/16384 for baseline, 64/512 for HTTP/3)
- Duration: 5 seconds per run (15 seconds for mixed)
- Runs: 3 per configuration, best kept
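
As a rough worked example, these parameters give a lower bound on how long one profile spends generating load. The helper name `load_seconds` is hypothetical and not part of `benchmark.sh`; the inputs are the baseline values listed above.

```shell
#!/bin/sh
# Hypothetical helper: total load-generation seconds for one profile.
# $1 = seconds per run, $2 = runs per configuration, remaining args = connection counts.
load_seconds() {
  duration=$1
  runs=$2
  shift 2
  # After the shift, $# is the number of connection counts.
  echo $(( $# * runs * duration ))
}

# Baseline values from the text: 5 s per run, 3 runs, connection counts 512/4096/16384.
echo "baseline: $(load_seconds 5 3 512 4096 16384)s"
```

For baseline this works out to 3 connection counts × 3 runs × 5 seconds = 45 seconds. The figure covers load generation only; system tuning, the Docker build, and any pauses between runs add to it.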
