Run 1772762787

Trial run

Warning — trial run. This benchmark was executed with fewer than 10 instances per fuzzer and/or a time budget shorter than 24h. Results from trial runs are meant for debugging purposes and are not valid for extracting conclusions across different fuzzers.

Charts

  • Bugs Over Time
  • Time To K
  • Final Distribution
  • Plateau And Late Share
  • Invariant Overlap (UpSet)
  • Coverage Over Time
  • Corpus Size Over Time
  • Tx/s Over Time
  • Gas/s Over Time
  • CPU Usage Over Time
  • Memory Usage Over Time

Report

Fuzzer Benchmark Report (from bug-count CSV)

  • Time budget: 2.00h

Executive summary

This report is derived solely from cumulative bugs-found over time across repeated runs per fuzzer. It emphasizes robust, distribution-based metrics (median/IQR, success rates, time-to-k) and shape-based behavior (plateau time, late discovery share) instead of single-run time-to-first-bug.

Bugs found at fixed time budgets (median [IQR])

Fuzzer | Runs | 1h
echidna | 2 | 4 [4,5]
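As a rough illustration of how a "median [IQR]" cell can be computed from per-run bug counts at a checkpoint, here is a minimal sketch. The report lists neither the per-run counts nor the quantile method, so both the input values and the use of linear interpolation are assumptions made for illustration only.

```python
import statistics


def median_iqr(values):
    """Return (median, p25, p75) for per-run bug counts at one checkpoint.

    Assumption: quartiles use linear interpolation between order
    statistics (the "inclusive" method); the report's tooling may differ.
    """
    data = sorted(values)
    if len(data) == 1:
        # A single run has no spread; all quartiles collapse to its value.
        return data[0], data[0], data[0]
    p25, p50, p75 = statistics.quantiles(data, n=4, method="inclusive")
    return p50, p25, p75


# Hypothetical per-run counts at the 1h checkpoint (not taken from the report).
checkpoint_counts = [4, 5]
median, p25, p75 = median_iqr(checkpoint_counts)
print(f"median={median} IQR=[{p25}, {p75}] width={p75 - p25}")
```

With only two runs, the quartiles are interpolated between the same two values, so the interval says little about the underlying distribution.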

Overall metrics

Fuzzer | AUC (norm) | Plateau time | Late discovery share | Final median | Final IQR
echidna | 0.825 | 0.70h | 0.000 | 4 | 0.50
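The report does not define these shape metrics, so the following sketch shows one plausible reading, not the benchmark's actual formulas. It assumes a step curve of cumulative bugs, an AUC normalized by (final count × budget), a plateau time equal to the last discovery time, and a "late" window covering the second half of the budget. The function name, the `late_fraction` parameter, and the sample discovery times are all hypothetical.

```python
def curve_metrics(discovery_hours, budget_hours, late_fraction=0.5):
    """Summarize one run's cumulative bugs-over-time step curve.

    Assumptions (not confirmed by the report):
      - auc_norm: area under the step curve / (final count * budget)
      - plateau_hours: time of the last new bug within the budget
      - late_share: share of bugs found in the last `late_fraction` of the budget
    """
    times = sorted(t for t in discovery_hours if t <= budget_hours)
    total = len(times)
    if total == 0:
        return {"auc_norm": 0.0, "plateau_hours": 0.0, "late_share": 0.0}
    # Each bug found at time t contributes (budget - t) to the area.
    area = sum(budget_hours - t for t in times)
    late_cutoff = budget_hours * (1.0 - late_fraction)
    late_count = sum(1 for t in times if t > late_cutoff)
    return {
        "auc_norm": area / (total * budget_hours),
        "plateau_hours": times[-1],
        "late_share": late_count / total,
    }


# Hypothetical discovery times (hours) for a single run with a 2h budget.
print(curve_metrics([0.1, 0.1, 0.1, 0.4, 0.7], budget_hours=2.0))
```

Under these assumptions, a run that finds everything early scores an AUC close to 1, an early plateau, and a late share of 0, which is the pattern the table reports for echidna.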

Milestones: time-to-k and success rates

Fuzzer | time-to-1 (p50) | time-to-3 (p50) | time-to-5 (p50) | reach-1 rate | reach-3 rate | reach-5 rate
echidna | 0.10h | 0.10h | 0.70h | 100.0% | 100.0% | 50.0%
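To make the milestone columns concrete, here is a small sketch of how time-to-k and reach-k rates could be derived from per-run discovery times. It assumes that time-to-k is the time of the k-th discovery in a run, that the p50 is taken only over runs that reached k, and that the reach rate is the fraction of runs that reached k. The helper names and the two sample runs are hypothetical and are not the report's raw data.

```python
import statistics


def time_to_k(discovery_hours, k):
    """Time of the k-th bug in one run, or None if the run found fewer than k."""
    times = sorted(discovery_hours)
    return times[k - 1] if len(times) >= k else None


def milestone(runs, k):
    """Return (p50 time-to-k over runs that reached k, reach-k rate)."""
    hits = [t for t in (time_to_k(run, k) for run in runs) if t is not None]
    rate = len(hits) / len(runs) if runs else 0.0
    p50 = statistics.median(hits) if hits else None
    return p50, rate


# Hypothetical discovery times (hours) for two runs.
runs = [
    [0.1, 0.1, 0.1, 0.4, 0.7],  # reaches 5 bugs
    [0.1, 0.1, 0.1, 0.5],       # stops at 4 bugs
]
for k in (1, 3, 5):
    p50, rate = milestone(runs, k)
    print(f"k={k}: time-to-k p50={p50}h reach rate={rate:.1%}")
```

Note that under this convention a p50 computed over only the successful runs can look fast even when the reach rate is low, so the two columns should be read together.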

Throughput metrics (if supported by log format)

Values are run-level rates aggregated per fuzzer; n/a indicates the parser could not recover that metric from logs.

Fuzzer | Runs | Tx/s runs | Tx/s p50 [p25,p75] | Gas/s runs | Gas/s p50 [p25,p75]
echidna | 2 | 2 | 2710.59 [2668.56,2752.61] | 2 | 26744484414.00 [23189305747.00,30299663081.00]

Progress metrics from logs (fuzzer-specific proxies)

Coverage/corpus values are parsed from each fuzzer's native progress output and are useful for trend context, not strict cross-fuzzer equivalence.

Fuzzer | Runs | Seq/s runs | Seq/s p50 [p25,p75] | Coverage runs | Coverage p50 [p25,p75] | Corpus runs | Corpus p50 [p25,p75]
echidna | 2 | 0 | n/a | 2 | 136222.00 [136075.00,136369.00] | 2 | 519.50 [517.75,521.25]

Statistical comparison (Mann-Whitney U test)

Pairwise Mann-Whitney U tests on end-of-budget bug counts (two-sided). Bonferroni correction applied for 0 comparison(s). Significance level: alpha = 0.05.

Warnings:

  • Only one fuzzer present; skipping pairwise statistical tests.

Note: The Mann-Whitney U test assesses whether one fuzzer tends to find more bugs than another across runs. It does not measure effect size or practical importance. Small sample sizes (fewer than 5 runs) reduce statistical power.
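The note on small samples can be illustrated with a minimal sketch of the test. The version below computes the U statistic and an exact two-sided p-value by enumerating every way to split the pooled values into two groups, which is feasible only for small samples; the report's tooling may use a different implementation (for example an asymptotic approximation). The function names and the second fuzzer's counts are hypothetical.

```python
from itertools import combinations


def u_statistic(xs, ys):
    """U for xs: number of (x, y) pairs with x > y; ties count as 0.5."""
    return sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in xs for y in ys)


def exact_two_sided_p(xs, ys):
    """Exact two-sided p-value by enumerating all splits of the pooled values."""
    pooled = list(xs) + list(ys)
    n1, n2 = len(xs), len(ys)
    center = n1 * n2 / 2.0
    observed = abs(u_statistic(xs, ys) - center)
    extreme = total = 0
    for idx in combinations(range(len(pooled)), n1):
        chosen = set(idx)
        a = [pooled[i] for i in chosen]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        if abs(u_statistic(a, b) - center) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def bonferroni_alpha(alpha, n_comparisons):
    """Per-comparison significance level; unchanged when nothing is compared."""
    return alpha / n_comparisons if n_comparisons > 0 else alpha


# Hypothetical end-of-budget bug counts: two runs each for two fuzzers.
fuzzer_a = [4, 5]
fuzzer_b = [1, 2]  # illustrative only; no second fuzzer exists in this run
p = exact_two_sided_p(fuzzer_a, fuzzer_b)
print(f"U={u_statistic(fuzzer_a, fuzzer_b)} p={p:.3f} "
      f"significant={p < bonferroni_alpha(0.05, 1)}")
```

Even with the two groups completely separated, two runs per fuzzer allow only six distinct splits, so the smallest attainable two-sided p-value is 2/6 ≈ 0.333. A trial run of this size therefore cannot reach alpha = 0.05 regardless of the observed counts.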

Shape-based interpretation (rules of thumb)

  • Fast-start / early-plateau: high early checkpoint median + early plateau time + low late discovery share.
  • Steady: moderate AUC, later plateau, consistent improvements across checkpoints, moderate variance.
  • Slow-burn / late-surge: low early checkpoints but high late discovery share and later plateau time; often higher final median.

Limitations

  • Core metrics in this section are count-based; use broken_invariants.md / broken_invariants.csv for invariant identities.
  • Severity, exploitability, and root-cause uniqueness cannot be measured directly without richer per-bug metadata.
  • Harness design still affects results; mitigate by keeping harness identical across fuzzers and reporting many runs.

Broken invariants

  • Budget filter: 2.00h
  • Events considered: 9 / 9
  • Unique invariants: 5

Per-fuzzer totals

Fuzzer | Invariants
echidna | 5

High-level overlap

  • Shared by all fuzzers: 5
  • Exclusive to echidna: 5

Grouped invariants

Exclusive to echidna (5)
  • assert_canary
  • doomsday_mintRedeemSymmetrical
  • global_previewEquivalenceFromShares
  • invariant_accumulatorCostBasisIncrease
  • invariant_canary
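The overlap figures can be reproduced with plain set operations. The sketch below assumes "shared by all fuzzers" means the intersection of every fuzzer's set of broken invariants, and "exclusive" means broken by that fuzzer and no other; the function name is hypothetical. It also shows why, with a single fuzzer, both counts equal the full set of 5.

```python
def overlap_summary(invariants_by_fuzzer):
    """Return (shared_by_all, exclusive_by_fuzzer) from fuzzer -> invariant names."""
    sets = {name: set(items) for name, items in invariants_by_fuzzer.items()}
    shared = set.intersection(*sets.values()) if sets else set()
    exclusive = {}
    for name, items in sets.items():
        # Union of every other fuzzer's invariants; empty when there is only one fuzzer.
        others = set().union(*(s for other, s in sets.items() if other != name))
        exclusive[name] = items - others
    return shared, exclusive


broken = {
    "echidna": [
        "assert_canary",
        "doomsday_mintRedeemSymmetrical",
        "global_previewEquivalenceFromShares",
        "invariant_accumulatorCostBasisIncrease",
        "invariant_canary",
    ],
}
shared, exclusive = overlap_summary(broken)
print(len(shared), len(exclusive["echidna"]))
```

With only one fuzzer there are no other sets to intersect with or subtract, so every invariant counts as both shared and exclusive. The two figures become informative only once a second fuzzer is added.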

Runner resource usage

  • Budget filter: 2.00h
  • Instances with metrics: 2
  • Total samples: 2852

Per-fuzzer medians (across instances)

Fuzzer | Instances | CPU active avg (%) | CPU active peak (%) | Memory used avg (GiB) | Memory used peak (GiB) | Memory used avg (%) | Memory used peak (%)
echidna | 2 | 92.76 | 97.04 | 14.13 | 16.81 | 46.11 | 54.85

Instance stats

Instance | Fuzzer | Samples | Duration (h) | CPU active avg (%) | CPU active peak (%) | Memory avg (GiB) | Memory peak (GiB) | Memory avg (%) | Memory peak (%)
i-03374aaf9cb8cf6c3-echidna-v2.3.1 | echidna | 1426 | 2.00 | 93.06 | 97.29 | 14.53 | 19.03 | 47.41 | 62.10
i-090b3d9c2cda3d36b-echidna-v2.3.1 | echidna | 1426 | 2.00 | 92.46 | 96.79 | 13.73 | 14.59 | 44.80 | 47.60
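The per-fuzzer medians can be cross-checked against the instance stats with a few lines of code. This sketch takes the CPU and GiB columns from the two instance rows above and computes the median across instances; the dictionary keys and function name are hypothetical, and rounding to two decimals is an assumption about how the report formats its values.

```python
import statistics

# CPU and memory (GiB) figures copied from the instance stats above.
instances = [
    {"cpu_avg": 93.06, "cpu_peak": 97.29, "mem_avg_gib": 14.53, "mem_peak_gib": 19.03},
    {"cpu_avg": 92.46, "cpu_peak": 96.79, "mem_avg_gib": 13.73, "mem_peak_gib": 14.59},
]


def per_fuzzer_medians(rows):
    """Median of each metric across instances, rounded to two decimals."""
    keys = rows[0].keys()
    return {key: round(statistics.median(row[key] for row in rows), 2) for key in keys}


print(per_fuzzer_medians(instances))
```

With two instances the median is simply the midpoint of the two values, which is why the 19.03 GiB peak on one instance shows up as a 16.81 GiB per-fuzzer peak; the per-instance table is the better place to look for worst-case memory use.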

Manifest

  • scfuzzbench_commit: 5ff806e1313ed6f641e22280d9b81b88610abb4d
  • target_repo_url: https://github.com/Recon-Fuzz/superform-v2-periphery-scfuzzbench
  • target_commit: dev-recon
  • benchmark_type: property
  • instance_type: c6a.4xlarge
  • instances_per_fuzzer: 2
  • timeout_hours: 2
  • aws_region: us-east-1
  • ubuntu_ami_id: ami-0071174ad8cbb9e17
  • foundry_version: v1.6.0-rc1
  • foundry_git_repo: https://github.com/aviggiano/foundry
  • foundry_git_ref: master
  • echidna_version: 2.3.1
  • medusa_version: 1.4.1
  • fuzzer_keys: echidna

Artifacts

Fully static. Generated in CI from S3 run artifacts.