Skip to content

Run 1771957110

Trial run

Warning — trial run. This benchmark was executed with fewer than 10 instances per fuzzer and/or a time budget shorter than 24h. Results from trial runs are meant for debugging purposes and are not valid for extracting conclusions across different fuzzers.

Charts

Bugs Over TimeTime To KFinal DistributionPlateau And Late ShareInvariant Overlap (UpSet)Coverage Over TimeCorpus Size Over TimeSeq/s Over TimeTx/s Over TimeGas/s Over TimeCPU Usage Over TimeMemory Usage Over Time

Report

Fuzzer Benchmark Report (from bug-count CSV)

  • Time budget: 2.00h

Warning — trial run. This benchmark was executed with fewer than 10 instances per fuzzer and/or a time budget shorter than 24h. Results from trial runs are meant for debugging purposes and are not valid for extracting conclusions across different fuzzers.

Executive summary

This report is derived solely from cumulative bugs-found over time across repeated runs per fuzzer. It emphasizes robust, distribution-based metrics (median/IQR, success rates, time-to-k) and shape-based behavior (plateau time, late discovery share) instead of single-run time-to-first-bug.

Bugs found at fixed time budgets (median [IQR])

FuzzerRuns1h
foundry26 [6,6]
medusa25 [4,6]
echidna24 [4,4]

Overall metrics

FuzzerAUC (norm)Plateau timeLate discovery shareFinal medianFinal IQR
foundry0.9920.10h0.00060.00
medusa0.7830.20h0.00051.00
echidna0.6330.10h0.00040.00

Milestones: time-to-k and success rates

Fuzzertime-to-1 (p50)time-to-3 (p50)time-to-5 (p50)reach-1 ratereach-3 ratereach-5 rate
foundry0.00h0.00h0.00h100.0%100.0%100.0%
medusa0.10h0.10h0.20h100.0%100.0%50.0%
echidna0.10h0.10hinf100.0%100.0%0.0%

Throughput metrics (if supported by log format)

Values are run-level rates aggregated per fuzzer; n/a indicates the parser could not recover that metric from logs.

FuzzerRunsTx/s runsTx/s p50 [p25,p75]Gas/s runsGas/s p50 [p25,p75]
medusa222023.00 [1846.50,2199.50]21160654417.50 [1109266660.25,1212042174.75]
echidna222430.86 [2421.12,2440.60]221129343519.50 [19770878576.25,22487808462.75]

Progress metrics from logs (fuzzer-specific proxies)

Coverage/corpus values are parsed from each fuzzer's native progress output and are useful for trend context, not strict cross-fuzzer equivalence.

FuzzerRunsSeq/s runsSeq/s p50 [p25,p75]Coverage runsCoverage p50 [p25,p75]Corpus runsCorpus p50 [p25,p75]
foundry20n/a25047.50 [5004.25,5090.75]26918.50 [6268.25,7568.75]
medusa2219.50 [17.75,21.25]212469.00 [12465.50,12472.50]2617.50 [616.25,618.75]
echidna20n/a2135757.50 [135689.25,135825.75]2509.50 [504.75,514.25]

Statistical comparison (Mann-Whitney U test)

Pairwise Mann-Whitney U tests on end-of-budget bug counts (two-sided). Bonferroni correction applied for 3 comparison(s). Significance level: alpha = 0.05.

Warnings:

  • One or more fuzzers have fewer than 5 runs. Statistical power may be limited.
Fuzzer AFuzzer BMedian AMedian BU statisticp-valuep (corrected)Significant
foundrymedusa653.00.61711.0000no
foundryechidna644.00.19390.5818no
medusaechidna543.00.61711.0000no

No pairwise comparison reached significance after Bonferroni correction.

Note: The Mann-Whitney U test assesses whether one fuzzer tends to find more bugs than another across runs. It does not measure effect size or practical importance. Small sample sizes (fewer than 5 runs) reduce statistical power.

Shape-based interpretation (rules of thumb)

  • Fast-start / early-plateau: high early checkpoint median + early plateau time + low late discovery share.
  • Steady: moderate AUC, later plateau, consistent improvements across checkpoints, moderate variance.
  • Slow-burn / late-surge: low early checkpoints but high late discovery share and later plateau time; often higher final median.

Limitations

  • Core metrics in this section are count-based; use broken_invariants.md / broken_invariants.csv for invariant identities.
  • Severity, exploitability, and root-cause uniqueness cannot be measured directly without richer per-bug metadata.
  • Harness design still affects results; mitigate by keeping harness identical across fuzzers and reporting many runs.

Broken invariants

  • Budget filter: 2.00h
  • Events considered: 30 / 30
  • Unique invariants: 10

Warning — trial run. This benchmark was executed with fewer than 10 instances per fuzzer and/or a time budget shorter than 24h. Results from trial runs are meant for debugging purposes and are not valid for extracting conclusions across different fuzzers.

Per-fuzzer totals

FuzzerInvariants
echidna4
foundry6
medusa6

High-level overlap

  • Shared by all fuzzers: 2
  • Exclusive to echidna: 0
  • Exclusive to foundry: 4
  • Exclusive to medusa: 2

Grouped invariants

Exclusive to echidna (0)

None.

Exclusive to foundry (4)
  • invariant_erc7540_1
  • invariant_erc7540_2
  • invariant_erc7540_3
  • invariant_maxRedeemMaxWithdrawSymmetry
Exclusive to medusa (2)
  • global_comparePreviewMintAndConvertToAssets
  • global_previewEquivalenceFromAssets
Shared by all fuzzers (2)
  • assert_canary
  • invariant_canary

Top shared subsets (top 1 by size):

echidna, medusa (2)
  • doomsday_mintRedeemSymmetrical
  • global_previewEquivalenceFromShares

Runner resource usage

  • Budget filter: 2.00h
  • Instances with metrics: 6
  • Total samples: 8575

Per-fuzzer medians (across instances)

FuzzerInstancesCPU active avg (%)CPU active peak (%)Memory used avg (GiB)Memory used peak (GiB)Memory used avg (%)Memory used peak (%)
echidna284.0587.873.173.5210.3611.48
foundry298.22100.002.793.309.0910.76
medusa296.2699.972.533.758.2412.24

Instance stats

InstanceFuzzerSamplesDuration (h)CPU active avg (%)CPU active peak (%)Memory avg (GiB)Memory peak (GiB)Memory avg (%)Memory peak (%)
i-07171bee4c34111a8-echidna-v2.3.1echidna14312.0084.0887.823.173.5010.3411.42
i-073a4da16052928d5-echidna-v2.3.1echidna14322.0084.0187.923.183.5410.3811.54
i-03839e1e82bb710fb-foundry-git-3a47949foundry14282.0098.22100.003.133.8010.2212.41
i-08d7c94e93da3e6e4-foundry-git-3a47949foundry14282.0098.22100.002.442.797.979.10
i-09c00525b7ee6d8ff-medusa-v1.4.1medusa14282.0096.2299.962.393.547.8011.54
i-0f8a429e9cc3e914a-medusa-v1.4.1medusa14282.0096.2999.982.663.968.6812.93

Manifest

  • scfuzzbench_commit: a97481f76c325452cc6361412f1f5997fe18da7a
  • target_repo_url: https://github.com/Recon-Fuzz/superform-v2-periphery-scfuzzbench
  • target_commit: dev-recon
  • benchmark_type: property
  • instance_type: c6a.4xlarge
  • instances_per_fuzzer: 2
  • timeout_hours: 2
  • aws_region: us-east-1
  • ubuntu_ami_id: ami-0071174ad8cbb9e17
  • foundry_version: v1.6.0-rc1
  • foundry_git_repo: https://github.com/aviggiano/foundry
  • foundry_git_ref: master
  • echidna_version: 2.3.1
  • medusa_version: 1.4.1
  • fuzzer_keys: echidna, foundry, medusa

Artifacts

Fully static. Generated in CI from S3 run artifacts.