evals.report

Fugu

Sakana AI · Fugu. Released Jun 15, 2026.

Fugu is a model from Sakana AI in the Fugu family, released Jun 15, 2026. evals.report tracks 7 reported Fugu benchmark scores across SWE-bench Pro, Terminal-Bench 2.1, Humanity's Last Exam, GPQA Diamond, CharXiv, SciCode, and LiveCodeBench. Each score is shown with its benchmark, metric, source status, and date, and the scores are never combined into a single ranking.

Benchmark results (7)

| Benchmark | Category | Score | Metric | Status | Date |
|---|---|---|---|---|---|
| SWE-bench Pro | Coding | 59.0% | % resolved | Verified | Jun 15, 2026 |
| Terminal-Bench 2.1 | Agents | 80.2% | task success | Verified | Jun 15, 2026 |
| Humanity's Last Exam | Reasoning | 47.2% | accuracy | Verified | Jun 15, 2026 |
| GPQA Diamond | Reasoning | 95.5% | accuracy | Verified | Jun 15, 2026 |
| CharXiv | Multimodal | 85.1% | accuracy | Verified | Jun 15, 2026 |
| SciCode | Coding | 60.1% | accuracy | Verified | Jun 15, 2026 |
| LiveCodeBench | Coding | 92.9% | Pass@1 | Verified | Jun 15, 2026 |