evals.report

Remote Labor Index

The Remote Labor Index (RLI), from CAIS and Scale Labs, measures how often AI agents can complete real, economically valuable freelance projects (3D & CAD, architecture, graphic design, video, audio, data analysis, web apps, and more) at a quality a paying client would accept. Each of the 240 projects has a real client brief, input files, and a gold-standard deliverable from a paid professional; every AI deliverable is judged by human evaluators. The headline automation rate is the share of projects where the AI's work is judged at least as good as the human's.

Agents · automation rate · higher is better

What is Remote Labor Index?

evals.report tracks reported Remote Labor Index scores with the model, source, status, date, and run caveats attached. Entries include official leaderboard scores, vendor-reported launches, and clearly labeled community runs.
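The automation rate described above is a simple proportion, and a small sketch may make the arithmetic concrete. The code below is an illustration only, not the benchmark's actual scoring pipeline: the `Judgment` class, its field names, and the sample data are hypothetical, and it assumes each project receives a single yes/no verdict from human evaluators.

```python
from dataclasses import dataclass


@dataclass
class Judgment:
    # Hypothetical record: one human-evaluator verdict per project.
    project_id: str
    ai_at_least_as_good: bool  # True if the AI deliverable matched or beat the human one


def automation_rate(judgments: list[Judgment]) -> float:
    """Return the percentage of projects where the AI's work was judged
    at least as good as the human professional's deliverable."""
    if not judgments:
        raise ValueError("no judgments supplied")
    passed = sum(1 for j in judgments if j.ai_at_least_as_good)
    return 100.0 * passed / len(judgments)


# Made-up data for illustration: 240 projects, 12 judged acceptable.
judgments = [Judgment(f"project-{i}", i < 12) for i in range(240)]
rate = automation_rate(judgments)
print(f"{rate:.2f}%")  # prints "5.00%"
```

Under this assumption, each additional accepted project out of 240 moves the rate by roughly 0.42 percentage points.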

Top reported Remote Labor Index score: Claude Fable 5 at 16.1% (automation rate).

| Model | Lab | Score | Source model | Status | Date |
| --- | --- | --- | --- | --- | --- |
| Claude Fable 5 | Anthropic | 16.1% | Fable 5 | Official | Jun 9, 2026 |
| Claude Opus 4.8 | Anthropic | 8.33% | Opus 4.8 | Official | May 28, 2026 |
| GPT-5.5 | OpenAI | 6.25% | GPT-5.5 | Official | Apr 23, 2026 |
| Claude Opus 4.6 | Anthropic | 4.17% | Opus 4.6 | Official | Feb 5, 2026 |
| Manus 1.6 Max | Manus | 2.92% | Manus 1.6 Max | Official | |
| GPT-5.2 | OpenAI | 2.5% | GPT-5.2 | Official | Dec 11, 2025 |
| Grok 4 | xAI | 2.08% | Grok 4 | Official | Jul 9, 2025 |
| Gemini 3 Pro | Google DeepMind | 1.25% | Gemini 3 Pro | Official | Nov 18, 2025 |

Each row reports the model’s automation rate on Remote Labor Index.