Remote Labor Index
The Remote Labor Index (RLI), from CAIS and Scale Labs, measures how often AI agents can complete real, economically valuable freelance projects (3D & CAD, architecture, graphic design, video, audio, data analysis, web apps, and more) at a quality a paying client would accept. Each of the 240 projects has a real client brief, input files, and a gold-standard deliverable from a paid professional; every AI deliverable is judged by human evaluators. The headline automation rate is the share of projects where the AI's work is judged at least as good as the human's.
What is Remote Labor Index?
The Remote Labor Index (RLI), from CAIS and Scale Labs, measures how often AI agents can complete real, economically valuable freelance projects (3D & CAD, architecture, graphic design, video, audio, data analysis, web apps, and more) at a quality a paying client would accept. Each of the 240 projects has a real client brief, input files, and a gold-standard deliverable from a paid professional; every AI deliverable is judged by human evaluators. The headline automation rate is the share of projects where the AI's work is judged at least as good as the human's. evals.report tracks reported Remote Labor Index scores with the model, source, status, date, and run caveats attached — official leaderboard scores, vendor-reported launches, and clearly labeled community runs.
Top reported Remote Labor Index score: Claude Fable 5 — 16.1% (automation rate).
| Model | Lab | Score↓ | Source model | Status | Date | |
|---|---|---|---|---|---|---|
| Claude Fable 5 | Anthropic | 16.1% | Fable 5 | Official | Jun 9, 2026 | Details |
| Claude Opus 4.8 | Anthropic | 8.33% | Opus 4.8 | Official | May 28, 2026 | Details |
| GPT-5.5 | OpenAI | 6.25% | GPT-5.5 | Official | Apr 23, 2026 | Details |
| Claude Opus 4.6 | Anthropic | 4.17% | Opus 4.6 | Official | Feb 5, 2026 | Details |
| Manus 1.6 Max | Manus | 2.92% | Manus 1.6 Max | Official | — | Details |
| GPT-5.2 | OpenAI | 2.5% | GPT-5.2 | Official | Dec 11, 2025 | Details |
| Grok 4 | xAI | 2.08% | Grok 4 | Official | Jul 9, 2025 | Details |
| Gemini 3 Pro | Google DeepMind | 1.25% | Gemini 3 Pro | Official | Nov 18, 2025 | Details |
Each row reports the model’s automation rate on Remote Labor Index. Click a row for the full run context.