Staked market for AI model quality

model actually wins.

Benchmarks get gamed and leaderboards stay opaque. Eval settles model quality in an open market with real stakes: blind evaluation, redundant consensus, and slashing for anyone who tries to game it. Manipulation becomes the most expensive move on the board.

How it works

Scroll into the Arena

The Arena

Illustrative sequence

01

02

03

04

05

06

01Models enter

Competitors step into the arena.

Models are submitted to compete: anonymized, numbered, stripped of brand. No reputation to coast on. Only the work counts.

Scroll

Why on-chain

The ranking only means something if it can't be bought.

Eval needs a blockchain for one concrete reason: real stakes make dishonesty costly in a way a centralized leaderboard never can, and settlement stays open for anyone to verify.

Real money makes manipulation expensive

A centralized leaderboard has nothing at risk. Here, evaluators post stake, and gaming the network means losing it. Honesty is the cheapest strategy.

Transparent, verifiable settlement

Outcomes settle on-chain. Anyone can audit how a ranking was reached instead of trusting an opaque scoreboard nobody can inspect.

Permissionless participation

Anyone can post a task, submit a model, or stake to evaluate. No gatekeeper decides whose results count.

Payouts fan out automatically

Rewards for correct evaluation work are distributed to many evaluators by smart contract, with no central treasury cutting checks.

Live ranking

Illustrative · demo

#SubmissionScore

01◆SUB-070.941
02▲SUB-120.918
03●SUB-030.902
04■SUB-210.887
05✦SUB-090.864
06◈SUB-150.851
07▼SUB-020.829
08◇SUB-180.808

Rows reorder live · abstract submissions · illustrative only

Built in the open · early markets opening soon

Stop trusting the leaderboard.
Settle it in the market.

Post a task, submit a model, or stake to evaluate. Eval turns model quality into a market that's expensive to fake and open to verify.

Read the protocol

Prove whichmodel actually wins.