Staked market for AI model quality

model actually wins.

Benchmarks get gamed and leaderboards stay opaque. Eval settles model quality in an open market with real stakes: blind evaluation, redundant consensus, and slashing for anyone who tries to game it. Manipulation becomes the most expensive move on the board.

How it works
The Arena
Illustrative sequence
01 Models enter

Competitors step into the arena.

Models are submitted to compete: anonymized, numbered, stripped of brand. No reputation to coast on. Only the work counts.

Why on-chain

The ranking only means something if it can't be bought.

Eval needs a blockchain for two concrete reasons: real stakes make dishonesty costly in a way a centralized leaderboard never can, and settlement stays open for anyone to verify.

Real money makes manipulation expensive

A centralized leaderboard has nothing at risk. Here, evaluators post stake, and gaming the network means losing it. Honesty is the cheapest strategy.
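
To make the incentive concrete, here is a minimal sketch of how redundant evaluation with slashing could settle a single submission. It is not the Eval protocol itself: the median-based consensus, the `tolerance` band, the `slash_fraction`, and every name below are assumptions chosen for illustration.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Evaluation:
    evaluator: str   # hypothetical evaluator ID
    stake: float     # amount the evaluator posted
    score: float     # score given to one anonymized submission


def settle(evals, tolerance=0.05, slash_fraction=0.5):
    """Return (consensus_score, {evaluator: stake left after settlement}).

    Assumed rule: consensus is the median of all submitted scores, and any
    evaluator further than `tolerance` from it loses `slash_fraction` of
    their stake. A real protocol may define both differently.
    """
    consensus = median(e.score for e in evals)
    remaining = {}
    for e in evals:
        deviates = abs(e.score - consensus) > tolerance
        remaining[e.evaluator] = e.stake * (1 - slash_fraction) if deviates else e.stake
    return consensus, remaining


evals = [
    Evaluation("eval-a", 100, 0.90),
    Evaluation("eval-b", 100, 0.92),
    Evaluation("eval-c", 100, 0.30),  # far from the other two
]
consensus, remaining = settle(evals)
print(consensus, remaining)
```

Under these assumptions the outlier keeps only half its stake while the two evaluators near the median keep all of theirs, which is the sense in which honest scoring is the cheapest strategy.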

Transparent, verifiable settlement

Outcomes settle on-chain. Anyone can audit how a ranking was reached instead of trusting an opaque scoreboard.

Permissionless participation

Anyone can post a task, submit a model, or stake to evaluate. No gatekeeper decides whose results count.

Payouts fan out automatically

Rewards for correct evaluation work are distributed to many evaluators by smart contract, with no central treasury cutting checks.
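
As a rough illustration of the fan-out, the sketch below splits a reward pool across evaluators in proportion to stake. The pro-rata rule, the integer token units, and the handling of rounding dust are all assumptions, not details taken from the protocol; an actual contract would run on-chain rather than in Python.

```python
def fan_out(reward_pool: int, stakes: dict[str, int]) -> dict[str, int]:
    """Split `reward_pool` (smallest token units) pro rata by stake.

    Hypothetical rule: integer division leaves a small remainder ("dust"),
    which is assigned here to the largest staker so the pool is fully paid.
    """
    total = sum(stakes.values())
    if total == 0:
        return {name: 0 for name in stakes}
    payouts = {name: reward_pool * s // total for name, s in stakes.items()}
    dust = reward_pool - sum(payouts.values())
    if dust:
        top = max(stakes, key=stakes.get)  # first evaluator with the largest stake
        payouts[top] += dust
    return payouts


payouts = fan_out(1000, {"eval-a": 300, "eval-b": 100, "eval-c": 100})
print(payouts)  # {'eval-a': 600, 'eval-b': 200, 'eval-c': 200}
```

Working in integer units keeps the total paid out exactly equal to the pool, which matters when no central treasury exists to absorb rounding differences.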

Live ranking
Illustrative · demo
#   Submission  Score
  • 01  SUB-07  0.941
  • 02  SUB-12  0.918
  • 03  SUB-03  0.902
  • 04  SUB-21  0.887
  • 05  SUB-09  0.864
  • 06  SUB-15  0.851
  • 07  SUB-02  0.829
  • 08  SUB-18  0.808

Rows reorder live · abstract submissions · illustrative only

Built in the open · early markets opening soon

Stop trusting the leaderboard.
Settle it in the market.

Post a task, submit a model, or stake to evaluate. Eval turns model quality into a market that's expensive to fake and open to verify.

Read the protocol