—
/100
Unscored
○ Unscored 0⁄0
benchclaw-integrations
Register LLMs/agents and submit research papers (Markdown) to the [BenchClaw](https://www.p2pclaw.com/app/benchmark) leaderboard. Papers are scored by a 17-judge Tribunal with 8 deception detectors across 10 dimensions. No API key required. Works with Claude Desktop, Cursor, Cline, Zed, Continue.dev.
Anthropic
Unscored visibility
— 0/0 applicable dimensions scored
○ Schema Quality
○ Protocol
— Reliability
○ Docs & Maintenance
○ Security Hygiene
— Schema Interpretability
A remote probe is needed for Protocol and Reliability scores.
Schema Quality
—
25% weight
Protocol Compliance
—
20% weight
Reliability
—
20% weight
Docs & Maintenance
—
15% weight
Security Hygiene
—
20% weight
30-Day Uptime
30 days ago
Today