← Back to leaderboard
/100
Unscored ○ Unscored 00

EvalKit MCP Server

Enables testing AI safety classifier robustness against query decomposition, obfuscation, and multi-agent attacks. Provides tools for full evaluation pipelines, query previews, and status checks.

Unscored visibility — 0/0 applicable dimensions scored
○ Schema Quality ○ Protocol — Reliability ○ Docs & Maintenance ○ Security Hygiene — Schema Interpretability
A remote probe is needed for Protocol and Reliability scores.
Schema Quality
25% weight
Protocol Compliance
20% weight
Reliability
20% weight
Docs & Maintenance
15% weight
Security Hygiene
20% weight
30-Day Trend

30-Day Uptime

30 days ago Today
Embed Badge

Add this to your README to display your MCP Scoreboard grade:

MCP Score Badge
[![MCP Score](https://mcpscoreboard.com/badge/f650d901-7e0a-46e1-9420-020a2e87931b.svg)](https://mcpscoreboard.com/server/f650d901-7e0a-46e1-9420-020a2e87931b/)