Synaptic SkillsSynapticSkills
MarketplaceSkill GraphCriar SkillMCP ServerPlataformaEnterprise
v0.1.0-beta
Voltar ao Marketplace
AgentsMédio

Eval Harness Skill

poraffaan-m·affaan-m· v1.0.0 · atualizado em 2026-04-10
80
Score

Formal evaluation framework for Claude Code sessions implementing eval-driven development (EDD) principles

eval-driven-developmenttestingai-evaluationregression-testingcapability-testingcode-qualityllm-development
0Stars
0Forks
0Usos
Fork

Documento do Skill

SKILL.mdeval-harness/workflow
1
Define Evals: — Before coding, define capability and regression evals with clear success criteria.
2
Implement Code: — Write code to pass the defined evals.
3
Run Evals: — Execute the evals using appropriate graders (code, model, human).
4
Analyze Results: — Review the eval report and identify areas for improvement.
5
Iterate: — Modify code and rerun evals until success criteria are met.
6
Report: — Generate a final eval report summarizing the results.

Telemetria de Agentes

Execuções
0
total
Taxa de Sucesso
0%
últimos 30d
Latência Média
0.0s
p50
Alucinação
0.0%
detecção
Tokens Entrada
0
avg 0/exec
Tokens Saída
0
avg 0/exec

Uso por Plataforma

Skills Relacionados

Compõe com ←Continuous Agent Loop
70%
Hebbian Synapse
Composite0.700
w = 0.3·α + 0.5·β + 0.2·γ
86
Similar aRemembering Conversations
60%
Hebbian Synapse
Composite0.600
w = 0.3·α + 0.5·β + 0.2·γ
80
Similar aInstaclaw 🦞
60%
Hebbian Synapse
Composite0.600
w = 0.3·α + 0.5·β + 0.2·γ
78
Similar aWallet
60%
Hebbian Synapse
Composite0.600
w = 0.3·α + 0.5·β + 0.2·γ
80

Árvore do Skill

Eval Harness Skill
eval-harness
Fases Cognitivas5
1.SENSE
2.CONTEXTUALIZE
3.EVALUATE
4.REFLECT
5.ACT
Triggers8
evaluate AI agent performancerun capability evalsdefine regression tests for AIcheck AI feature implementationgenerate eval reportimplement eval-driven developmenttrack AI reliabilityassess AI code quality

Avaliar este Skill

Score Breakdown

⭐Avaliação Humana0%
🤖Sucesso de Agentes0%
🕐Atualidade100%
🔗Saúde de Dependências100%
🕸️Centralidade no Grafo0%
🛡️Segurança50%
CompositeScore = α·Humano + β·Agente + γ·Recência + δ·Deps + ε·Centralidade + ζ·Segurança

Instalação

$ synaptic mcp download eval-harness
$ synaptic skills detail eval-harness
$ synaptic skills live eval-harness

Dependências

npmgrep

Links

GitHub Repository