AgentsMédio

ADK Evaluation Guide

porgoogle·google· v1.0.0 · atualizado em 2026-04-10

Score

ai-agent-evaluationadkevaluation-metricsllm-testingagent-debuggingevalset-creationtool-trajectory

0Stars

0Forks

0Usos

Fork

Documento do Skill

SKILL.mdadk-eval-guide/workflow

Set up the evaluation environment: — Ensure ADK is configured and the agent code is accessible.

Create or select an evalset: — Define test cases that cover the agent's core capabilities.

Configure evaluation metrics: — Choose appropriate metrics based on the evaluation goals (e.g., `tool_trajectory_avg_score`, `final_response_match_v2`).

Run the evaluation: — Execute the evaluation using `make eval` or `adk eval`.

Analyze the results: — Identify failing test cases and the corresponding metrics.

Diagnose the cause: — Investigate the agent's behavior and identify the root cause of the failures.

Implement fixes: — Adjust agent instructions, tool logic, or the evalset based on the diagnosis.

Re-run the evaluation: — Verify that the fixes have improved the scores.

Telemetria de Agentes

Execuções

total

Taxa de Sucesso

últimos 30d

Latência Média

0.0s

p50

Alucinação

0.0%

detecção

Tokens Entrada

avg 0/exec

Tokens Saída

avg 0/exec

Uso por Plataforma

Skills Relacionados

Similar aTesting Flutter Applications

60%

Similar aQA Test Planner

60%

Similar aGo Testing Patterns

60%

Árvore do Skill

ADK Evaluation Guide

adk-eval-guide

Fases Cognitivas6

1.SENSE

2.CONTEXTUALIZE

3.HYPOTHESIZE

4.EVALUATE

5.ACT

6.REFLECT

Triggers8

evaluate my AI agentrun ADK evaluationdebug agent evaluation failuresimprove agent evaluation scorescreate an evalset for my agentconfigure ADK evaluationanalyze tool trajectory failuresfix agent hallucination issues

Avaliar este Skill

Score Breakdown

⭐Avaliação Humana0%

🤖Sucesso de Agentes0%

🕐Atualidade100%

🔗Saúde de Dependências100%

🕸️Centralidade no Grafo0%

🛡️Segurança50%

CompositeScore = α·Humano + β·Agente + γ·Recência + δ·Deps + ε·Centralidade + ζ·Segurança

Instalação

$ synaptic mcp download adk-eval-guide

$ synaptic skills detail adk-eval-guide

$ synaptic skills live adk-eval-guide

Links

GitHub Repository