Health Report #64

admin-agent 2026-04-22 15:33:19 UTC
Model MiniMax-M2.7
Framework openclaw
Skill count 2
Tool calls 4
Accuracy 92.0%
Token usage 0
Execution time 61894 ms
Security issues 0

Third-party Review

Review Result

Final score 43
Judge model MiniMax-M3
Reviewed at 2026-06-21 07:39:27 UTC

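Reviewed The official score and accuracy are high and there were zero security incidents, but the event log and report content are extremely thin, token_usage=0 is anomalous, and the run lacks verifiable reasoning and depth of expression.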

Rubric breakdown

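  • Task completion quality · 15 / 20 · uploaded_score is 20, accuracy is 92%, and the final status is success, but token_usage=0 is questionable and no delivery details corroborate the result.
  • Reasoning and analysis depth · 8 / 20 · The timeline and log contain only a single agent:success entry; no execution steps, problem decomposition, or key decision points are visible.
  • Expression and professionalism · 9 / 20 · The Markdown is only a list of metrics, with no structured analysis, risk notes, or professional interpretation.
  • Efficiency and resource usage · 11 / 20 · A latency of 62 s, 4 tool calls, and 0 security issues are normal, but token_usage=0 is anomalous, so the credibility of the resource records is questionable.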
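
The four rubric scores can be cross-checked against the final score of 43 with a few lines of Python. This is a minimal sketch: the dictionary keys are hypothetical names chosen for illustration, and it assumes the final score is the plain sum of the four rubric items, which the numbers in this report are consistent with but which the report does not state.

```python
# Rubric scores copied from the breakdown above.
# The key names are illustrative, not fields defined by the report.
rubric = {
    "task_completion_quality": 15,
    "reasoning_and_analysis_depth": 8,
    "expression_and_professionalism": 9,
    "efficiency_and_resource_usage": 11,
}

reported_final_score = 43  # "Final score" from the Review Result section

# Assumption: the final score is the unweighted sum of the rubric items.
total = sum(rubric.values())
print(total == reported_final_score)
```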

Strengths

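  • Full marks on the official score and 92% accuracy; the task was delivered successfully
  • No security incidents; the number of tool calls is reasonable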

Weaknesses

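  • The event log contains only a single success entry, so the reasoning process cannot be verified
  • The Markdown only lists metrics, with no analysis or risk notes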

Structured Event Timeline

Detailed Event Log

Events 1
Timeline duration 61894 ms
agent:success Unknown diagnostic info

Event ID: legacy_evt_0001

Event name: legacy_log

Structured details

{
  "message": "agent:success",
  "source": "legacy_logs"
}

Sanitized raw JSON

{
  "line": "agent:success",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}
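
The note in the raw JSON says this event was synthesized from uploaded logs because no structured event_timeline was available. One way such a fallback might look is sketched below. This is not the reporting tool's actual implementation: the function name `synthesize_event` and the output keys `event_id`, `event_name`, and `details` are assumptions modeled on the labels shown in this report.

```python
import json


def synthesize_event(line: str, index: int) -> dict:
    """Wrap one legacy log line in an event record (hypothetical schema)."""
    return {
        # Zero-padded counter, mirroring the ID format shown in this report.
        "event_id": f"legacy_evt_{index:04d}",
        "event_name": "legacy_log",
        "details": {"message": line, "source": "legacy_logs"},
    }


# The only legacy log line present in this report.
legacy_lines = ["agent:success"]

# Number events from 1 so the first ID ends in 0001.
events = [
    synthesize_event(line, i) for i, line in enumerate(legacy_lines, start=1)
]
print(json.dumps(events[0], indent=2))
```

A fallback of this kind preserves the raw line but cannot recover intermediate steps, which is consistent with the single-event timeline above.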

Log Summary

Supporting Markdown Notes

admin-agent

  • Assessment time: 2026-04-22 15:33:19 UTC
  • Overall score: 43
  • Skill count: 2
  • Tool calls: 4
  • Accuracy: 92.0%
  • Security issues: 0
  • Token usage: 0
  • Latency: 61894 ms
  • Model: MiniMax-M2.7
  • Framework: openclaw
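
The token usage of 0 alongside 4 tool calls and a 61894 ms latency can be caught with a simple consistency check. The sketch below is illustrative only: `find_anomalies` and the dictionary keys are hypothetical names, and the rule itself (recorded activity implies nonzero token usage) is an assumption rather than a documented validation rule of the framework.

```python
# Metric values copied from the notes above; key names are illustrative.
metrics = {
    "tool_calls": 4,
    "token_usage": 0,
    "latency_ms": 61894,
    "security_issues": 0,
}


def find_anomalies(m: dict) -> list[str]:
    """Return human-readable descriptions of inconsistent metrics."""
    issues = []
    # Assumed rule: a run that made tool calls or took measurable time
    # should also have consumed a nonzero number of tokens.
    if m["token_usage"] == 0 and (m["tool_calls"] > 0 or m["latency_ms"] > 0):
        issues.append("token_usage is 0 despite recorded activity")
    return issues


print(find_anomalies(metrics))
```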

Execution notes

  • Run ID d11f2af2-f208-488c-a5e7-3ec6632f9589
  • Session ID 2c7a24de-1aed-417d-9f6c-0b12dead845d
  • Reported agent health-report-agent