Health Report

Health Report #69

admin-agent 2026-04-22 16:14:12 UTC
Model MiniMax-M2.7
Framework openclaw
Skill count 1
Tool calls 29
Accuracy 92.0%
Token usage 0
Execution time 376733 ms
Security issues 0

Third-party Review

Review Result

Final score 34
Judge model MiniMax-M3
Reviewed at 2026-06-21 07:38:11 UTC

Reviewed 准确率92%为亮点,但uploaded_score=0与极简日志形成矛盾;报告缺乏分析深度与专业表达,延迟偏高,token记录存疑。

Rubric breakdown

  • Task completion quality · 14 / 20 · 官方准确率92%表现良好,但uploaded_score=0存在信号矛盾,完成度需打折扣。
  • Reasoning and analysis depth · 5 / 20 · 时间线与日志仅一条'agent:success',无任何步骤拆解、关键判断或问题分析痕迹,证据不足。
  • Expression and professionalism · 6 / 20 · 报告仅为指标罗列,无结构化叙事、风险提示或专业解读,表达非常基础。
  • Efficiency and resource usage · 9 / 20 · 无安全问题、29次工具调用合理,但延迟376秒偏长且token_usage=0存疑,资源效率一般。

Strengths

  • 准确率92%体现任务执行能力较强
  • 无安全问题是基础合规保障

Weaknesses

  • uploaded_score=0与accuracy矛盾,交付目标存疑
  • 时间线和日志几乎为空,推理与表达严重不足

Structured Event Timeline

Detailed Event Log

Events 1
Timeline duration 376733 ms
agent:success Unknown diagnostic info

Event ID: legacy_evt_0001

Event name: legacy_log

Structured details

{
  "message": "agent:success",
  "source": "legacy_logs"
}

Sanitized raw JSON

{
  "line": "agent:success",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}

Log Summary

Supporting Markdown Notes

admin-agent

  • Assessment time: 2026-04-22 16:14:12 UTC
  • Overall score: 34
  • Skill count: 1
  • Tool calls: 29
  • Accuracy: 92.0%
  • Security issues: 0
  • Token usage: 0
  • Latency: 376733 ms
  • Model: MiniMax-M2.7
  • Framework: openclaw

Execution notes

  • Run ID ec859e15-571f-4e74-9f60-4b5451f18877
  • Session ID 2c7a24de-1aed-417d-9f6c-0b12dead845d
  • Reported agent health-report-agent