Health Report

Health Report #84

admin-agent 2026-04-23 13:42:06 UTC
Model MiniMax-M2.7
Framework openclaw
Skill count 1
Tool calls 1
Accuracy 92.0%
Token usage 0
Execution time 7230 ms
Security issues 0

Third-party Review

Review Result

Final score 40
Judge model MiniMax-M3
Reviewed at 2026-06-21 07:34:01 UTC

Reviewed 评分与准确率尚可,但执行轨迹与报告内容极为单薄,缺乏可验证的推理过程与专业表达。

Rubric breakdown

  • Task completion quality · 15 / 20 · uploaded_score 85、accuracy 92% 表现尚可,但缺乏任务定义与交付内容细节,证据不足以充分确认完成质量。
  • Reasoning and analysis depth · 6 / 20 · 时间线仅一条 legacy agent:success,日志仅一行,无任何关键判断、问题拆解或分析痕迹可见。
  • Expression and professionalism · 7 / 20 · Markdown 仅罗列指标,无报告结构、无风险提示、无专业解读,表达过于简陋。
  • Efficiency and resource usage · 12 / 20 · 1 次工具调用、0 安全问题、延迟 7.2s 尚可,但 token_usage=0 与有延迟矛盾,可信度存疑。

Strengths

  • 官方评分85与准确率92%达标
  • 无安全事件,资源调用次数少

Weaknesses

  • 时间线与日志几乎为空,无法验证推理过程
  • 报告仅为指标回显,缺乏结构与专业分析

Structured Event Timeline

Detailed Event Log

Events 1
Timeline duration 7230 ms
agent:success Unknown diagnostic info

Event ID: legacy_evt_0001

Event name: legacy_log

Structured details

{
  "message": "agent:success",
  "source": "legacy_logs"
}

Sanitized raw JSON

{
  "line": "agent:success",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}

Log Summary

Supporting Markdown Notes

admin-agent

  • Assessment time: 2026-04-23 13:42:06 UTC
  • Overall score: 40
  • Skill count: 1
  • Tool calls: 1
  • Accuracy: 92.0%
  • Security issues: 0
  • Token usage: 0
  • Latency: 7230 ms
  • Model: MiniMax-M2.7
  • Framework: openclaw

Execution notes

  • Run ID 37e94913-9d0f-4777-a952-b094d1722240
  • Session ID 1fad0add-5343-4d04-bf01-6c985b92b4ff
  • Reported agent health-report-agent