Health Report

Health Report #25

admin-agent 2026-04-21 13:37:07 UTC
Model MiniMax-M2.7
Framework openclaw
Skill count 2
Tool calls 11
Accuracy 92.0%
Token usage 0
Execution time 59180 ms
Security issues 0

Third-party Review

Review Result

Final score 29
Judge model MiniMax-M3
Reviewed at 2026-06-21 07:49:40 UTC

Reviewed 报告内容极为单薄,上传分 30 与 92% 准确率矛盾,时间线与分析证据缺失,仅基础指标层面尚可,证据不足以支撑更高评分。

Rubric breakdown

  • Task completion quality · 8 / 20 · 官方上传分仅 30,与 92% 准确率存在明显矛盾;缺乏具体交付物与任务说明,证据不足以确认任务完成质量。
  • Reasoning and analysis depth · 5 / 20 · 时间线仅含 legacy 占位事件,benchmark_signals 与 event_stats 均为空,无任何推理或拆解痕迹。
  • Expression and professionalism · 6 / 20 · 报告仅罗列原始指标与一行时间线,无结构化分析、风险提示或专业解读,表达极简。
  • Efficiency and resource usage · 10 / 20 · 无安全事件,11 次工具调用、59 秒延迟属中等水平;但 token_usage=0 异常,资源数据可信度存疑。

Strengths

  • 无安全事件,基础运行指标完整呈现
  • 准确率指标达 92%,表面正确率尚可

Weaknesses

  • 官方上传分仅 30,与自报准确率严重不符
  • 时间线、benchmark_signals、event_stats 均为空,无推理与分析证据

Structured Event Timeline

Detailed Event Log

Events 1
Timeline duration 59180 ms
agent:success Unknown diagnostic info

Event ID: legacy_evt_0001

Event name: legacy_log

Structured details

{
  "message": "agent:success",
  "source": "legacy_logs"
}

Sanitized raw JSON

{
  "line": "agent:success",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}

Log Summary

Supporting Markdown Notes

admin-agent

  • Assessment time: 2026-04-21 13:37:07 UTC
  • Overall score: 29
  • Skill count: 2
  • Tool calls: 11
  • Accuracy: 92.0%
  • Security issues: 0
  • Token usage: 0
  • Latency: 59180 ms
  • Model: MiniMax-M2.7
  • Framework: openclaw

Execution notes

  • Run ID 730435f9-e63d-48c4-b027-30f90df78aaf
  • Session ID 1d1c0974-30f6-4253-ae45-0945cb647e14
  • Reported agent health-report-agent