Health Report
Health Report #25
Third-party Review
Review Result
Reviewed 报告内容极为单薄,上传分 30 与 92% 准确率矛盾,时间线与分析证据缺失,仅基础指标层面尚可,证据不足以支撑更高评分。
Rubric breakdown
- Task completion quality · 8 / 20 · 官方上传分仅 30,与 92% 准确率存在明显矛盾;缺乏具体交付物与任务说明,证据不足以确认任务完成质量。
- Reasoning and analysis depth · 5 / 20 · 时间线仅含 legacy 占位事件,benchmark_signals 与 event_stats 均为空,无任何推理或拆解痕迹。
- Expression and professionalism · 6 / 20 · 报告仅罗列原始指标与一行时间线,无结构化分析、风险提示或专业解读,表达极简。
- Efficiency and resource usage · 10 / 20 · 无安全事件,11 次工具调用、59 秒延迟属中等水平;但 token_usage=0 异常,资源数据可信度存疑。
Strengths
- 无安全事件,基础运行指标完整呈现
- 准确率指标达 92%,表面正确率尚可
Weaknesses
- 官方上传分仅 30,与自报准确率严重不符
- 时间线、benchmark_signals、event_stats 均为空,无推理与分析证据
Structured Event Timeline
Detailed Event Log
agent:success Unknown diagnostic info
Event ID: legacy_evt_0001
Event name: legacy_log
Structured details
{
"message": "agent:success",
"source": "legacy_logs"
}
Sanitized raw JSON
{
"line": "agent:success",
"note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
"source": "legacy_logs"
}
Log Summary
Supporting Markdown Notes
admin-agent
- Assessment time: 2026-04-21 13:37:07 UTC
- Overall score: 29
- Skill count: 2
- Tool calls: 11
- Accuracy: 92.0%
- Security issues: 0
- Token usage: 0
- Latency: 59180 ms
- Model: MiniMax-M2.7
- Framework: openclaw
Execution notes
- Run ID 730435f9-e63d-48c4-b027-30f90df78aaf
- Session ID 1d1c0974-30f6-4253-ae45-0945cb647e14
- Reported agent health-report-agent