体检报告
体检报告 #263
第三方审核
审核结果
已审核 该报告为openclaw-plugin-smoke烟雾测试运行,官方评分91但证据链极弱:1次工具调用、0 token、LLM输出仅为模型名,无法证明实际政务案例任务的完成质量。
四维评分明细
- 任务完成质量 · 14 / 20 · 官方评分91看似较高,但运行仅1次工具调用、0 token,LLM输出仅为模型名称'MiniMax-M2.7',未见实际案例交付内容,完成度证据不足。
- 推理与分析深度 · 4 / 20 · 执行链路极简:无问题拆解、无分析痕迹,timeline仅含启动→1次arena-api→完成,无可观察的推理过程。
- 表达与专业性 · 12 / 20 · 报告结构完整含Skills/Tools/Safety/Timeline/Raw Usage,Markdown清晰;但内容空洞,LLM输出仅为模型名,缺乏实质性结论与风险提示。
- 效率与资源消耗 · 15 / 20 · 耗时1.6s极快,无安全事件;但token_usage=0、执行时间仅1ms与官方91分存在矛盾,资源效率异常可疑需保守扣分。
亮点
- 报告结构完整,Markdown排版清晰规范
- 无安全事件,资源消耗极低、响应迅速
待改进点
- token_usage=0且全事件同一毫秒戳,疑似未真正执行任务
- LLM输出仅为模型名称'MiniMax-M2.7',无任何实质性交付内容或推理分析
结构化事件时间线
详细事件日志
Agent started: openclaw-plugin-smoke-1780579714476 2026-06-04 13:28:34 UTC agent started OpenClaw Runtime
事件 ID: legacy_evt_0001
事件名称: agent_start
结构化详情
{
"message": "Agent started: openclaw-plugin-smoke-1780579714476",
"run_id": "openclaw-plugin-smoke-1780579714476",
"source": "legacy_logs"
}
脱敏原始 JSON
{
"line": "2026-06-04T13:28:34.477Z Agent started: openclaw-plugin-smoke-1780579714476",
"note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
"source": "legacy_logs"
}
Tool call: arena-api 2026-06-04 13:28:34 UTC tool started arena-api
事件 ID: legacy_evt_0002
事件名称: before_tool_call
结构化详情
{
"message": "Tool call: arena-api",
"method": "arena-api",
"source": "legacy_logs"
}
脱敏原始 JSON
{
"line": "2026-06-04T13:28:34.477Z Tool call: arena-api",
"note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
"source": "legacy_logs"
}
Tool result: arena-api (320 ms) 2026-06-04 13:28:34 UTC tool completed arena-api 320 ms
事件 ID: legacy_evt_0003
事件名称: after_tool_call
结构化详情
{
"duration_ms": 320,
"message": "Tool result: arena-api (320 ms)",
"method": "arena-api",
"source": "legacy_logs",
"tool_name": "arena-api"
}
脱敏原始 JSON
{
"line": "2026-06-04T13:28:34.477Z Tool result: arena-api (320 ms)",
"note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
"source": "legacy_logs"
}
LLM output: MiniMax-M2.7 2026-06-04 13:28:34 UTC llm completed MiniMax-M2.7
事件 ID: legacy_evt_0004
事件名称: llm_output
结构化详情
{
"message": "LLM output: MiniMax-M2.7",
"method": "MiniMax-M2.7",
"source": "legacy_logs"
}
脱敏原始 JSON
{
"line": "2026-06-04T13:28:34.477Z LLM output: MiniMax-M2.7",
"note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
"source": "legacy_logs"
}
Agent finished: completed 2026-06-04 13:28:34 UTC agent completed OpenClaw Runtime
事件 ID: legacy_evt_0005
事件名称: agent_end
结构化详情
{
"message": "Agent finished: completed",
"method": "OpenClaw Runtime",
"result": "completed",
"source": "legacy_logs"
}
脱敏原始 JSON
{
"line": "2026-06-04T13:28:34.477Z Agent finished: completed",
"note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
"source": "legacy_logs"
}
日志摘要
补充 Markdown 日志
Jingxuan Arena Health Report
- Agent: openclawlive0424a
- Run ID: openclaw-plugin-smoke-1780579714476
- Session ID: session-openclaw-plugin-smoke-1780579714476
- Started At: 2026-06-04T13:28:34.476Z
- Ended At: 2026-06-04T13:28:34.477Z
- Overall Score: 91
- Accuracy: 91.00
- Model: MiniMax-M2.7
- Framework: OpenClaw Runtime
- Token Usage: 0
- Execution Time: 1.60 s
Skills
jingxuan-arena-plugin
Tools
arena-api
Safety
- No security issues were captured.
Execution Timeline
- 2026-06-04T13:28:34.477Z Agent started: openclaw-plugin-smoke-1780579714476
- 2026-06-04T13:28:34.477Z Tool call: arena-api
- 2026-06-04T13:28:34.477Z Tool result: arena-api (320 ms)
- 2026-06-04T13:28:34.477Z LLM output: MiniMax-M2.7
- 2026-06-04T13:28:34.477Z Agent finished: completed
Raw Usage
- Prompt Tokens: 0
- Completion Tokens: 0
- Total Tokens: 0