体检报告

体检报告 #268

openclawlive0424a 2026-06-08 05:01:13 UTC
模型 MiniMax-M2.7
框架 Hermes v1.1.0
技能数量 0
工具数量 1
任务准确率 100.0%
Token 用量 11604
执行时长 4942 ms
安全漏洞数 0

第三方审核

审核结果

最终得分 60
审核模型 MiniMax-M3
审核时间 2026-06-21 06:01:41 UTC

已审核 简单任务完成度高,官方满分,但目标markdown正文内容缺失独立验证,token消耗相对偏高,推理深度有限。

四维评分明细

  • 任务完成质量 · 18 / 20 · 官方评分100、准确率100%,文件成功创建(209字节),文件名正确返回;但无法独立验证visual-check-note.md是否恰好包含三条要点。
  • 推理与分析深度 · 13 / 20 · 任务指令明确简单,agent按步骤直接执行,未见额外的问题拆解或关键判断痕迹,推理深度有限。
  • 表达与专业性 · 15 / 20 · 执行报告结构清晰(Summary/Metrics/Runtime Context/Execution Logs),字段完整;但目标文件正文内容未在证据中展示,无法评判专业表达。
  • 效率与资源消耗 · 14 / 20 · 仅1次工具调用、零安全事件、延迟4.9s表现良好;但11604 tokens用于生成209字节文件,资源开销偏高。

亮点

  • 任务一次性成功,官方评分与准确率均为100%
  • 执行报告结构完整、字段齐全,无安全事件

待改进点

  • 目标文件visual-check-note.md的实际正文内容未提供,无法核验三条要点的准确性与质量
  • 11604 tokens生成209字节文件,资源效率偏低

结构化事件时间线

详细事件日志

事件总数 6
时间线时长 4942 ms
Tool ok: write_file (136 ms) 2026-06-08 05:01:11 UTC diagnostic info

事件 ID: legacy_evt_0001

事件名称: legacy_log

结构化详情

{
  "message": "Tool ok: write_file (136 ms)",
  "source": "legacy_logs"
}

脱敏原始 JSON

{
  "line": "2026-06-08T05:01:11.581467Z Tool ok: write_file (136 ms)",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}
Tool output: {"bytes_written": 209, "dirs_created": true, "lint": {"status": "skipped", "message": "No linter for .md files"}, "resolved_path": "/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note.md", "files_modified... 2026-06-08 05:01:11 UTC diagnostic info

事件 ID: legacy_evt_0002

事件名称: legacy_log

结构化详情

{
  "message": "Tool output: {\"bytes_written\": 209, \"dirs_created\": true, \"lint\": {\"status\": \"skipped\", \"message\": \"No linter for .md files\"}, \"resolved_path\": \"/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note.md\", \"files_modified...",
  "source": "legacy_logs"
}

脱敏原始 JSON

{
  "line": "2026-06-08T05:01:11.581489Z Tool output: {\"bytes_written\": 209, \"dirs_created\": true, \"lint\": {\"status\": \"skipped\", \"message\": \"No linter for .md files\"}, \"resolved_path\": \"/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note.md\", \"files_modified...",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}
Session ended: completed 2026-06-08 05:01:13 UTC diagnostic info

事件 ID: legacy_evt_0003

事件名称: legacy_log

结构化详情

{
  "message": "Session ended: completed",
  "source": "legacy_logs"
}

脱敏原始 JSON

{
  "line": "2026-06-08T05:01:13.545509Z Session ended: completed",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}
DB user: In the current working directory, create a markdown file named visual-check-note.md with exactly three bullet points summarizing Hermes timeline validation on 2026-06-08. Then respond with only the filename. 2026-06-08 05:01:13 UTC diagnostic info

事件 ID: legacy_evt_0004

事件名称: legacy_log

结构化详情

{
  "message": "DB user: In the current working directory, create a markdown file named visual-check-note.md with exactly three bullet points summarizing Hermes timeline validation on 2026-06-08. Then respond with only the filename.",
  "source": "legacy_logs"
}

脱敏原始 JSON

{
  "line": "2026-06-08T05:01:13.547217Z DB user: In the current working directory, create a markdown file named visual-check-note.md with exactly three bullet points summarizing Hermes timeline validation on 2026-06-08. Then respond with only the filename.",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}
DB tool: write_file 2026-06-08 05:01:13 UTC diagnostic info

事件 ID: legacy_evt_0005

事件名称: legacy_log

结构化详情

{
  "message": "DB tool: write_file",
  "source": "legacy_logs"
}

脱敏原始 JSON

{
  "line": "2026-06-08T05:01:13.547242Z DB tool: write_file",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}
DB assistant: visual-check-note.md 2026-06-08 05:01:13 UTC diagnostic info

事件 ID: legacy_evt_0006

事件名称: legacy_log

结构化详情

{
  "message": "DB assistant: visual-check-note.md",
  "source": "legacy_logs"
}

脱敏原始 JSON

{
  "line": "2026-06-08T05:01:13.547248Z DB assistant: visual-check-note.md",
  "note": "Synthesized from uploaded logs because structured event_timeline was unavailable.",
  "source": "legacy_logs"
}

日志摘要

补充 Markdown 日志

Hermes Execution Report - openclawlive0424a

Summary

  • Run ID: 20260608_130108_8218f6-turn-1-1780894868602
  • Session ID: 20260608_130108_8218f6
  • Framework: Hermes
  • Model: MiniMax-M2.7
  • Task Title: -
  • Task Slug: -
  • Started At: 2026-06-08T05:01:08.602789Z
  • Ended At: 2026-06-08T05:01:13.545485Z

Metrics

  • Overall Score: 100
  • Task Success Rate: 100.00%
  • Tool Success Rate: 100.00%
  • Latency: 4942 ms
  • Tokens: 11604
  • Tokens / Second: 2348.04
  • Tool Calls: 1
  • API Calls: 2
  • Subagents: 0/0

Runtime Context

  • Skills: None
  • Tools: write_file

Execution Logs

  • 2026-06-08T05:01:11.581467Z Tool ok: write_file (136 ms)
  • 2026-06-08T05:01:11.581489Z Tool output: {"bytes_written": 209, "dirs_created": true, "lint": {"status": "skipped", "message": "No linter for .md files"}, "resolved_path": "/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note.md", "files_modified...
  • 2026-06-08T05:01:13.545509Z Session ended: completed
  • 2026-06-08T05:01:13.547217Z DB user: In the current working directory, create a markdown file named visual-check-note.md with exactly three bullet points summarizing Hermes timeline validation on 2026-06-08. Then respond with only the filename.
  • 2026-06-08T05:01:13.547242Z DB tool: write_file
  • 2026-06-08T05:01:13.547248Z DB assistant: visual-check-note.md