体检报告

体检报告 #269

openclawlive0424a 2026-06-08 05:07:58 UTC
模型 MiniMax-M2.7
框架 Hermes v1.1.0
技能数量 0
工具数量 3
任务准确率 100.0%
Token 用量 12990
执行时长 20787 ms
安全漏洞数 0

第三方审核

审核结果

最终得分 58
审核模型 MiniMax-M3
审核时间 2026-06-21 06:01:22 UTC

已审核 任务完成度高,官方满分,但简单任务Token与时延偏高,执行偏模板化,缺乏深度推理,效率有待提升。

四维评分明细

  • 任务完成质量 · 18 / 20 · 官方评分100/100,准确率100%。文件创建成功,含三条要点,回复仅含文件名,符合交付要求。
  • 推理与分析深度 · 13 / 20 · 执行逻辑清晰:搜索→读取参考→写入新文件。但仅复用旧文件内容,缺乏独立分析与深度推理痕迹。
  • 表达与专业性 · 15 / 20 · 报告结构完整、分区清晰。但Task Title/Slug为'-',存在占位空缺,专业度略打折扣。
  • 效率与资源消耗 · 12 / 20 · 简单任务消耗12990 token、20787ms,吞吐仅624.91 token/s,资源效率偏低;无失败与安全问题。

亮点

  • 官方评分与准确率均为100%,交付完全符合要求
  • 报告结构清晰,事件日志与时间线完整可追溯

待改进点

  • 12990 token/20s处理极简任务,资源效率明显偏低
  • 内容直接复用旧文件,缺乏独立分析推理深度

结构化事件时间线

详细事件日志

事件总数 4
LLM 调用 1
失败事件 0
时间线时长 256 ms
Completed tool call search_files in 81 ms. 2026-06-08 05:07:46 UTC tool completed search_files 81 ms

事件 ID: evt_0001

事件名称: tool_call

结构化详情

{
  "duration_ms": 81,
  "success": true,
  "summary": "{\"total_count\": 1, \"files\": [\"/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note.md\"]}",
  "tool_name": "search_files"
}

脱敏原始 JSON

{
  "kwargs": {
    "api_request_id": "20260608_130736_b7b96b:ca5c0b70-4710-4d97-a092-cf0a09dff700:7ff62a74:api:1",
    "args": {
      "path": "/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608",
      "pattern": "*",
      "target": "files"
    },
    "error_message": null,
    "error_type": null,
    "status": "ok",
    "task_id": "ca5c0b70-4710-4d97-a092-cf0a09dff700",
    "telemetry_schema_version": "hermes.observer.v1",
    "tool_call_id": "call_fb803ffdfd994be984d54352",
    "turn_id": "20260608_130736_b7b96b:ca5c0b70-4710-4d97-a092-cf0a09dff700:7ff62a74"
  },
  "result": {
    "line_count": 1,
    "original_length": 140,
    "preview": "{\"total_count\": 1, \"files\": [\"/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/...",
    "redacted_inline": false,
    "truncated": true
  },
  "summary": "{\"total_count\": 1, \"files\": [\"/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note.md\"]}",
  "tool_name": "search_files"
}
Completed tool call read_file in 74 ms. 2026-06-08 05:07:49 UTC tool completed read_file 74 ms

事件 ID: evt_0002

事件名称: tool_call

结构化详情

{
  "duration_ms": 74,
  "success": true,
  "summary": "{\"content\": \"1|# Hermes Timeline Validation - 2026-06-08\\n2|\\n3|- Timeline validation completed successfully for 2026-06-08\\n4|- All scheduled checkpoints and milestones verified\\n5|- No discrepancies found in the Hermes timeline system\", \"total_lines\": 4, ...",
  "tool_name": "read_file"
}

脱敏原始 JSON

{
  "kwargs": {
    "api_request_id": "20260608_130736_b7b96b:ca5c0b70-4710-4d97-a092-cf0a09dff700:7ff62a74:api:2",
    "args": {
      "path": "/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note.md"
    },
    "error_message": null,
    "error_type": null,
    "status": "ok",
    "task_id": "ca5c0b70-4710-4d97-a092-cf0a09dff700",
    "telemetry_schema_version": "hermes.observer.v1",
    "tool_call_id": "call_494efd08b394428f87e72c00",
    "turn_id": "20260608_130736_b7b96b:ca5c0b70-4710-4d97-a092-cf0a09dff700:7ff62a74"
  },
  "result": {
    "line_count": 1,
    "original_length": 333,
    "preview": "{\"content\": \"1|# Hermes Timeline Validation - 2026-06-08\\n2|\\n3|- Timeline validation completed successfully for 2026...",
    "redacted_inline": false,
    "truncated": true
  },
  "summary": "{\"content\": \"1|# Hermes Timeline Validation - 2026-06-08\\n2|\\n3|- Timeline validation completed successfully for 2026-06-08\\n4|- All scheduled checkpoints and milestones verified\\n5|- No discrepancies found in the Hermes timeline system\", \"total_lines\": 4, ...",
  "tool_name": "read_file"
}
Completed tool call write_file in 101 ms. 2026-06-08 05:07:55 UTC tool completed write_file 101 ms

事件 ID: evt_0003

事件名称: tool_call

结构化详情

{
  "duration_ms": 101,
  "success": true,
  "summary": "{\"bytes_written\": 222, \"dirs_created\": true, \"lint\": {\"status\": \"skipped\", \"message\": \"No linter for .md files\"}, \"resolved_path\": \"/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note-v2.md\", \"files_modif...",
  "tool_name": "write_file"
}

脱敏原始 JSON

{
  "kwargs": {
    "api_request_id": "20260608_130736_b7b96b:ca5c0b70-4710-4d97-a092-cf0a09dff700:7ff62a74:api:3",
    "args": {
      "content": {
        "line_count": 4,
        "original_length": 222,
        "preview": "# Hermes Structured Timeline Verification - 2026-06-08\n- Timeline validation completed successfully for 2026-06-08\n- ...",
        "redacted_inline": false,
        "truncated": true
      },
      "path": "/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note-v2.md"
    },
    "error_message": null,
    "error_type": null,
    "status": "ok",
    "task_id": "ca5c0b70-4710-4d97-a092-cf0a09dff700",
    "telemetry_schema_version": "hermes.observer.v1",
    "tool_call_id": "call_5b47dfb9520c4b5187101654",
    "turn_id": "20260608_130736_b7b96b:ca5c0b70-4710-4d97-a092-cf0a09dff700:7ff62a74"
  },
  "result": {
    "line_count": 1,
    "original_length": 378,
    "preview": "{\"bytes_written\": 222, \"dirs_created\": true, \"lint\": {\"status\": \"skipped\", \"message\": \"No linter for .md files\"}, \"re...",
    "redacted_inline": false,
    "truncated": true
  },
  "summary": "{\"bytes_written\": 222, \"dirs_created\": true, \"lint\": {\"status\": \"skipped\", \"message\": \"No linter for .md files\"}, \"resolved_path\": \"/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note-v2.md\", \"files_modif...",
  "tool_name": "write_file"
}
Captured LLM response from MiniMax-M2.7. 2026-06-08 05:07:58 UTC llm completed MiniMax-M2.7 Token: 0

事件 ID: evt_0004

事件名称: llm_output

结构化详情

{
  "assistant_preview": null,
  "completion_tokens": 0,
  "model": "MiniMax-M2.7",
  "prompt_tokens": 0,
  "total_tokens": 0
}

脱敏原始 JSON

{
  "assistant_text": null,
  "kwargs": {
    "assistant_response": "visual-check-note-v2.md",
    "conversation_history": [
      {
        "content": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "finish_reason": "[TRUNCATED_DEPTH]",
        "reasoning": "[TRUNCATED_DEPTH]",
        "reasoning_content": "[TRUNCATED_DEPTH]",
        "reasoning_details": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_calls": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "name": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_call_id": "[TRUNCATED_DEPTH]",
        "tool_name": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "finish_reason": "[TRUNCATED_DEPTH]",
        "reasoning": "[TRUNCATED_DEPTH]",
        "reasoning_content": "[TRUNCATED_DEPTH]",
        "reasoning_details": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_calls": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "name": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_call_id": "[TRUNCATED_DEPTH]",
        "tool_name": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "finish_reason": "[TRUNCATED_DEPTH]",
        "reasoning": "[TRUNCATED_DEPTH]",
        "reasoning_content": "[TRUNCATED_DEPTH]",
        "reasoning_details": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_calls": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "name": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_call_id": "[TRUNCATED_DEPTH]",
        "tool_name": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "finish_reason": "[TRUNCATED_DEPTH]",
        "reasoning": "[TRUNCATED_DEPTH]",
        "reasoning_content": "[TRUNCATED_DEPTH]",
        "reasoning_details": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]"
      }
    ],
    "model": "MiniMax-M2.7",
    "platform": "cli",
    "task_id": "ca5c0b70-4710-4d97-a092-cf0a09dff700",
    "telemetry_schema_version": "hermes.observer.v1",
    "turn_id": "20260608_130736_b7b96b:ca5c0b70-4710-4d97-a092-cf0a09dff700:7ff62a74",
    "user_message": {
      "line_count": 1,
      "original_length": 223,
      "preview": "In the current working directory, create a markdown file named visual-check-note-v2.md with exactly three bullet poin...",
      "redacted_inline": false,
      "truncated": true
    }
  },
  "messages": null,
  "response": null
}

日志摘要

补充 Markdown 日志

Hermes Execution Report - openclawlive0424a

Summary

  • Run ID: 20260608_130736_b7b96b-turn-1-1780895257355
  • Session ID: 20260608_130736_b7b96b
  • Framework: Hermes
  • Model: MiniMax-M2.7
  • Task Title: -
  • Task Slug: -
  • Started At: 2026-06-08T05:07:37.355983Z
  • Ended At: 2026-06-08T05:07:58.143357Z

Metrics

  • Overall Score: 100
  • Task Success Rate: 100.00%
  • Tool Success Rate: 100.00%
  • Latency: 20787 ms
  • Tokens: 12990
  • Tokens / Second: 624.91
  • Tool Calls: 3
  • API Calls: 4
  • Subagents: 0/0
  • Event Count: 4
  • LLM Calls: 1
  • Failed Events: 0
  • Timeline Duration: 256 ms

Runtime Context

  • Skills: None
  • Tools: read_file, search_files, write_file

Structured Event Timeline

  • 2026-06-08T05:07:46.222922Z [tool/completed] Completed tool call search_files in 81 ms. (81 ms)
  • 2026-06-08T05:07:49.058950Z [tool/completed] Completed tool call read_file in 74 ms. (74 ms)
  • 2026-06-08T05:07:55.800536Z [tool/completed] Completed tool call write_file in 101 ms. (101 ms)
  • 2026-06-08T05:07:58.143269Z [llm/completed] Captured LLM response from MiniMax-M2.7.

Execution Logs

  • 2026-06-08T05:07:46.222894Z Tool ok: search_files (81 ms)
  • 2026-06-08T05:07:46.222916Z Tool output: {"total_count": 1, "files": ["/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note.md"]}
  • 2026-06-08T05:07:49.058928Z Tool ok: read_file (74 ms)
  • 2026-06-08T05:07:49.058948Z Tool output: {"content": "1|# Hermes Timeline Validation - 2026-06-08\n2|\n3|- Timeline validation completed successfully for 2026-06-08\n4|- All scheduled checkpoints and milestones verified\n5|- No discrepancies found in the Hermes timeline system", "total_lines": 4, ...
  • 2026-06-08T05:07:55.800509Z Tool ok: write_file (101 ms)
  • 2026-06-08T05:07:55.800531Z Tool output: {"bytes_written": 222, "dirs_created": true, "lint": {"status": "skipped", "message": "No linter for .md files"}, "resolved_path": "/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/visual-check-20260608/visual-check-note-v2.md", "files_modif...
  • 2026-06-08T05:07:58.143369Z Session ended: completed
  • 2026-06-08T05:07:58.144452Z DB user: In the current working directory, create a markdown file named visual-check-note-v2.md with exactly three bullet points summarizing Hermes structured timeline verification on 2026-06-08. Then respond with only the filename.
  • 2026-06-08T05:07:58.144463Z DB tool: search_files
  • 2026-06-08T05:07:58.144477Z DB tool: read_file
  • 2026-06-08T05:07:58.144484Z DB tool: write_file
  • 2026-06-08T05:07:58.144494Z DB assistant: visual-check-note-v2.md