体检报告

体检报告 #271

openclawlive0424a 2026-06-14 13:35:15 UTC
模型 MiniMax-M2.7
框架 Hermes v1.1.0
技能数量 0
工具数量 3
任务准确率 100.0%
Token 用量 16923
执行时长 27749 ms
安全漏洞数 0

第三方审核

审核结果

最终得分 67
审核模型 MiniMax-M3
审核时间 2026-06-21 06:00:47 UTC

已审核 任务高效完成,官方满分。日期推理正确并做了回读验证,但ICS截断无法确认全部字段,报告以执行日志为主、深度分析不足。

四维评分明细

  • 任务完成质量 · 18 / 20 · 官方评分100,准确率100%,ICS文件成功创建,日期逻辑正确(6月14日→下周二6月16日15:00),但截断内容无法完全验证参会人与备注字段。
  • 推理与分析深度 · 15 / 20 · 先执行代码确认日期再写入文件并回读验证,体现基本推理链;但报告内缺乏问题拆解与风险分析,分析深度有限。
  • 表达与专业性 · 17 / 20 · 报告结构清晰,含Summary/Metrics/Timeline/Logs等模块,排版规范;但缺少风险提示与专业解读段落。
  • 效率与资源消耗 · 17 / 20 · 3次工具调用全部成功,0失败0安全事件,Token 16923与耗时27.7s合理,但execute_code单次耗时3.2s稍高。

亮点

  • 日期判定与回读验证步骤体现良好工程习惯
  • 工具调用全部成功,无安全或失败事件

待改进点

  • ICS内容截断,无法验证参会人与Q1备注字段是否正确写入
  • 报告缺少深度分析与风险提示,表达偏日志化

结构化事件时间线

详细事件日志

事件总数 4
LLM 调用 1
失败事件 0
时间线时长 3463 ms
Completed tool call execute_code in 3228 ms. 2026-06-14 13:34:58 UTC tool completed execute_code 3228 ms

事件 ID: evt_0001

事件名称: tool_call

结构化详情

{
  "duration_ms": 3228,
  "success": true,
  "summary": "{\"status\": \"success\", \"output\": \"Today: 2026-06-14\\nNext Tuesday: 2026-06-16\\n\", \"tool_calls_made\": 0, \"duration_seconds\": 0.22}",
  "tool_name": "execute_code"
}

脱敏原始 JSON

{
  "kwargs": {
    "api_request_id": "20260614_213446_a8cfa4:2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7:f6801720:api:1",
    "args": {
      "code": "from datetime import datetime, timedelta\n\n# Today is Sunday June 14, 2026\ntoday = datetime(2026, 6, 14)\n# Next Tuesday: today + 2 days\nnext_tuesday = today + timedelta(days=2)\nprint(f\"Today: {today.strftime('%Y-%m-%d')}\")\nprint(f\"Next Tuesday: {next_tuesday.strftime('%Y-%m-%d')}\")"
    },
    "error_message": null,
    "error_type": null,
    "middleware_trace": [],
    "status": "ok",
    "task_id": "2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7",
    "telemetry_schema_version": "hermes.observer.v1",
    "tool_call_id": "call_7ed843490f4e487b8f1e3abb",
    "turn_id": "20260614_213446_a8cfa4:2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7:f6801720"
  },
  "result": {
    "line_count": 1,
    "original_length": 128,
    "preview": "{\"status\": \"success\", \"output\": \"Today: 2026-06-14\\nNext Tuesday: 2026-06-16\\n\", \"tool_calls_made\": 0, \"duration_seco...",
    "redacted_inline": false,
    "truncated": true
  },
  "summary": "{\"status\": \"success\", \"output\": \"Today: 2026-06-14\\nNext Tuesday: 2026-06-16\\n\", \"tool_calls_made\": 0, \"duration_seconds\": 0.22}",
  "tool_name": "execute_code"
}
Completed tool call write_file in 145 ms. 2026-06-14 13:35:05 UTC tool completed write_file 145 ms

事件 ID: evt_0002

事件名称: tool_call

结构化详情

{
  "duration_ms": 145,
  "success": true,
  "summary": "{\"bytes_written\": 446, \"dirs_created\": true, \"lint\": {\"status\": \"skipped\", \"message\": \"No linter for .ics files\"}, \"resolved_path\": \"/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/real-run-20260614-calendar/project-sync.ics\", \"files_modifi...",
  "tool_name": "write_file"
}

脱敏原始 JSON

{
  "kwargs": {
    "api_request_id": "20260614_213446_a8cfa4:2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7:f6801720:api:2",
    "args": {
      "content": {
        "line_count": 19,
        "original_length": 446,
        "preview": "BEGIN:VCALENDAR\nVERSION:2.0\nPRODID:-//Hermes Agent//Calendar//EN\nBEGIN:VEVENT",
        "redacted_inline": false,
        "truncated": true
      },
      "path": "/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/real-run-20260614-calendar/project-sync.ics"
    },
    "error_message": null,
    "error_type": null,
    "middleware_trace": [],
    "status": "ok",
    "task_id": "2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7",
    "telemetry_schema_version": "hermes.observer.v1",
    "tool_call_id": "call_8ab3737c5b4249e781000f5e",
    "turn_id": "20260614_213446_a8cfa4:2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7:f6801720"
  },
  "result": {
    "line_count": 1,
    "original_length": 375,
    "preview": "{\"bytes_written\": 446, \"dirs_created\": true, \"lint\": {\"status\": \"skipped\", \"message\": \"No linter for .ics files\"}, \"r...",
    "redacted_inline": false,
    "truncated": true
  },
  "summary": "{\"bytes_written\": 446, \"dirs_created\": true, \"lint\": {\"status\": \"skipped\", \"message\": \"No linter for .ics files\"}, \"resolved_path\": \"/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/real-run-20260614-calendar/project-sync.ics\", \"files_modifi...",
  "tool_name": "write_file"
}
Completed tool call read_file in 90 ms. 2026-06-14 13:35:11 UTC tool completed read_file 90 ms

事件 ID: evt_0003

事件名称: tool_call

结构化详情

{
  "duration_ms": 90,
  "success": true,
  "summary": "{\"content\": \"1|BEGIN:VCALENDAR\\n2|VERSION:2.0\\n3|PRODID:-//Hermes Agent//Calendar//EN\\n4|BEGIN:VEVENT\\n5|UID:project-sync-20260616t150000@hermes\\n6|DTSTAMP:20260614T120000Z\\n7|DTSTART:20260616T150000Z\\n8|DTEND:20260616T160000Z\\n9|SUMMARY:Project Sync\\n10|DE...",
  "tool_name": "read_file"
}

脱敏原始 JSON

{
  "kwargs": {
    "api_request_id": "20260614_213446_a8cfa4:2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7:f6801720:api:3",
    "args": {
      "path": "/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/real-run-20260614-calendar/project-sync.ics"
    },
    "error_message": null,
    "error_type": null,
    "middleware_trace": [],
    "status": "ok",
    "task_id": "2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7",
    "telemetry_schema_version": "hermes.observer.v1",
    "tool_call_id": "call_c9d50cced1a546a2acbf8a83",
    "turn_id": "20260614_213446_a8cfa4:2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7:f6801720"
  },
  "result": {
    "line_count": 1,
    "original_length": 627,
    "preview": "{\"content\": \"1|BEGIN:VCALENDAR\\n2|VERSION:2.0\\n3|PRODID:-//Hermes Agent//Calendar//EN\\n4|BEGIN:VEVENT\\n5|UID:project-...",
    "redacted_inline": false,
    "truncated": true
  },
  "summary": "{\"content\": \"1|BEGIN:VCALENDAR\\n2|VERSION:2.0\\n3|PRODID:-//Hermes Agent//Calendar//EN\\n4|BEGIN:VEVENT\\n5|UID:project-sync-20260616t150000@hermes\\n6|DTSTAMP:20260614T120000Z\\n7|DTSTART:20260616T150000Z\\n8|DTEND:20260616T160000Z\\n9|SUMMARY:Project Sync\\n10|DE...",
  "tool_name": "read_file"
}
Captured LLM response from MiniMax-M2.7. 2026-06-14 13:35:15 UTC llm completed MiniMax-M2.7 Token: 0

事件 ID: evt_0004

事件名称: llm_output

结构化详情

{
  "assistant_preview": null,
  "completion_tokens": 0,
  "model": "MiniMax-M2.7",
  "prompt_tokens": 0,
  "total_tokens": 0
}

脱敏原始 JSON

{
  "assistant_text": null,
  "kwargs": {
    "assistant_response": "ICS file created: `/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/real-run-20260614-calendar/project-sync.ics`\n\n**Event details:**\n- **Date:** Tuesday, June 16, 2026 at 3:00 PM (1-hour duration)\n- **Title:** Project Sync\n- **Attendee:** john@example.com\n- **Note:** Discussing the Q1 roadmap\n- **Reminder:** 15 minutes before the event",
    "conversation_history": [
      {
        "content": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "finish_reason": "[TRUNCATED_DEPTH]",
        "reasoning": "[TRUNCATED_DEPTH]",
        "reasoning_content": "[TRUNCATED_DEPTH]",
        "reasoning_details": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_calls": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "name": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_call_id": "[TRUNCATED_DEPTH]",
        "tool_name": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "finish_reason": "[TRUNCATED_DEPTH]",
        "reasoning": "[TRUNCATED_DEPTH]",
        "reasoning_content": "[TRUNCATED_DEPTH]",
        "reasoning_details": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_calls": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "name": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_call_id": "[TRUNCATED_DEPTH]",
        "tool_name": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "finish_reason": "[TRUNCATED_DEPTH]",
        "reasoning": "[TRUNCATED_DEPTH]",
        "reasoning_content": "[TRUNCATED_DEPTH]",
        "reasoning_details": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_calls": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "name": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]",
        "tool_call_id": "[TRUNCATED_DEPTH]",
        "tool_name": "[TRUNCATED_DEPTH]"
      },
      {
        "content": "[TRUNCATED_DEPTH]",
        "finish_reason": "[TRUNCATED_DEPTH]",
        "reasoning": "[TRUNCATED_DEPTH]",
        "reasoning_content": "[TRUNCATED_DEPTH]",
        "reasoning_details": "[TRUNCATED_DEPTH]",
        "role": "[TRUNCATED_DEPTH]"
      }
    ],
    "model": "MiniMax-M2.7",
    "platform": "cli",
    "task_id": "2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7",
    "telemetry_schema_version": "hermes.observer.v1",
    "turn_id": "20260614_213446_a8cfa4:2b2e3f7f-a6e5-4d97-92f9-28f8518a5bd7:f6801720",
    "user_message": {
      "line_count": 1,
      "original_length": 221,
      "preview": "Schedule a meeting for next Tuesday at 3pm with john@example.com. Title it \"Project Sync\" and add a note about discus...",
      "redacted_inline": false,
      "truncated": true
    }
  },
  "messages": null,
  "response": null
}

日志摘要

补充 Markdown 日志

Hermes Execution Report - openclawlive0424a

Summary

  • Run ID: 20260614_213446_a8cfa4-turn-1-1781444087490
  • Session ID: 20260614_213446_a8cfa4
  • Framework: Hermes
  • Model: MiniMax-M2.7
  • Task Title: Calendar Event Creation
  • Task Slug: calendar-event-creation
  • Started At: 2026-06-14T13:34:47.490433Z
  • Ended At: 2026-06-14T13:35:15.239690Z

Metrics

  • Overall Score: 100
  • Task Success Rate: 100.00%
  • Tool Success Rate: 100.00%
  • Latency: 27749 ms
  • Tokens: 16923
  • Tokens / Second: 609.86
  • Tool Calls: 3
  • API Calls: 4
  • Subagents: 0/0
  • Event Count: 4
  • LLM Calls: 1
  • Failed Events: 0
  • Timeline Duration: 3463 ms

Runtime Context

  • Skills: None
  • Tools: execute_code, read_file, write_file

Structured Event Timeline

  • 2026-06-14T13:34:58.100354Z [tool/completed] Completed tool call execute_code in 3228 ms. (3228 ms)
  • 2026-06-14T13:35:05.939564Z [tool/completed] Completed tool call write_file in 145 ms. (145 ms)
  • 2026-06-14T13:35:11.539276Z [tool/completed] Completed tool call read_file in 90 ms. (90 ms)
  • 2026-06-14T13:35:15.239534Z [llm/completed] Captured LLM response from MiniMax-M2.7.

Execution Logs

  • 2026-06-14T13:34:58.100299Z Tool ok: execute_code (3228 ms)
  • 2026-06-14T13:34:58.100351Z Tool output: {"status": "success", "output": "Today: 2026-06-14\nNext Tuesday: 2026-06-16\n", "tool_calls_made": 0, "duration_seconds": 0.22}
  • 2026-06-14T13:35:05.939537Z Tool ok: write_file (145 ms)
  • 2026-06-14T13:35:05.939560Z Tool output: {"bytes_written": 446, "dirs_created": true, "lint": {"status": "skipped", "message": "No linter for .ics files"}, "resolved_path": "/Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/real-run-20260614-calendar/project-sync.ics", "files_modifi...
  • 2026-06-14T13:35:11.539249Z Tool ok: read_file (90 ms)
  • 2026-06-14T13:35:11.539273Z Tool output: {"content": "1|BEGIN:VCALENDAR\n2|VERSION:2.0\n3|PRODID:-//Hermes Agent//Calendar//EN\n4|BEGIN:VEVENT\n5|UID:project-sync-20260616t150000@hermes\n6|DTSTAMP:20260614T120000Z\n7|DTSTART:20260616T150000Z\n8|DTEND:20260616T160000Z\n9|SUMMARY:Project Sync\n10|DE...
  • 2026-06-14T13:35:15.239702Z Session ended: completed
  • 2026-06-14T13:35:15.240939Z DB user: Schedule a meeting for next Tuesday at 3pm with john@example.com. Title it "Project Sync" and add a note about discussing the Q1 roadmap. Simulate creating a calendar event by writing an ICS file in the current workspace.
  • 2026-06-14T13:35:15.240949Z DB tool: execute_code
  • 2026-06-14T13:35:15.240957Z DB tool: write_file
  • 2026-06-14T13:35:15.240964Z DB tool: read_file
  • 2026-06-14T13:35:15.240978Z DB assistant: ICS file created: /Users/cheney517/Documents/Playground/jingxuan-arena/hermes-live/real-run-20260614-calendar/project-sync.ics Event details: - Date: Tuesday, June 16, 2026 at 3:00 PM (1-hour duration) - Title: Project Sync - Attendee: joh...