Run PinchBench benchmarks to evaluate OpenClaw agent performance across real-world tasks. Use when testing model capabilities, comparing models, submitting benchmark results to the leaderboard, or checking how well your OpenClaw setup handles calendar, email, research, coding, and multi-step workflows.
运行PinchBench基准测试,评估OpenClaw代理在真实任务中的表现。适用于测试模型能力、比较不同模型、向排行榜提交基准结果,或检查您的OpenClaw配置在处理日历、邮件、研究、编程和多步骤工作流时的性能。
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
点击右上角 下载SKILL 按钮