CSTCloud SkillHub - 安全可信的 AI Skill 广场

技能说明

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or ...

中文介绍

通过自动化指标、人工反馈和基准测试，为大语言模型应用实施全面的评估策略，适用于测试大语言模型性能、衡量AI应用质量等场景

直接复制以下提示词，发送给你的 AI 助手即可完成安装。

帮我下载并安装这个SKILL：https://skillhub.cstcloud.cn/download/llm-evaluation

点击右上角下载SKILL 按钮

元信息

分类：Test & Security

下载：5

浏览：4

标签：

automated metrics human feedback benchmarking