Skill Description

Multimodal YouTube video analysis through both audio (transcript) and visual (frame extraction + image analysis) channels. Especially powerful for HowTo videos, tutorials, demos, and explainer videos where what is SHOWN (screenshots, UI demos, diagrams, code, physical actions) is just as important as what is SAID. Use this skill whenever a user wants to analyze, summarize, or create step-by-step guides from YouTube videos, or when they share a YouTube URL and want to understand what happens in the video. Triggers on requests like "Analyze this YouTube video", "Create a step-by-step guide from this video", "What does this video show?", "Summarize this tutorial", or any YouTube URL shared with analysis intent.
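The two-channel idea above can be illustrated with a minimal sketch. It is not the skill's actual implementation, which this page does not show. All names here (`Segment`, `video_id`, `frame_times`, `align`) are hypothetical, the transcript segments are assumed to arrive with start and end times in seconds, and frames are assumed to be sampled at a fixed interval. The sketch only decides which frame timestamps belong to which spoken segment; downloading, frame extraction, and image analysis are out of scope.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple
from urllib.parse import parse_qs, urlparse


@dataclass
class Segment:
    """One transcript segment (hypothetical shape, times in seconds)."""
    start: float
    end: float
    text: str


def video_id(url: str) -> Optional[str]:
    """Extract the video ID from two common YouTube URL shapes."""
    parts = urlparse(url)
    host = parts.hostname or ""
    if host == "youtu.be":
        return parts.path.lstrip("/") or None
    if host in ("youtube.com", "www.youtube.com", "m.youtube.com"):
        if parts.path == "/watch":
            return parse_qs(parts.query).get("v", [None])[0]
    return None


def frame_times(duration: float, interval: float) -> List[float]:
    """Timestamps (seconds) at which to sample frames, starting at 0."""
    if interval <= 0:
        raise ValueError("interval must be positive")
    times = []
    i = 0
    while i * interval < duration:
        times.append(i * interval)
        i += 1
    return times


def align(segments: List[Segment], times: List[float]) -> List[Tuple[str, List[float]]]:
    """Pair each segment's text with the frame timestamps that fall inside it."""
    return [
        (seg.text, [t for t in times if seg.start <= t < seg.end])
        for seg in segments
    ]
```

With pairs like these, each spoken step can be checked against what was on screen while it was said, which is the point of analyzing both channels together.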



To install, copy the prompt below and send it to your AI assistant.

Help me download and install this SKILL: https://skillhub.cstcloud.cn/download/youtube-video-analyzer


Metadata

Category: Data AI
Downloads: 3
Views: 6
Tags:
multimodal analysis audio-visual extraction step-by-step summarization