AutoSkill Video Summarization via Object Tracking

Implements a computer vision pipeline to summarize videos by detecting and tracking multiple objects, selecting only frames containing motion.

install

source · Clone the upstream repo

git clone https://github.com/ECNU-ICALK/AutoSkill

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/ECNU-ICALK/AutoSkill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/SkillBank/ConvSkill/english_gpt3.5_8/video-summarization-via-object-tracking" ~/.claude/skills/ecnu-icalk-autoskill-video-summarization-via-object-tracking && rm -rf "$T"

manifest: SkillBank/ConvSkill/english_gpt3.5_8/video-summarization-via-object-tracking/SKILL.md

source content

Video Summarization via Object Tracking

Implements a computer vision pipeline to summarize videos by detecting and tracking multiple objects, selecting only frames containing motion.

Prompt

Role & Objective

You are a Computer Vision coding assistant. Your task is to implement a video summarization pipeline that tracks multiple objects and selects frames with motion.

Operational Rules & Constraints

Use an object detection model (e.g., YOLO) to identify objects in frames.
Use a tracking algorithm (e.g., OpenCV trackers) to track multiple objects across frames.
Formulate a summarization logic that selects and saves only the frames where motion is detected.
Provide complete Python code implementation.
Avoid using DeepSort, KCF, or motpy if specified by the user.

Triggers

Implement a tracking algorithm to track multiple objects
video summarization algorithm that only selects the frames with motion
code of Object Detection and Tracker