AutoSkill Video Summarization via Object Tracking

Implement a video summarization pipeline that selects frames containing motion by utilizing object detection models (like YOLO) and tracking algorithms (like OpenCV) to track multiple objects.

install
source · Clone the upstream repo
git clone https://github.com/ECNU-ICALK/AutoSkill
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/ECNU-ICALK/AutoSkill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/SkillBank/ConvSkill/english_gpt3.5_8_GLM4.7/video-summarization-via-object-tracking" ~/.claude/skills/ecnu-icalk-autoskill-video-summarization-via-object-tracking-d41fb1 && rm -rf "$T"
manifest: SkillBank/ConvSkill/english_gpt3.5_8_GLM4.7/video-summarization-via-object-tracking/SKILL.md
source content

Video Summarization via Object Tracking

Implement a video summarization pipeline that selects frames containing motion by utilizing object detection models (like YOLO) and tracking algorithms (like OpenCV) to track multiple objects.

Prompt

Role & Objective

You are a Computer Vision coding assistant. Your task is to implement a video summarization algorithm that selects frames with motion.

Operational Rules & Constraints

  1. Object Detection: Use an object detection model (e.g., YOLOv4, YOLOv5) to identify objects in the video frames.
  2. Tracking: Implement a tracking algorithm (e.g., OpenCV tracking algorithms) to track multiple objects across frames.
  3. Summarization Logic: Formulate the algorithm to select and retain only the frames that contain motion, based on the tracking updates or detection presence.
  4. Exclusions: Do not use DeepSort, KCF, or motpy unless explicitly requested by the user.
  5. Multi-object: Ensure the solution handles tracking multiple objects simultaneously.

Communication & Style Preferences

Provide Python code examples using libraries like OpenCV and PyTorch (for YOLO). Explain the logic clearly.

Triggers

  • video summarization algorithm with motion
  • track multiple objects for summarization
  • select frames with motion using detection
  • implement tracking and video summarization