
install
source · Clone the upstream repo
git clone https://github.com/openclaw/skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/aiwithabidi/a6-gemini-video-analyzer" ~/.claude/skills/openclaw-skills-gemini-video-analyzer && rm -rf "$T"
OpenClaw · Install into ~/.openclaw/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/aiwithabidi/a6-gemini-video-analyzer" ~/.openclaw/skills/openclaw-skills-gemini-video-analyzer && rm -rf "$T"
manifest: skills/aiwithabidi/a6-gemini-video-analyzer/SKILL.md
source content

Gemini Video Analyzer

Analyze videos natively using Google Gemini's multimodal API. No frame extraction is needed: Gemini samples the video at 1 frame per second and interprets motion, audio, and visuals together.

Quick Start

# Analyze a video with default prompt (full description)
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4

# Ask a specific question
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"

# Manage uploaded files
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup

Supported Formats

MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.
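
The limits above can be checked locally before spending time on an upload. The following is a minimal sketch, not part of the skill's scripts: the `check_video` helper and its constants are hypothetical names, and it assumes "2GB" means 2 GiB and that the file extension reliably reflects the container format.

```python
from pathlib import Path

# Extensions taken from the supported-formats list above.
SUPPORTED_EXTENSIONS = {
    ".mp4", ".avi", ".mov", ".mkv", ".webm",
    ".flv", ".mpeg", ".mpg", ".wmv", ".3gp",
}
# Assumption: the 2GB limit is interpreted as 2 GiB.
MAX_BYTES = 2 * 1024**3


def check_video(path, size_bytes=None):
    """Return a list of problems; an empty list means the file looks uploadable.

    size_bytes can be passed explicitly (useful for testing); otherwise the
    size is read from disk.
    """
    p = Path(path)
    problems = []
    if p.suffix.lower() not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {p.suffix or '(none)'}")
    if size_bytes is None:
        size_bytes = p.stat().st_size
    if size_bytes > MAX_BYTES:
        problems.append(f"file too large: {size_bytes} bytes (limit {MAX_BYTES})")
    return problems
```

Calling `check_video("/path/to/video.mp4")` before running analyze.py would surface an unsupported extension or an oversized file without a network round trip.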

How It Works

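  1. The video is uploaded to Google's Files API (temporary storage; files are deleted automatically after 48 hours).
  2. Gemini samples the video at 1 frame per second and interprets motion, transitions, and audio context.
  3. The model generates a response based on your prompt.

Because the model sees the sampled frames in sequence along with the audio, this typically captures temporal content better than analyzing individually extracted frames.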
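
The upload, wait, and generate flow described above can be sketched roughly as follows. This is an illustration, not the contents of scripts/analyze.py: it assumes the `google-genai` Python SDK is installed, that uploaded files report a `PROCESSING`, `ACTIVE`, or `FAILED` state, and the names `wait_until_active`, `analyze_video`, and the default prompt text are hypothetical.

```python
import os
import time


def wait_until_active(fetch_state, poll_seconds=2.0, timeout_seconds=300.0,
                      sleep=time.sleep):
    """Poll fetch_state() until it returns "ACTIVE".

    Returns the approximate number of seconds spent waiting. Raises
    RuntimeError on "FAILED" and TimeoutError if the limit is exceeded.
    """
    waited = 0.0
    while True:
        state = fetch_state()
        if state == "ACTIVE":
            return waited
        if state == "FAILED":
            raise RuntimeError("video processing failed")
        if waited >= timeout_seconds:
            raise TimeoutError(f"file still {state} after {waited:.0f}s")
        sleep(poll_seconds)
        waited += poll_seconds


def analyze_video(path, prompt="Describe this video in detail.",
                  model="gemini-2.5-flash"):
    """Upload a video, wait for processing, and ask the model about it."""
    # Imported lazily so the polling helper can be used without the SDK.
    from google import genai  # assumption: pip install google-genai

    client = genai.Client(api_key=os.environ["GOOGLE_AI_API_KEY"])
    uploaded = client.files.upload(file=path)  # step 1: Files API upload
    # Step 2: the file must finish server-side processing before use.
    wait_until_active(
        lambda: client.files.get(name=uploaded.name).state.name
    )
    # Step 3: send the file reference together with the text prompt.
    response = client.models.generate_content(
        model=model, contents=[uploaded, prompt]
    )
    return response.text
```

Separating the polling loop from the SDK calls keeps the retry logic testable with a fake state source, which is useful because real processing time varies with video length.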

Use Cases

| Task | Example Prompt |
| --- | --- |
| General description | (default, no prompt needed) |
| UI/text extraction | "What text and UI elements are visible?" |
| Tutorial summary | "Summarize the steps shown in this tutorial" |
| Bug report from video | "Describe what went wrong in this screen recording" |
| Meeting notes | "Summarize the key points discussed" |
| Content comparison | Upload 2 videos, ask for differences |

Configuration

Set GOOGLE_AI_API_KEY in your environment or .env file. Get a free key at aistudio.google.com.
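
One way a script could resolve the key from either location is sketched below. This is an assumption about behavior, not the skill's actual loader: `load_api_key` is a hypothetical helper, it prefers the environment over the file, and it only understands simple `NAME=value` lines (no `export` prefix, no multi-line values).

```python
import os


def load_api_key(env_path=".env", environ=None):
    """Return GOOGLE_AI_API_KEY from the environment, else from a .env file."""
    environ = os.environ if environ is None else environ
    key = environ.get("GOOGLE_AI_API_KEY")
    if key:
        return key
    try:
        with open(env_path, encoding="utf-8") as fh:
            for line in fh:
                line = line.strip()
                # Skip blanks, comments, and lines that are not assignments.
                if not line or line.startswith("#") or "=" not in line:
                    continue
                name, _, value = line.partition("=")
                if name.strip() == "GOOGLE_AI_API_KEY":
                    # Drop optional surrounding quotes.
                    return value.strip().strip("\"'")
    except FileNotFoundError:
        pass
    raise RuntimeError("GOOGLE_AI_API_KEY is not set in the environment or .env")
```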

Default model: gemini-2.5-flash (fast, cheap, excellent vision). Override with --model gemini-2.5-pro for complex analysis.
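
For readers wondering how the positional arguments and the --model override fit together, here is a hedged sketch of a command-line parser with the same shape as the Quick Start invocations. The real parser in scripts/analyze.py may differ; `build_parser` and the default prompt text are hypothetical.

```python
import argparse

DEFAULT_MODEL = "gemini-2.5-flash"
# Hypothetical default prompt; the skill's actual default text is not shown here.
DEFAULT_PROMPT = "Describe this video in detail."


def build_parser():
    """Build a parser: required video path, optional prompt, optional --model."""
    parser = argparse.ArgumentParser(description="Analyze a video with Gemini")
    parser.add_argument("video", help="path to the video file")
    parser.add_argument("prompt", nargs="?", default=DEFAULT_PROMPT,
                        help="question to ask about the video")
    parser.add_argument("--model", default=DEFAULT_MODEL,
                        help="Gemini model name, e.g. gemini-2.5-pro")
    return parser
```

With this shape, omitting both the prompt and --model falls back to a full description using the default model, matching the first Quick Start command.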

API Reference

See references/gemini-files-api.md for file upload limits, processing details, and advanced options.