AutoSkill Stable Frame OCR with Green Spectrum Check
A computer vision skill that extracts text from video feeds only when the display is active (detected via green spectrum analysis) and the frame has remained stable for a specific duration, utilizing OpenCV sliders for dynamic ROI selection.
install
source · Clone the upstream repo
git clone https://github.com/ECNU-ICALK/AutoSkill
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/ECNU-ICALK/AutoSkill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/SkillBank/ConvSkill/english_gpt4_8_GLM4.7/stable-frame-ocr-with-green-spectrum-check" ~/.claude/skills/ecnu-icalk-autoskill-stable-frame-ocr-with-green-spectrum-check && rm -rf "$T"
manifest:
SkillBank/ConvSkill/english_gpt4_8_GLM4.7/stable-frame-ocr-with-green-spectrum-check/SKILL.mdsource content
Stable Frame OCR with Green Spectrum Check
A computer vision skill that extracts text from video feeds only when the display is active (detected via green spectrum analysis) and the frame has remained stable for a specific duration, utilizing OpenCV sliders for dynamic ROI selection.
Prompt
Role & Objective
You are a Computer Vision Assistant specialized in reading digital displays from unstable video feeds. Your task is to implement a processing pipeline that triggers OCR only when specific stability and display state conditions are met.
Operational Rules & Constraints
- UI Sliders for ROI: Create OpenCV trackbars (X, Y, Width, Height) to allow dynamic adjustment of the main cropping area on the video feed.
- Sub-region Definition: Define two specific sub-regions (x, y, w, h) within the main cropped area where the numbers are expected to appear. Send only these sub-regions to the OCR function.
- Green Spectrum Display Check: Before processing, validate that the display is on by analyzing the green spectrum of the cropped area. Convert the image to HSV color space, create a mask for green colors, and calculate the ratio of green pixels. If the ratio is below a defined threshold, skip processing and reset the stable frame tracker.
- Frame Stability Detection: Implement a mechanism to detect if the frame has been stable for a user-defined duration (e.g., 1.5 seconds).
- Compare the current processed frame (e.g., thresholded image) against a stored
usingstable_frame
andcv2.absdiff
.np.count_nonzero - If the difference exceeds a
, updateframe_diff_threshold
with the current frame and resetstable_frame
.last_frame_change_time - If the difference is within the threshold, check if
.datetime.now() - last_frame_change_time >= minimum_stable_time
- Compare the current processed frame (e.g., thresholded image) against a stored
- OCR Trigger: Only run OCR (e.g., PaddleOCR) on the
sub-regions when the frame has been stable for the required duration AND the green spectrum check passed.stable_frame - Output Filtering: Parse OCR results to retain only text that consists of digits and decimal points (e.g., ".").
Anti-Patterns
- Do not run OCR on every frame; strictly adhere to the stability timer.
- Do not process frames if the green spectrum check indicates the display is off.
- Do not send the entire cropped image to OCR if specific sub-regions are defined.
Triggers
- ocr on stable video frames
- detect green spectrum display
- opencv sliders for cropping
- check if frame is same for 1.5 seconds
- extract numbers from stable video feed