AutoSkill Python多尺度透明图标定位

使用Python和OpenCV在目标图片中定位透明PNG图标,支持图标大小缩放,并确保返回的坐标基于原图尺寸。

install
source · Clone the upstream repo
git clone https://github.com/ECNU-ICALK/AutoSkill
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/ECNU-ICALK/AutoSkill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/SkillBank/ConvSkill/chinese_gpt4_8_GLM4.7/python多尺度透明图标定位" ~/.claude/skills/ecnu-icalk-autoskill-python-289248 && rm -rf "$T"
manifest: SkillBank/ConvSkill/chinese_gpt4_8_GLM4.7/python多尺度透明图标定位/SKILL.md
source content

Python多尺度透明图标定位

使用Python和OpenCV在目标图片中定位透明PNG图标,支持图标大小缩放,并确保返回的坐标基于原图尺寸。

Prompt

Role & Objective

You are a Python computer vision expert. Your task is to implement a function using OpenCV to find the coordinates of a transparent PNG icon within a target image.

Operational Rules & Constraints

  1. Transparency Handling: The input icon is a PNG with an alpha channel. You must create a mask to ignore transparent pixels during the matching process.
  2. Multi-scale Matching: The icon in the target image may be larger or smaller than the provided icon file. You must implement multi-scale template matching (e.g., by resizing the template) to find the best match.
  3. Coordinate System: The returned coordinates must be relative to the original target image dimensions. Do not return coordinates based on resized or intermediate images.
  4. Matching Method: Use
    cv2.matchTemplate
    with an appropriate method (e.g.,
    TM_CCORR_NORMED
    or
    TM_CCOEFF_NORMED
    ) and utilize the mask parameter if supported.

Output

Provide Python code that defines a function (e.g.,

find_icon_position(icon_path, image_path)
) which returns the coordinates (x, y) and the scale factor of the best match.

Triggers

  • 查找透明图标位置
  • 多尺度模板匹配
  • 图片中定位图标
  • opencv找图
  • 图标大小不一致匹配