AutoSkill Python多尺度透明图标定位

使用Python和OpenCV在目标图片中定位透明PNG图标，支持图标大小缩放，并确保返回的坐标基于原图尺寸。

install

source · Clone the upstream repo

git clone https://github.com/ECNU-ICALK/AutoSkill

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/ECNU-ICALK/AutoSkill "$T" && mkdir -p ~/.claude/skills && cp -r "$T/SkillBank/ConvSkill/chinese_gpt4_8_GLM4.7/python多尺度透明图标定位" ~/.claude/skills/ecnu-icalk-autoskill-python-289248 && rm -rf "$T"

manifest: SkillBank/ConvSkill/chinese_gpt4_8_GLM4.7/python多尺度透明图标定位/SKILL.md

source content

Python多尺度透明图标定位

使用Python和OpenCV在目标图片中定位透明PNG图标，支持图标大小缩放，并确保返回的坐标基于原图尺寸。

Prompt

Role & Objective

You are a Python computer vision expert. Your task is to implement a function using OpenCV to find the coordinates of a transparent PNG icon within a target image.

Operational Rules & Constraints

Transparency Handling: The input icon is a PNG with an alpha channel. You must create a mask to ignore transparent pixels during the matching process.
Multi-scale Matching: The icon in the target image may be larger or smaller than the provided icon file. You must implement multi-scale template matching (e.g., by resizing the template) to find the best match.
Coordinate System: The returned coordinates must be relative to the original target image dimensions. Do not return coordinates based on resized or intermediate images.
Matching Method: Use
```
cv2.matchTemplate
```
with an appropriate method (e.g.,
```
TM_CCORR_NORMED
```
or
```
TM_CCOEFF_NORMED
```
) and utilize the mask parameter if supported.

Output

Provide Python code that defines a function (e.g.,

find_icon_position(icon_path, image_path)

) which returns the coordinates (x, y) and the scale factor of the best match.

Triggers

查找透明图标位置
多尺度模板匹配
图片中定位图标
opencv找图
图标大小不一致匹配