Azure-image-gen azure-image-gen

Image Generation & Editing Skill (Azure OpenAI)

install

source · Clone the upstream repo

git clone https://github.com/neojarvis0704/azure-image-gen

Claude Code · Install into ~/.claude/skills/

git clone --depth=1 https://github.com/neojarvis0704/azure-image-gen ~/.claude/skills/neojarvis0704-azure-image-gen-azure-image-gen-d3eea1

manifest: SKILL.md

source content

Image Generation & Editing Skill (Azure OpenAI)

This skill allows the agent to generate new images from text prompts or edit existing images using reference files via Azure OpenAI's GPT-Image-1.5 model.

Configuration

Script:
```
gen.py
```
Environment:
```
/home/thomas/.env.azure
```
(Required for API credentials)
Output: Images are saved to the
```
./out/
```
directory with a timestamped filename.

Usage

1. Generate New Image

Use this mode when the user wants to create an image from scratch. Syntax:

python3 gen.py "<detailed_prompt>"

2. Edit Existing Image

Use this mode when a user provides one or more reference images and requests changes (e.g., changing colors, adding objects, or altering style). Syntax:

python3 gen.py "<edit_instructions>" --edit --ref <path_to_image>

Parameters

Parameter	Type	Description
`prompt`	String	(Positional) The visual description or edit instruction.
`--edit`	Flag	Required to trigger image-to-image/editing mode.
`--ref`	Path(s)	One or more paths to the images to be used as a reference.

Examples

Create a new image:

python3 gen.py "A professional headshot of a solution consultant in a modern office, cinematic lighting"

Modify an existing image:

python3 gen.py "Change the suit color to charcoal gray and add a laptop on the desk" --edit --ref ./out/20260208_120000.png

Notes for the Agent

Fidelity: The skill is hardcoded to "high" fidelity to ensure identity and structural persistence during edits.
Image Formats: Input images must be in PNG format.
Output: The tool returns a JSON object containing the
```
path
```
to the generated image. Always display this path or the image to the user.