azure-image-gen

0 stars

0 forks

Python

38 views

SKILL.md

Image Generation & Editing Skill (Azure OpenAI)

This skill allows the agent to generate new images from text prompts or edit existing images using reference files via Azure OpenAI's GPT-Image-1.5 model.

Configuration

Script: gen.py
Environment: /home/thomas/.env.azure (Required for API credentials)
Output: Images are saved to the ./out/ directory with a timestamped filename.

Usage

1. Generate New Image

Use this mode when the user wants to create an image from scratch. Syntax: python3 gen.py "<detailed_prompt>"

2. Edit Existing Image

Use this mode when a user provides one or more reference images and requests changes (e.g., changing colors, adding objects, or altering style). Syntax: python3 gen.py "<edit_instructions>" --edit --ref <path_to_image>

Parameters

Parameter	Type	Description
`prompt`	String	(Positional) The visual description or edit instruction.
`--edit`	Flag	Required to trigger image-to-image/editing mode.
`--ref`	Path(s)	One or more paths to the images to be used as a reference.

Examples

Create a new image: python3 gen.py "A professional headshot of a solution consultant in a modern office, cinematic lighting"
Modify an existing image: python3 gen.py "Change the suit color to charcoal gray and add a laptop on the desk" --edit --ref ./out/20260208_120000.png

Notes for the Agent

Fidelity: The skill is hardcoded to "high" fidelity to ensure identity and structural persistence during edits.
Image Formats: Input images must be in PNG format.
Output: The tool returns a JSON object containing the path to the generated image. Always display this path or the image to the user.

Installation

Option 1: Use slash command in Claude Code

/install-skill https://github.com/neojarvis0704/azure-image-gen

Option 2: Clone to skills directory

# Global (all projects)

git clone https://github.com/neojarvis0704/azure-image-gen ~/.claude/skills/azure-image-gen

# Project-specific

git clone https://github.com/neojarvis0704/azure-image-gen .claude/skills/azure-image-gen

Add MCP server to .cursor/mcp.json:

{
  "mcpServers": {
    "skillz": {
      "command": "npx",
      "args": ["-y", "skillz-mcp", "https://github.com/neojarvis0704/azure-image-gen"]
    }
  }
}

Restart Cursor after adding the configuration.

Option 1: Use Gemini CLI command

gemini extensions install https://github.com/neojarvis0704/azure-image-gen

Option 2: Clone to extensions directory

git clone https://github.com/neojarvis0704/azure-image-gen ~/.gemini/extensions/azure-image-gen

Topics

Related Skills

xlsx

Public repository for Agent Skills

skill-writer

Tensors and Dynamic neural networks in Python with strong GPU acceleration

youtube-downloader

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

agno

Build, run, manage agentic software at scale.