analyzing-media

Analyzes PDFs, images, screenshots, diagrams, and documents using Gemini multimodal. Extracts text, tables, forms; interprets visuals, architecture diagrams, flowcharts, ERDs. Use when user mentions PDFs, images, screenshots, document extraction, OCR, visual analysis, diagram interpretation, or form processing. Do not use for web searching or shell commands.

$ Instalar

git clone https://github.com/alexei-led/claude-code-config /tmp/claude-code-config && cp -r /tmp/claude-code-config/skills/analyzing-media ~/.claude/skills/claude-code-config

// tip: Run this command in your terminal to install the skill