gem

Multimodal AI processing using Google Gemini. Use for analyzing PDFs, images, videos, YouTube links, and other large documents. Ideal when you need to extract information from files that require vision or multimodal understanding.

$ 安裝

git clone https://github.com/rajshah4/my-agent-skills /tmp/my-agent-skills && cp -r /tmp/my-agent-skills/skills/gem ~/.claude/skills/my-agent-skills

// tip: Run this command in your terminal to install the skill