cloudflare-workers-ai

Run LLMs and AI models on Cloudflare's global GPU network with Workers AI. Includes Llama, Flux image generation,BGE embeddings, and streaming support with AI Gateway for caching and logging.Use when: implementing LLM inference, generating images with Flux/Stable Diffusion, building RAG with embeddings,streaming AI responses, using AI Gateway for cost tracking, or troubleshooting AI_ERROR, rate limits, model notfound, token limits, or neurons exceeded.Keywords: workers ai, cloudflare ai, ai bindings, llm workers, @cf/meta/llama, workers ai models,ai inference, cloudflare llm, ai streaming, text generation ai, ai embeddings, image generation ai,workers ai rag, ai gateway, llama workers, flux image generation, stable diffusion workers,vision models ai, ai chat completion, AI_ERROR, rate limit ai, model not found, token limit exceeded,neurons exceeded, ai quota exceeded, streaming failed, model unavailable, workers ai hono,ai gateway workers, vercel ai sdk workers, openai compatible workers, workers

$ Installer

git clone https://github.com/ovachiever/droid-tings /tmp/droid-tings && cp -r /tmp/droid-tings/skills/cloudflare-workers-ai ~/.claude/skills/droid-tings

// tip: Run this command in your terminal to install the skill