Marketplace

cloudflare-workers-ai

Run LLMs and AI models on Cloudflare's global GPU network with Workers AI. Includes Llama 4, Gemma 3, Mistral 3.1,Flux image generation, BGE embeddings (2x faster, 2025), streaming support, and AI Gateway for cost tracking.Use when: implementing LLM inference, generating images, building RAG with embeddings, streaming AI responses,using AI Gateway, troubleshooting max_tokens defaults (breaking change 2025), BGE pooling parameter (not backwardscompatible), or handling AI_ERROR, rate limits, model deprecations, token limits.Keywords: workers ai, cloudflare ai, ai bindings, llm workers, @cf/meta/llama-4-scout, @cf/google/gemma-3-12b-it,@cf/mistralai/mistral-small-3.1-24b-instruct, @cf/openai/gpt-oss-120b, workers ai models, ai inference,cloudflare llm, ai streaming, text generation ai, ai embeddings, bge pooling cls mean, image generation ai,workers ai rag, ai gateway, llama workers, flux image generation, deepgram aura, leonardo image generation,vision models ai, ai chat completion, AI_ERROR, rate li

$ インストール

git clone https://github.com/jezweb/claude-skills /tmp/claude-skills && cp -r /tmp/claude-skills/skills/cloudflare-workers-ai ~/.claude/skills/claude-skills

// tip: Run this command in your terminal to install the skill