數據工程
525 skills in 數據與 AI > 數據工程
n8n-automation
n8n workflow automation for building analytics including SkySpark multi-agent systems, FastAPI tool servers, workflow orchestration, and automated building system alert triage
data-engineering
ETL pipelines, Apache Spark, data warehousing, and big data processing. Use for building data pipelines, processing large datasets, or data infrastructure.
aoc-orchestrator
Main coordinator for the automated Advent of Code workflow. Orchestrates puzzle fetching, TDD solving, and submission for daily AoC challenges. Use when running the full automated solving pipeline or when user requests to solve an AoC day.
devops
DevOps essentials: Docker, CI/CD pipelines, deployment strategies.Use when: containerizing apps, setting up CI/CD, deploying to production.Triggers: "docker", "dockerfile", "ci/cd", "github actions", "deploy","kubernetes", "compose", "container", "pipeline".
optaic-v0-migration
Guide for porting code from optaic-v0 to optaic-trading. Use when migrating DataAPI, pipelines, stores, accessors, operators, or expressions into the Resource/Activity architecture. Covers pattern mappings for permission checks, audit trails, and catalog lookups.
pyspark-test-generator
Generate comprehensive PySpark-based data quality validation tests for Databricks tables. Use when creating automated tests for data completeness, accuracy, consistency, and conformity, or when user mentions test generation, data validation, quality monitoring, or PySpark test frameworks.
cdp-lite-adapter
Adapt the User Explorer (CDP-lite) prototype to new data sources by mapping input exports to the build pipeline, updating adapters/enrichments, and preserving output JSON schemas for the UI. Use when asked to connect real data, replace inputs, or modify the build outputs/UI contracts.
nextjs-blog-netlify
Next.js blog with visual editing and Tailwind CSS for Netlify.
plan-with-research
Build step-by-step plans grounded in evidence and research. Use when the user asks for a plan, roadmap, steps, or strategy for code, data, or pipeline work.
github-workflow-automation
Advanced GitHub Actions workflow automation with AI swarm coordination, intelligent CI/CD pipelines, and comprehensive repository management
tanstack-chat-netlify
Modern chat app with TanStack Router and Claude AI for Netlify.
publishing-astro-websites
Comprehensive guidance for building and deploying static websites with the Astro framework.This skill should be used when asked to "create astro site", "deploy astro to firebase","set up content collections", "add mermaid diagrams to astro", "configure astro i18n","build static blog", or "astro markdown setup". Covers SSG fundamentals, Content Collections,Markdown/MDX, partial hydration, islands architecture, and deployment to Netlify, Vercel,GitHub Pages, or GCP/Firebase.
devops-cloud
Master DevOps, cloud infrastructure, containerization, and Kubernetes. Learn Docker, Terraform, AWS, CI/CD pipelines, monitoring, and production infrastructure management.
blog-master-orchestrator
Central coordinator for blog writing workflow with multi-agent execution. USE WHEN user says 'write a blog post', 'create blog content', 'start blog workflow', OR user wants to orchestrate the full blog writing pipeline.
test-ci-pipeline
CI/CD pipeline configuration for monorepo testing including GitHub Actions workflows, Turborepo integration, two-tier quality gates (typecheck→lint→test→build), preview branch integration, retry logic, and environment management. Use when configuring CI pipelines, troubleshooting CI test failures, implementing preview deployments, or optimizing CI performance. Triggers on: CI configuration, github actions testing, CI pipeline, turborepo CI, quality gates, preview integration, CI troubleshooting.
testing-patterns
pytest fixtures and integration testing patterns for Spark applications, including DataFrame assertions and mock data generation.
chunking-strategies
Document chunking strategies for RAG systems. Use when implementing document processing pipelines to determine optimal chunking approaches based on document type and retrieval requirements.
archaeology-discussion
고고학 발굴조사보고서 고찰 자동 작성 파이프라인. "고찰 작성해줘" 한 번의 명령으로 완료된 보고서+논문+주변유적 보고서를 분석하여 문화재청 표준양식 고찰을 생성. 폴더1(주보고서), 폴더2(논문), 폴더3(비교유적)을 자동 분석. Use for automated archaeological excavation report discussion writing pipeline: analyzes main report + papers + comparison sites to generate discussion following Korean Cultural Heritage Administration standards. One command processes all folders.
workflow-orchestration
Standard agent pipelines for audit, coding, new project, refactor, and simple workflows. Defines 5 workflow types with specific agent sequences (AUDIT: BA→PM→Workers→Reviewer→PM, CODING: Architect→PM→Workers→Validator→Reviewer→PM, NEW_PROJECT and REFACTOR follow coding pipeline, SIMPLE: direct processing). Includes agent contracts, workflow detection logic, and orchestration best practices. Use when /ms command needs to determine workflow type and coordinate multi-agent execution.
image-processor-guidelines
Development guidelines for Quantum Skincare's Python FastAPI image processor microservice. Covers FastAPI patterns, Perfect Corp API integration, MediaPipe FaceMesh validation, correlation headers, access control (CIDR + X-Internal-Secret), error handling, Pydantic models, structured logging, mock mode, provider normalization, and testing strategies. Use when working with image-processor code, routes, validation pipeline, Perfect Corp integration, or Python/FastAPI patterns.