Data Engineering
525 skills in Data & AI > Data Engineering
model-requirements
Use this skill when gathering requirements for new BigQuery data models OR when asked to edit existing queries in bqetl. For new models, guides structured requirements interviews. For existing queries, understands current model, checks downstream dependencies, and gathers requirements for changes. Works as pre-planning before query-writer skill.
bigquery-etl-core
The core skill for working within the bigquery-etl repository. Use this skill when understanding project structure, conventions, and common patterns. Works with model-requirements, query-writer, metadata-manager, sql-test-generator, and bigconfig-generator skills.
pipeline
Run dlt pipelines and Temporal workflows for SignalRoom. Use when syncing data sources, triggering workflows, checking pipeline status, or debugging data ingestion issues.
airflow-workflows
Apache Airflow DAG design, operators, and scheduling best practices.
pipeline-errors
Fetch and analyze GitHub Actions pipeline errors. Use when the user wants to debug CI/CD failures, check workflow runs, or troubleshoot test failures.
test-ci-pipeline
CI/CD pipeline configuration for monorepo testing including GitHub Actions workflows, Turborepo integration, two-tier quality gates (typecheckâlintâtestâbuild), preview branch integration, retry logic, and environment management. Use when configuring CI pipelines, troubleshooting CI test failures, implementing preview deployments, or optimizing CI performance. Triggers on: CI configuration, github actions testing, CI pipeline, turborepo CI, quality gates, preview integration, CI troubleshooting.
backend-development
Build robust backend systems with modern technologies (Node.js, Python, Go, Rust), frameworks (NestJS, FastAPI, Django), databases (PostgreSQL, MongoDB, Redis), APIs (REST, GraphQL, gRPC), authentication (OAuth 2.1, JWT), testing strategies, security best practices (OWASP Top 10), performance optimization, scalability patterns (microservices, caching, sharding), DevOps practices (Docker, Kubernetes, CI/CD), and monitoring. Use when designing APIs, implementing authentication, optimizing database queries, setting up CI/CD pipelines, handling security vulnerabilities, building microservices, or developing production-ready backend systems.
ontology-phase-2-analyze
Phase 2 of Ontology Builder Pipeline. AI acts as domain SME to analyze raw inputs,extract entities/workflows/rules, fill knowledge gaps using market expertise.Use after Phase 1 ingestion is complete.
lead-hunter
Self-improving AI system for distressed property lead generation. Monitors performance, spawns specialized skills to fix bottlenecks, runs A/B tests, and continuously optimizes lead conversion. Use when building or optimizing lead generation workflows, analyzing pipeline metrics, or creating automated lead intelligence systems.
devsecops-lookup
Looks up OWASP DevSecOps Guideline phases, security tools, and pipeline checks. Returns tool configurations, CWE mappings, and integration patterns for CI/CD security. Use when user asks about "DevSecOps", "SAST", "DAST", "SCA", "container security", "IaC security", "secret detection", "gitleaks", "semgrep", "trivy", "pipeline security", "ă·ăŒăŻăŹăăæ€ćș", "éçè§Łæ", "ćçè§Łæ", "ăłăłăăă»ăă„ăȘăăŁ", "ă»ăă„ăȘăăŁăČăŒă".
generation
Generates answers by synthesizing information from retrieved documents with citations. Use as final RAG pipeline step, when creating answers from multiple sources, or when the user mentions answer generation or synthesis.
databricks-notebook-manager
Create and manage Databricks notebooks programmatically. Use when generating ingestion code, creating ETL notebooks, executing Databricks workflows, or when user mentions notebook creation, job automation, or data pipeline implementation in Databricks. Handles notebook generation, execution, and results retrieval.
effect-foundations
Core Effect foundations and style for a coding agent. Use when starting an Effect task, choosing operators, or structuring a small pipeline.
solidjs-solidstart-expert
Expert-level SolidJS and SolidStart development skill with 20+ years senior/lead engineer mindset. Comprehensive guidance for building production-ready, scalable web applications with fine-grained reactivity. Use when Claude needs to: (1) Create new SolidJS/SolidStart projects, (2) Implement TanStack Query/Router/Table/Form integration, (3) Build reactive components with signals/stores/resources, (4) Handle SSR/SSG/streaming with SolidStart, (5) Implement authentication and API routes, (6) Optimize bundle size and performance, (7) Debug reactivity issues and memory leaks, (8) Structure large-scale applications, (9) Implement type-safe patterns with TypeScript, (10) Handle error boundaries and suspense, (11) Build accessible UI components, (12) Deploy to Vercel/Netlify/Cloudflare. Triggers: "solid", "solidjs", "solidstart", "createSignal", "createStore", "createResource", "tanstack solid", "vinxi", "fine-grained reactivity".
bio-seq
Read/write FASTA, GenBank, FASTQ files. Sequence manipulation (complement, translate). Indexed random access via faidx. For NGS pipelines (SAM/BAM/VCF), use pysam. For BLAST, use gget or blat-integration.
eu-grant-hunter
Scans EU funding databases (Horizon Europe, ERDF, Digital Europe,Innovation Fund) to identify grant opportunities matching UBOScapabilities. Tracks deadlines with multi-level reminders (90/60/30/7days), scores opportunities by fit (0-5), and generates opportunitybriefs. Use when discussing EU grants, funding opportunities, or whentracking the âŹ70M+ pipeline. Automatically runs daily at 09:00 UTC.
clojure-core-async-flow
core.async.flow provides a declarative framework for building process flows with strict separation of application logic from topology, execution, and lifecycle management. Use when building multi-process systems, data pipelines, or when you need coordinated lifecycle management and error handling across communicating processes.
data-pipeline
GenStage, Broadway, and Flow for Elixir data pipelines
moai-domain-devops
CI/CD pipelines, Docker 27, Kubernetes 1.32, and infrastructure as code.
github-actions-deploy
Setup GitHub Actions deployment workflows for various project types. Use when creating CI/CD pipelines, automated deployments to servers (SSH/SFTP), Vercel, Netlify, or container registries. Includes templates for WordPress, Laravel, Node.js, and static sites.