Claude-skill-registry k8s
Kubernetes ops skill for deploying, operating, and troubleshooting services on Kubernetes. Use for tasks like writing manifests/Helm, configuring deployments/services/ingress, autoscaling, observability, RBAC, secrets/configmaps, rollout/rollback, incident debugging, and production readiness checks.
install
source · Clone the upstream repo
git clone https://github.com/majiayu000/claude-skill-registry
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/majiayu000/claude-skill-registry "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/data/k8s" ~/.claude/skills/majiayu000-claude-skill-registry-k8s && rm -rf "$T"
manifest:
skills/data/k8s/SKILL.mdsource content
k8s
Use this skill for Kubernetes 运维与发布相关工作。
Defaults / assumptions to confirm
- Cluster type: managed (EKS/GKE/ACK) vs self-hosted
- Packaging: raw YAML vs Helm vs Kustomize
- Ingress: NGINX/ALB/APISIX/Istio
- Observability stack: Prometheus/Grafana, Loki/ELK, tracing
Workflow
- Understand service requirements
- Ports, protocols, health checks, resources (CPU/mem), storage needs.
- SLOs: latency, availability, RPO/RTO.
- Dependencies: DB, cache, MQ, external APIs.
- Deployment design
- Use
for stateless;Deployment
for stable identities/storage.StatefulSet - Define
andreadinessProbe
(andlivenessProbe
if needed).startupProbe - Set
and choose appropriate QoS.resources.requests/limits - Use
for availability during maintenance.PodDisruptionBudget
- Config & secrets
- Config:
(non-sensitive), mounted or env.ConfigMap - Secrets:
(sensitive) + external secret manager if available.Secret - Never commit plaintext secrets; prefer sealed/external secrets.
- Networking
types and DNS.Service
/Gateway routing, TLS termination, timeouts.Ingress- NetworkPolicy if cluster enforces it.
- Scaling & resilience
based on CPU/memory/custom metrics.HPA- Graceful shutdown (
, terminationGracePeriodSeconds).preStop - Retry/backoff at client; avoid retry storms.
- Observability
- Standard logs with correlation IDs.
- Metrics: RPS, p95 latency, error rate, saturation.
- Alerts and dashboards; runbook links.
- Release operations
- Rolling updates, canary/blue-green if needed.
+ rollback plan.kubectl rollout status- Post-deploy verification checks and smoke tests.
- Troubleshooting checklist
pods, events, andkubectl get/describe
.logs- Check probes, image pull, env/config, DNS, network, and resource throttling.
- For performance: node pressure, HPA behavior, GC/heap, connection pool limits.
Output expectations when making changes
- Provide manifests (or Helm values/templates) + brief deployment notes.
- Include resource sizing rationale and probe settings.
- Include rollback instructions and verification steps.