Agent-Skills azure-reliability

Expert knowledge for Azure Reliability development including best practices, decision making, architecture & design patterns, limits & quotas, and deployment. Use when designing zone/multi-region apps, AZ-enabled MySQL, resilient Functions, AKS/DB HA, or Queue size limits, and other Azure Reliability related development tasks. Not for Azure Resiliency (use azure-resiliency), Azure Monitor (use azure-monitor), Azure Service Health (use azure-service-health), Chaos Studio (use azure-chaos-studio).

install
source · Clone the upstream repo
git clone https://github.com/MicrosoftDocs/Agent-Skills
Claude Code · Install into ~/.claude/skills/
T=$(mktemp -d) && git clone --depth=1 https://github.com/MicrosoftDocs/Agent-Skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/azure-reliability" ~/.claude/skills/microsoftdocs-agent-skills-azure-reliability && rm -rf "$T"
manifest: skills/azure-reliability/SKILL.md
source content

Azure Reliability Skill

This skill provides expert guidance for Azure Reliability. Covers best practices, decision making, architecture & design patterns, limits & quotas, and deployment. It combines local quick-reference content with remote documentation fetching capabilities.

How to Use This Skill

IMPORTANT for Agent: Use the Category Index below to locate relevant sections. For categories with line ranges (e.g.,

L35-L120
), use
read_file
with the specified lines. For categories with file links (e.g.,
[security.md](security.md)
), use
read_file
on the linked reference file

IMPORTANT for Agent: If

metadata.generated_at
is more than 3 months old, suggest the user pull the latest version from the repository. If
mcp_microsoftdocs
tools are not available, suggest the user install it: Installation Guide

This skill requires network access to fetch documentation content:

  • Preferred: Use
    mcp_microsoftdocs:microsoft_docs_fetch
    with query string
    from=learn-agent-skill
    . Returns Markdown.
  • Fallback: Use
    fetch_webpage
    with query string
    from=learn-agent-skill&accept=text/markdown
    . Returns Markdown.

Category Index

CategoryLinesDescription
Best PracticesL33-L68Patterns and checklists for designing, configuring, and hardening high‑availability, resilient architectures for specific Azure services (AKS, DBs, messaging, networking, monitoring, DR).
Decision MakingL69-L73Guidance on using availability zones, nonregional services, and resilient Azure Functions architectures to design highly available, fault-tolerant Azure solutions.
Architecture & Design PatternsL74-L80Designing Azure apps for high availability using zones and multi-region patterns, including planning zone-resilient workloads, hardening zonal deployments, and building in nonpaired regions.
Limits & QuotasL81-L85Details on Azure Queue Storage message size limits, including max message size, behavior when limits are exceeded, and best practices for handling large payloads.
DeploymentL86-L89Guidance on deploying Azure services and MySQL Flexible Server with availability zones, including configuring zone-redundant high availability and migration to zone-resilient setups.

Best Practices

TopicURL
Design resilient clusters in Azure Kubernetes Servicehttps://learn.microsoft.com/en-us/azure/reliability/reliability-aks
Configure reliability for Azure API Centerhttps://learn.microsoft.com/en-us/azure/reliability/reliability-api-center
Harden Azure App Service Environment reliabilityhttps://learn.microsoft.com/en-us/azure/reliability/reliability-app-service-environment
Architect highly available Azure Application Gateway v2https://learn.microsoft.com/en-us/azure/reliability/reliability-application-gateway-v2
Plan reliability for Azure Bot Servicehttps://learn.microsoft.com/en-us/azure/reliability/reliability-bot
Configure reliability for Azure Chaos Studiohttps://learn.microsoft.com/en-us/azure/reliability/reliability-chaos-studio
Achieve high availability in Azure Cosmos DB NoSQLhttps://learn.microsoft.com/en-us/azure/reliability/reliability-cosmos-db-nosql
Design resilient Azure Data Explorer deploymentshttps://learn.microsoft.com/en-us/azure/reliability/reliability-data-explorer
Harden Azure Data Factory for outageshttps://learn.microsoft.com/en-us/azure/reliability/reliability-data-factory
Harden Azure Database for MySQL for high availabilityhttps://learn.microsoft.com/en-us/azure/reliability/reliability-database-mysql
Design resilient Azure Database for MySQL deploymentshttps://learn.microsoft.com/en-us/azure/reliability/reliability-database-mysql
Implement high availability for Azure Database for PostgreSQLhttps://learn.microsoft.com/en-us/azure/reliability/reliability-database-postgresql
Implement resilient architectures in Azure Databrickshttps://learn.microsoft.com/en-us/azure/reliability/reliability-databricks
Ensure reliability for Azure Device Registry metadatahttps://learn.microsoft.com/en-us/azure/reliability/reliability-device-registry
Design high availability for Azure DocumentDBhttps://learn.microsoft.com/en-us/azure/reliability/reliability-documentdb
Build resilient architectures with Azure Event Gridhttps://learn.microsoft.com/en-us/azure/reliability/reliability-event-grid
Increase reliability of Azure Event Hubs streaminghttps://learn.microsoft.com/en-us/azure/reliability/reliability-event-hubs
Design reliable analytics with Microsoft Fabrichttps://learn.microsoft.com/en-us/azure/reliability/reliability-fabric
Implement resilient architectures with Azure Functionshttps://learn.microsoft.com/en-us/azure/reliability/reliability-functions
Implement resilient architectures with Azure Functionshttps://learn.microsoft.com/en-us/azure/reliability/reliability-functions
Implement disaster recovery for Azure Image Builderhttps://learn.microsoft.com/en-us/azure/reliability/reliability-image-builder
Design resilient device connectivity with Azure IoT Hubhttps://learn.microsoft.com/en-us/azure/reliability/reliability-iot-hub
Design resilient architectures with Azure Load Balancerhttps://learn.microsoft.com/en-us/azure/reliability/reliability-load-balancer
Design resilient architectures with Azure Load Balancerhttps://learn.microsoft.com/en-us/azure/reliability/reliability-load-balancer
Design resilient workflows with Azure Logic Appshttps://learn.microsoft.com/en-us/azure/reliability/reliability-logic-apps
Increase reliability of Azure Managed Redis cacheshttps://learn.microsoft.com/en-us/azure/reliability/reliability-managed-redis
Implement resilient logging with Azure Monitor Logshttps://learn.microsoft.com/en-us/azure/reliability/reliability-monitor-logs
Improve reliability of Azure Notification Hubshttps://learn.microsoft.com/en-us/azure/reliability/reliability-notification-hubs
Design resilient disaster recovery with Azure Site Recoveryhttps://learn.microsoft.com/en-us/azure/reliability/reliability-site-recovery
Implement resilient architectures in Azure SQL Databasehttps://learn.microsoft.com/en-us/azure/reliability/reliability-sql-database
Increase reliability of Azure Stream Analytics jobshttps://learn.microsoft.com/en-us/azure/reliability/reliability-stream-analytics
Design resilient workloads on Azure VMware Solutionhttps://learn.microsoft.com/en-us/azure/reliability/reliability-vmware-solution

Decision Making

TopicURL
Select and understand Azure nonregional serviceshttps://learn.microsoft.com/en-us/azure/reliability/regions-nonregional-services

Architecture & Design Patterns

TopicURL
Enable and plan zone-resilient Azure workloadshttps://learn.microsoft.com/en-us/azure/reliability/availability-zones-enable-zone-resiliency
Design and harden zonal Azure resource deploymentshttps://learn.microsoft.com/en-us/azure/reliability/availability-zones-zonal-resource-resiliency
Design multi-region solutions in nonpaired Azure regionshttps://learn.microsoft.com/en-us/azure/reliability/regions-multi-region-nonpaired

Limits & Quotas

TopicURL
Understand Azure Queue Storage message size limitshttps://learn.microsoft.com/en-us/azure/reliability/reliability-storage-queue

Deployment

TopicURL
Use Azure services with availability zone supporthttps://learn.microsoft.com/en-us/azure/reliability/availability-zones-service-support