Azure Monitor Baseline Alerts
Download AlertsGlossaryGitHubGitHub IssuesToggle Dark/Light/Auto modeToggle Dark/Light/Auto modeToggle Dark/Light/Auto modeBack to homepage

Artificial Intelligence

Overview

There are numerous ways to implement AI solution on Azure, and each comes with its own monitoring solution. Monitoring AI solutions involves a combination of the infra or paas resources, along with monitoring any utilization metrics that can be exposed through the platform or other tooling. This page will summarize the recommended monitoring solutions for different scenarios.

AI on Azure Platforms (PaaS)

Common AI Ready infrastructures on Azure may contain services such as Azure AI Hub, Azure AI Services (including Azure OpenAI) and AI Search. Specific workloads like Azure Kubernetes services, API Management and App Services are also frequently used to build enterprise-level AI applications. The table below provides quick links to alert guidelines for the most commonly used services. For other Azure services in your architecture, please refer to the Azure Resource, which offers comprehensive lists.

ServicesResource Type
Azure AI Studio Hub/Azure Machine LearningMicrosoft.MachineLearningServices/workspaces
Azure AI SearchMicrosoft.Search/searchServices
Azure AI ServicesMicrosoft.CognitiveServices/accounts
Azure Kubernetes servicesMicrosoft.ContainerService/managedClusters
Azure App ServicesMicrosoft.Web/sites
Azure API ManagementMicrosoft.ApiManagement/service
Azure Container AppsMicrosoft.App/containerApps
Azure Functions AppsMicrosoft.Web/sites
Azure Cosmos DBMicrosoft.DocumentDB/databaseAccounts
Azure SQL Database - managedInstancesMicrosoft.Sql/managedInstances
Azure SQL Database - serverMicrosoft.Sql/servers/databases
Azure Database for MySQL - flexibleServersMicrosoft.DBforMySQL/flexibleServers
Azure Database for MySQL - serversMicrosoft.DBforMySQL/servers
Azure Database for PostgreSQL - flexibleServersMicrosoft.DBforPostgreSQL/flexibleServers
Azure Database for PostgreSQL - serversMicrosoft.DBforPostgreSQL/servers

AI on Infrastructure (IaaS)

Running AI workloads on Azure infrastructure involves monitoring each of the components of the solution, including virtual machines, storage, and networking. Refer to the defined metrics in HPC. For monitoring the GPU/CPU metrics, use Moneo

AI Specialized Workload Patterns

GPT-RAG (coming soon)