Azure Monitor Baseline Alerts

GPT-RAG

Overview

The Retrieval-Augmented Generation (RAG) pattern is an industry-standard approach to building applications that use large language models to reason over specific or proprietary data that is not already known to the large language model.
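
As a rough illustration of the pattern, the sketch below retrieves the most relevant passage from a small private corpus and places it ahead of the question in a prompt. It is a toy under stated assumptions: the function names `retrieve` and `build_prompt`, the sample documents, and the word-overlap scoring are all hypothetical stand-ins (a real deployment would typically use a search index and send the prompt to a language model, neither of which is shown here).

```python
def retrieve(question, documents, top_k=1):
    """Rank documents by shared words with the question (a stand-in for a search index)."""
    question_words = set(question.lower().split())
    ranked = sorted(
        documents,
        key=lambda doc: len(question_words & set(doc.lower().split())),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(question, documents, top_k=1):
    """Put the retrieved passages ahead of the question so a model can ground its answer."""
    context = "\n".join(retrieve(question, documents, top_k))
    return f"Context:\n{context}\n\nQuestion: {question}"


# Hypothetical proprietary data that a general-purpose model would not know.
docs = [
    "The on-call rotation for the billing service changes every Monday.",
    "Expense reports are due on the fifth day of each month.",
]
question = "When are expense reports due?"
prompt = build_prompt(question, docs)
print(prompt)
```

The resulting prompt carries the retrieved passage together with the question, which is the step that lets the model reason over data it was not trained on.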

This page provides alert settings for an AI RAG pattern deployment. It lists the relevant metrics and recommended thresholds for the key services in a RAG architecture. For a reference architecture design of RAG, see GPT-RAG.

The following diagram shows a basic architecture of a RAG implementation.

[Figure: RAG Basic Architecture]

These settings may change as we continue to work with a broad range of customers.

Alerts

| Alert Name | Component | Metric | Aggregation | Operator | Threshold | Window | Frequency | Severity | Scope | Support for Multiple Resources | Verified | References |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|  | Microsoft.CognitiveServices/accounts | TotalCalls | Total | GreaterThan | 5000 | PT5M | PT1M | 3 |  | No | N |  |
|  | Microsoft.CognitiveServices/accounts | TotalErrors | Total | GreaterThan | 5 | PT5M | PT1M | 2 |  | No | N |  |
|  | Microsoft.CognitiveServices/accounts | Latency | Average | GreaterThan | 90 | PT5M | PT1M | 2 |  | No | N |  |
|  | Microsoft.CognitiveServices/accounts | SuccessRate | Average | LessThan | 99.9 | PT5M | PT1M | 3 |  | No | N |  |
|  | Microsoft.CognitiveServices/accounts | ServerErrors | Total | GreaterThan | 0 | PT5M | PT1M | 3 |  | No | N |  |
|  | Microsoft.CognitiveServices/accounts | TokenTransaction | Total | GreaterThan | 180000 | PT5M | PT1M | 2 |  | No | N |  |
|  | Microsoft.CognitiveServices/accounts | BlockedCalls | Total | GreaterThan | 0 | PT5M | PT1M | 2 |  | No | N |  |
|  | Microsoft.CognitiveServices/accounts | ClientErrors | Total | GreaterThan | 0 | PT5M | PT1M | 1 |  | No | N |  |
|  | Microsoft.CognitiveServices/accounts | AzureOpenAIContextTokensCacheMatchRate | Total | GreaterThan | 75 | PT5M | PT1M | 2 |  | No | Y |  |
|  | Microsoft.CognitiveServices/accounts | AzureOpenAIProvisionedManagedUtilizationV2 | Total | GreaterThan | 80 | PT5M | PT1M | 2 |  | No | Y |  |
|  | Microsoft.CognitiveServices/accounts | AzureOpenAITimeToResponse | Total | GreaterThan | 200 | PT5M | PT1M | 2 |  | No | Y |  |
|  | Microsoft.DocumentDB/databaseAccounts | TotalRequests | Count | GreaterThan | 5 | PT5M | PT1M | 3 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor; Monitoring Azure Cosmos DB data reference; Explore Azure Monitor Azure Cosmos DB insights |
|  | Microsoft.DocumentDB/databaseAccounts | NormalizedRUConsumption | Average | GreaterThan | 70 | PT5M | PT1M | 3 |  | No | N | How to monitor normalized RU/s for an Azure Cosmos DB container or an account; Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor |
|  | Microsoft.DocumentDB/databaseAccounts | ServiceAvailability | Average | LessThan | 99.9 | PT1H | PT5M | 1 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor; Monitoring Azure Cosmos DB data reference |
|  | Microsoft.DocumentDB/databaseAccounts | TotalRequestUnits | Total | GreaterThan | 100 | PT5M | PT1M | 2 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor; Monitoring Azure Cosmos DB data reference; How to monitor throughput or request unit usage of an operation in Azure Cosmos DB |
|  | Microsoft.DocumentDB/databaseAccounts | ServerSideLatency | Average | GreaterThan | 100 | PT5M | PT1M | 3 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor; Monitoring Azure Cosmos DB data reference |
|  | Microsoft.DocumentDB/databaseAccounts | ProvisionedThroughput | Maximum | GreaterThan | 3000 | PT1H | PT1M | 3 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor; Monitoring Azure Cosmos DB data reference |
|  | Microsoft.DocumentDB/databaseAccounts | RegionFailover | Count | GreaterThan | 0 | PT5M | PT1M | 3 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor |
|  | Microsoft.DocumentDB/databaseAccounts | UpdateAccountKeys | Count | GreaterThanOrEqual | 1 | PT5M | PT5M | 2 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor; Monitor your Azure Cosmos DB account for key updates and key regeneration |
|  | Microsoft.DocumentDB/databaseAccounts | DataUsage | Total | GreaterThan | 2.147483648e+09 | PT5M | PT1M | 3 |  | No | N |  |
|  | Microsoft.DocumentDB/databaseAccounts | MongoRequests | Count | GreaterThan | 9 | PT5M | PT1M | 3 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor; Monitoring Azure Cosmos DB data reference |
|  | Microsoft.DocumentDB/databaseAccounts | RemoveRegion | Count | GreaterThanOrEqual | 0 | PT15M | PT5M | 3 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor |
|  | Microsoft.DocumentDB/databaseAccounts | ReplicationLatency | Average | GreaterThan | 5000 | PT15M | PT5M | 3 |  | No | N |  |
|  | Microsoft.DocumentDB/databaseAccounts | SqlContainerDelete | Count | GreaterThanOrEqual | 0 | PT15M | PT5M | 2 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor |
|  | Microsoft.DocumentDB/databaseAccounts | OfflineRegion | Count | GreaterThan | 0 | PT5M | PT1M | 3 |  | No | N |  |
|  | Microsoft.DocumentDB/databaseAccounts | SqlDatabaseDelete | Count | GreaterThanOrEqual | 0 | PT15M | PT5M | 2 |  | No | N | Monitor Azure Cosmos DB; Create alerts for Azure Cosmos DB using Azure Monitor |
|  | Microsoft.KeyVault/vaults | Availability | Average | LessThan | 90 | PT5M | PT1M | 1 |  | No | Y | Monitoring KeyVault Reference; Monitoring Microsoft.KeyVault/vaults; KeyVault Insights Overview |
|  | Microsoft.KeyVault/vaults | SaturationShoebox | Average | GreaterThan | 75 | PT5M | PT1M | 1 |  | No | Y | Monitoring KeyVault Reference; Monitoring Microsoft.KeyVault/vaults; KeyVault Insights Overview |
|  | Microsoft.KeyVault/vaults | ServiceApiLatency | Average | GreaterThan | 1000 | PT5M | PT5M | 3 |  | No | Y | Monitoring KeyVault Reference; Monitoring Microsoft.KeyVault/vaults; KeyVault Insights Overview |
|  | Microsoft.KeyVault/vaults | ServiceApiResult | Average | GreaterThan | dynamic | PT5M | PT5M | 2 |  | No | Y | Monitoring KeyVault Reference; Monitoring Microsoft.KeyVault/vaults; KeyVault Insights Overview |
|  | Microsoft.KeyVault/vaults | ServiceApiHit | Average | GreaterThanOrEqual | 80 | PT5M | PT5M | 3 |  | No | N |  |
|  | Microsoft.Search/searchServices | SearchLatency | Average | GreaterThan | 5 | PT5M | PT1M | 3 |  | No | N |  |
|  | Microsoft.Search/searchServices | ThrottledSearchQueriesPercentage | Average | GreaterThan | 10 | PT5M | PT1M | 3 |  | No | N |  |
|  | Microsoft.Storage/storageAccounts | Availability | Average | LessThan | 100 | PT5M | PT5M | 1 |  | No | Y | Monitoring Availability; Supported metrics for Microsoft.Storage/storageAccounts |
|  | Microsoft.Storage/storageAccounts/fileServices | Transactions | Total | GreaterThanOrEqual | 1 | PT15M | PT5M | 2 |  | No | N | High latency, low throughput, or low IOPS |
|  | Microsoft.Storage/storageAccounts | UsedCapacity | Average | GreaterThan | 2.2518e+15 | PT1H | PT1H | 3 |  | No | N | Account Level Metrics; Azure Storage Metric - Used Capacity |
|  | Microsoft.Storage/storageAccounts | Egress | Total | GreaterThan | 6e+07 | PT5M | PT5M | 2 |  | No | N | Transaction Metrics; Storage Account Metric Dimensions (all storage) |
|  | Microsoft.Storage/storageAccounts | Ingress | Total | GreaterThan | 1.073741824e+09 | PT5M | PT5M | 3 |  | No | N | Transaction Metrics; Storage Account Metric Dimensions (all storage) |
|  | Microsoft.Storage/storageAccounts/blobServices | SuccessE2ELatency | Average | GreaterThan | 1000 | PT5M | PT1M | 3 |  | No | N | Verify throughput and latency metrics for a storage account; Troubleshoot performance in Azure storage accounts |
|  | Microsoft.Storage/storageAccounts/blobServices | SuccessServerLatency | Average | GreaterThan | 1000 | PT5M | PT1M | 2 |  | No | N | Troubleshoot performance in Azure storage accounts; Verify throughput and latency metrics for a storage account; Storage Transaction Metrics |
|  | Microsoft.Storage/storageAccounts/fileServices | Transactions | Total | GreaterThan | 10 | PT5M | PT1M | 3 |  | No | N | Identify storage accounts with no or low use; Monitor the use of a container; Storage Transaction Metrics |
|  | Microsoft.Web/sites | AverageResponseTime | Average | GreaterThan | 60 | PT5M | PT5M | 3 |  | No | N |  |
|  | Microsoft.Web/sites | CpuTime | Total | GreaterThan | 120 | PT5M | PT1M | 3 |  | No | N | Understand App Service Metrics; Supported Metrics; Monitor your app; CPU time vs CPU percentage; Alerts and Autoscale in Azure App Service |
|  | Microsoft.Web/sites | AppConnections | Maximum | GreaterThan | 6000 | PT15M | PT5M | 3 |  | No | N | Understand App Service Metrics; Supported Metrics; Manage Connections in Azure Functions; Configure Monitoring for Azure Functions |
|  | Microsoft.Web/sites | RequestsInApplicationQueue | Maximum | GreaterThan | 10 | PT15M | PT5M | 3 |  | No | N | Understand App Service Metrics; Supported Metrics |
|  | Microsoft.Web/sites | PrivateBytes | Average | GreaterThan | 1.2e+09 | PT5M | PT1M | 3 |  | No | N | Understand App Service Metrics; Supported Metrics |
|  | Microsoft.Web/sites | FileSystemUsage | Average | GreaterThan | 4e+08 | PT6H | PT1H | 1 |  | No | N | Understand App Service Metrics; Supported Metrics; Quota Enforcement |
|  | Microsoft.Web/sites | MemoryWorkingSet | Average | GreaterThan | 1.5e+09 | PT5M | PT1M | 3 |  | No | N | Understand App Service Metrics; Supported Metrics; Monitor your app |
|  | Microsoft.Web/sites | Threads | Average | GreaterThan | 200 | PT15M | PT5M | 4 |  | No | N | Understand App Service Metrics; Supported Metrics |
|  | Microsoft.Web/sites | Http401 | Total | GreaterThan | 20 | PT5M | PT5M | 2 |  | No | N | Understand App Service Metrics; Supported Metrics; Client-side JavaScript SDK Exception Reporting |
|  | Microsoft.Web/sites | Requests | Total | GreaterThan | 1000 | PT5M | PT1M | 3 |  | No | N |  |
|  | Microsoft.Web/sites | FunctionExecutionCount | Total | LessThanOrEqual | 0 | PT5M | PT5M | 1 |  | No | N | Function Execution Count; Monitor Azure Functions; Supported Metrics |
|  | Microsoft.Web/sites | BytesSent | Average | GreaterOrLessThan | dynamic | PT5M | PT1M | 3 |  | No | N | Understand App Service Metrics; Supported Metrics |
|  | Microsoft.Web/sites | Http406 | Total | GreaterThan | 1 | PT15M | PT15M | 1 |  | No | N | Understand App Service Metrics; Supported Metrics |
|  | Microsoft.Web/sites | Http3xx | Total | GreaterThan | 15 | PT5M | PT5M | 3 |  | No | N | Understand App Service Metrics; Supported Metrics; Enable diagnostic logging for Apps in Azure App Service; HTTP Status Classes |
|  | Microsoft.Web/sites | WorkflowRunsFailureRate | Total | GreaterThan | 0 | PT5M | PT5M | 1 |  | No | N |  |
|  | Microsoft.Web/sites | BytesReceived | Total | GreaterThan | 2.048e+09 | PT5M | PT1M | 3 |  | No | N | Understand App Service Metrics; Supported Metrics |
|  | Microsoft.Web/sites | Handles | Average | GreaterOrLessThan | dynamic | PT5M | PT1M | 2 |  | No | N | Understand App Service Metrics; Supported Metrics |
|  | Microsoft.Web/sites | FunctionExecutionUnits | Total | GreaterThan | 1.3e+10 | PT5M | PT1M | 3 |  | No | N | Function Execution Units |
|  | Microsoft.Web/sites | WorkflowTriggersFailureRate | Total | GreaterThan | 50 | PT5M | PT5M | 1 |  | No | N |  |
|  | Microsoft.Web/sites | Http2xx | Total | GreaterThan | 15 | PT5M | PT5M | 3 |  | No | N |  |
|  | Microsoft.Web/sites | CurrentAssemblies | Average | GreaterThan | 0 | PT1M | PT1M | 0 |  | No | N |  |
|  | Microsoft.Web/sites/slots | Http5xx | Total | GreaterThan | 10 | PT15M | PT5M | 1 |  | No | N | Understand App Service Metrics; Supported Metrics; Diagnose Web Apps' Performance with Application Insights; Troubleshoot HTTP 502/503 Errors |
|  | Microsoft.Web/sites/slots | HttpResponseTime | Average | GreaterThan | 5 | PT30M | PT15M | 1 |  | No | N | Understand App Service Metrics; Supported Metrics; Troubleshoot Slow App Performance |
|  | Microsoft.Web/sites/slots | Http4xx | Average | GreaterThan | 5 | PT30M | PT15M | 1 |  | No | N | Understand App Service Metrics; Supported Metrics |
|  | Microsoft.Web/sites/slots | AverageMemoryWorkingSet | Average | GreaterThan | 8e+08 | PT5M | PT5M | 3 |  | No | N |  |
|  | Microsoft.Web/sites/slots | HealthCheckStatus | Average | LessThan | 100 | PT5M | PT1M | 3 |  | No | N | Understand App Service Metrics; Supported Metrics; Monitor App Service Instances using Health check |
|  | Microsoft.Web/sites/slots | Http403 | Total | GreaterThan | 5 | PT30M | PT15M | 0 |  | No | N |  |
|  | Microsoft.Web/sites/slots | Http404 | Average | GreaterThan | 30 | PT15M | PT5M | 2 |  | No | N | Understand App Service Metrics; Supported Metrics |
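
To make the Aggregation, Operator, Threshold, Window, and Frequency columns concrete, the sketch below models one row of the table and checks it against a handful of metric samples. It is an illustration only, not how Azure Monitor evaluates rules internally: the `AlertRule` class, the `fires` method, and the sample values are hypothetical, the `Count` aggregation is simplified to the number of samples, and rows with a `dynamic` threshold or the `GreaterOrLessThan` operator are not covered. The Window and Frequency values are ISO 8601 durations (for example, `PT5M` is five minutes).

```python
import re
from dataclasses import dataclass
from datetime import timedelta
from statistics import mean

# Comparison operators as they appear in the Operator column (static thresholds only).
OPERATORS = {
    "GreaterThan": lambda value, threshold: value > threshold,
    "GreaterThanOrEqual": lambda value, threshold: value >= threshold,
    "LessThan": lambda value, threshold: value < threshold,
    "LessThanOrEqual": lambda value, threshold: value <= threshold,
}

# Aggregations as they appear in the Aggregation column.
# Assumption: Count is approximated as the number of samples in the window.
AGGREGATIONS = {"Total": sum, "Average": mean, "Maximum": max, "Count": len}

_DURATION = re.compile(r"^PT(?:(\d+)H)?(?:(\d+)M)?(?:(\d+)S)?$")


def parse_duration(text: str) -> timedelta:
    """Parse the time-only ISO 8601 durations used in the table, such as PT5M or PT1H."""
    match = _DURATION.match(text)
    if match is None or not any(match.groups()):
        raise ValueError(f"unsupported duration: {text!r}")
    hours, minutes, seconds = (int(part) if part else 0 for part in match.groups())
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


@dataclass(frozen=True)
class AlertRule:
    """One table row (hypothetical structure for illustration)."""
    component: str
    metric: str
    aggregation: str
    operator: str
    threshold: float
    window: str
    frequency: str
    severity: int

    def fires(self, samples) -> bool:
        """Aggregate the samples from one window and compare against the threshold."""
        aggregated = AGGREGATIONS[self.aggregation](samples)
        return OPERATORS[self.operator](aggregated, self.threshold)


# The TotalErrors row for Microsoft.CognitiveServices/accounts from the table.
rule = AlertRule(
    component="Microsoft.CognitiveServices/accounts",
    metric="TotalErrors",
    aggregation="Total",
    operator="GreaterThan",
    threshold=5,
    window="PT5M",
    frequency="PT1M",
    severity=2,
)

print(parse_duration(rule.window))   # 0:05:00
print(rule.fires([1, 0, 2, 3, 0]))   # True: 6 errors in the window exceeds 5
print(rule.fires([1, 0, 2, 2, 0]))   # False: 5 errors does not exceed 5
```

Read this way, the TotalErrors row looks back over a five-minute window, is checked once a minute, and raises a severity 2 alert when more than five errors are recorded in that window.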