9.1 Azure Monitor Metrics, Logs, and Diagnostic Settings

Key Takeaways

Azure Monitor metrics are numeric time-series data suited to near-real-time charting, alerting, and autoscale decisions.
Azure Monitor logs store records in Log Analytics workspaces and are queried with KQL for investigation and reporting.
Diagnostic settings route resource logs and metrics to Log Analytics, Storage accounts, Event Hubs, and partner destinations.
Activity log records subscription-level control plane events, while resource logs capture service-specific data plane or platform behavior.
AZ-104 scenarios often test where to enable collection, where data lands, and which signal type supports the required query or alert.

Last updated: May 2026

Signal types and collection boundaries

Azure Monitor is the central monitoring platform for Azure resources, but the exam expects more than knowing the product name. You need to decide whether a problem should be solved with platform metrics, activity log events, resource logs, guest logs, diagnostic settings, or a Log Analytics query. The right answer usually depends on the source of the event and the operation the administrator must perform.

Metrics are numeric measurements collected at regular intervals. CPU percentage on a VM, storage account availability, disk read operations, HTTP server errors, and gateway throughput are metric-style signals. Metrics are useful when the requirement says near-real-time chart, threshold alert, autoscale, or quick capacity view. They are not the best fit when the requirement asks to search individual records, parse message text, join tables, or find the caller who changed a setting.

Logs are structured records stored in tables. Azure Monitor Logs, usually queried through a Log Analytics workspace, supports Kusto Query Language, or KQL. Logs are stronger for investigation because each record contains columns, timestamps, resource identifiers, status values, caller details, and message fields. Logs are also better for cross-resource questions, such as finding which VMs failed updates, which storage operations were denied, or which resources generated errors during a deployment window.

The activity log is a subscription-level control plane log. It records create, update, delete, policy, service health, security, recommendation, and administrative events. If a question asks who deleted a network security group rule, which policy denied a deployment, or when a VM size was changed, start with the activity log. If a question asks what an application wrote to stdout, what a firewall allowed, or which storage request returned 403, start with resource or guest logs.

Diagnostic settings

Diagnostic settings are the common routing mechanism for many Azure resource logs and selected metrics. They do not magically enable every data source in Azure. They are configured on resources, subscriptions, and some tenant-level services, and they choose categories plus destinations.

Portal path: Azure portal > Resource > Monitoring > Diagnostic settings > Add diagnostic setting. Select log categories, optionally metrics, then choose one or more destinations.

Destination	Use when the requirement says	AZ-104 decision clue
Log Analytics workspace	Query, correlate, alert from logs, build workbooks, use KQL	Best default for operational analysis.
Storage account	Long retention, archive, compliance export, low-cost raw storage	Querying is not the primary goal.
Event Hubs	Stream to SIEM, custom platform, or third-party collector	Integration or near-real-time forwarding is named.
Partner solution	Send to integrated partner monitoring platform	The scenario names a supported partner destination.

A common mistake is creating a Log Analytics workspace and assuming all resources automatically send diagnostic logs to it. Many platform metrics are available automatically, but resource logs usually require diagnostic settings. For VMs, guest OS performance counters and event logs require an agent and data collection configuration, not just a diagnostic setting on the VM resource.

Metrics and dimensions

Metrics live in a metrics database optimized for time-series operations. In Metrics Explorer, choose the scope, metric namespace, metric, aggregation, time range, and dimensions. Dimensions split a metric by properties such as status code, API name, disk, or instance. If a storage account has high total transactions, dimensioning by response type can quickly show whether the problem is throttling, authentication failures, or normal load.

CLI example:

az monitor metrics list \
  --resource <resource-id> \
  --metric "Percentage CPU" \
  --interval PT5M \
  --aggregation Average \
  --output table

Metric alerts evaluate metric values against thresholds. They are often faster and cheaper than log alerts for simple numeric conditions. Use a log query alert when the condition depends on filtering records, joining tables, or counting events that only exist in Log Analytics.

Workspace design and retention

A Log Analytics workspace is a regional data store and administrative boundary. Workspaces have retention settings and access controls. A centralized workspace can simplify cross-resource queries and alert management. Separate workspaces can support data residency, billing separation, environment isolation, or team boundaries. For AZ-104, do not assume one workspace is always correct. Match the design to region, operations ownership, and query needs.

Workspace access can be workspace-context or resource-context. With resource-context access, a user who can read a resource can view that resource's logs without gaining broad access to every table in the workspace. This matters when a central operations team owns the workspace but application teams should only see their own resources.

Implementation workflow

A practical monitoring setup for a production resource group follows a repeatable order:

Identify required signals: platform metrics, activity log, resource logs, guest logs, or application logs.
Create or select the Log Analytics workspace in the correct region and subscription.
Enable diagnostic settings on resources and subscription activity logs.
Configure VM data collection with Azure Monitor Agent when guest data is needed.
Verify ingestion by querying expected tables and checking timestamps.
Create workbooks, alerts, and action groups only after confirming data exists.

Example diagnostic setting command:

az monitor diagnostic-settings create \
  --name send-to-law \
  --resource <resource-id> \
  --workspace <workspace-resource-id> \
  --logs '[{"category":"AuditEvent","enabled":true}]' \
  --metrics '[{"category":"AllMetrics","enabled":true}]'

Categories differ by resource provider. On the exam, a command with a plausible category for one service might be wrong for another. In real administration, inspect supported categories first:

az monitor diagnostic-settings categories list --resource <resource-id> --output table

Troubleshooting collection gaps

If a workbook or alert shows no data, start with scope and ingestion. Confirm the resource has a diagnostic setting and that the selected category is enabled. Confirm the destination workspace is the workspace you are querying. Check the time range. Many investigations fail because the query is correct but the time picker excludes the event.

If activity log events appear but resource details do not, remember that activity log and resource logs are different sources. If VM CPU appears but Windows event logs do not, platform metrics are present but guest collection is missing. If data exists in a storage account but not Log Analytics, the diagnostic setting destination is archival rather than queryable.

Exam traps are usually wording traps. "Monitor administrative changes" points to activity log. "Query failed sign-ins" points outside ordinary resource metrics and into identity logs if configured. "Alert when CPU is above 90 percent" points to metric alerts. "Alert when more than five 403 storage requests occur in ten minutes" points to diagnostic logs plus a KQL query. "Keep raw logs for seven years" points to storage export or archive strategy, not a short interactive query.

Test Your Knowledge

An administrator needs to query storage account authorization failures by request status and caller IP. What should be configured first?

A diagnostic setting that sends the relevant storage resource logs to a Log Analytics workspace

A metric alert on total transactions only

A Recovery Services vault

A VM availability set

Test Your Knowledge

Which Azure Monitor signal is best for finding who changed a VM size at the subscription control plane?

Activity log

Guest Windows Security log

Storage lifecycle log

Application Insights dependency telemetry

Test Your Knowledge

A company must archive resource logs for long-term compliance but does not need frequent KQL queries against the archive. Which diagnostic setting destination best fits?

Storage account

Availability set

Azure Bastion

Public IP prefix

Up Next

9.2 Log Analytics, KQL, and Operational Queries

Continue learning

AZ-104 Microsoft Azure Administrator Study Guide

Azure AZ-104

9.1 Azure Monitor Metrics, Logs, and Diagnostic Settings

Key Takeaways

Signal types and collection boundaries

Diagnostic settings

Metrics and dimensions

Workspace design and retention

Implementation workflow

Troubleshooting collection gaps

AZ-104 Microsoft Azure Administrator Study Guide

1Chapter 1: Exam Orientation and Microsoft Learn Source Control

2Chapter 2: Identity, RBAC, and Governance Foundations

3Chapter 3: Subscriptions, Policy, Costs, and Resource Organization

4Chapter 4: Storage Accounts, Access, and Data Protection

5Chapter 5: Compute, Virtual Machines, Scale Sets, and Bicep

6Chapter 6: Containers, App Service, and Platform Compute

7Chapter 7: Virtual Networking, Routing, and Secure Access

8Chapter 8: Name Resolution, Load Balancing, and Connectivity Troubleshooting

9Chapter 9: Monitoring, Alerting, Backup, and Site Recovery

10Chapter 10: AZ-104 Integration and Troubleshooting Case Labs

11Chapter 11: Final Review, Exam Experience, Renewal, and Career Path

Azure AZ-104

9.1 Azure Monitor Metrics, Logs, and Diagnostic Settings

Key Takeaways

Signal types and collection boundaries

Diagnostic settings

Metrics and dimensions

Workspace design and retention

Implementation workflow

Troubleshooting collection gaps