1.5 Azure AI Solution Architecture Patterns
Key Takeaways
- The microservices pattern deploys each AI capability as an independent service, enabling independent scaling, updates, and fault isolation.
- The orchestration pattern uses a central orchestrator (like Azure Functions or Logic Apps) to coordinate multiple AI services in a pipeline.
- The RAG (Retrieval-Augmented Generation) pattern combines Azure AI Search with Azure OpenAI Service to generate grounded responses from enterprise data.
- Edge deployment using containers allows AI inference at the edge with Azure IoT Edge for low-latency, offline-capable scenarios.
- Solution architecture must address scalability, fault tolerance, cost optimization, security, and compliance requirements.
Azure AI Solution Architecture Patterns
Quick Answer: Key Azure AI architecture patterns include microservices (independent AI services), orchestration (pipeline coordination), RAG (search + generative AI), and edge deployment (containers at the edge). Design solutions with scalability, fault tolerance, cost optimization, and security in mind.
Pattern 1: Microservices Architecture
Each AI capability runs as a separate service that can be developed, deployed, and scaled independently of the others.
[Client App] → [API Gateway]
├── [Vision Service] → Azure AI Vision
├── [Language Service] → Azure AI Language
├── [Speech Service] → Azure AI Speech
└── [Search Service] → Azure AI Search
When to use: Large applications with multiple AI capabilities that have different scaling requirements and update cycles.
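The gateway's job in this pattern is simple dispatch: map a requested capability to the backend service that owns it. A minimal sketch, where the capability names and internal endpoint URLs are purely illustrative (not real Azure endpoints):

```python
# Illustrative API-gateway routing table for the microservices pattern.
# Each capability maps to its own independently scaled backend service.
SERVICE_ROUTES = {
    "vision":   "https://vision-svc.internal/analyze",
    "language": "https://language-svc.internal/analyze",
    "speech":   "https://speech-svc.internal/transcribe",
    "search":   "https://search-svc.internal/query",
}

def route_request(capability: str) -> str:
    """Return the backend endpoint that owns the requested AI capability."""
    try:
        return SERVICE_ROUTES[capability]
    except KeyError:
        raise ValueError(f"Unknown capability: {capability}")
```

Because each backend sits behind its own route, a traffic spike on `vision` can be absorbed by scaling only that service, and a bad deployment of `speech` cannot take down the others.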
Pattern 2: Orchestration Pipeline
A central orchestrator coordinates multiple AI services in sequence or parallel to process complex requests.
[Input] → [Azure Function Orchestrator]
├── Step 1: OCR (Document Intelligence)
├── Step 2: Entity Extraction (AI Language)
├── Step 3: Sentiment Analysis (AI Language)
├── Step 4: Content Safety Check
└── Step 5: Index Results (AI Search)
When to use: Document processing pipelines, content enrichment workflows, multi-step AI analysis.
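The five steps above can be sketched as a sequential pipeline where each step is a function. The step bodies here are stubs standing in for the real Azure AI calls (Document Intelligence OCR, AI Language, Content Safety, AI Search); only the coordination logic is the point:

```python
# Sketch of an orchestrator running AI steps in sequence, each enriching
# the same document dict. Step implementations are illustrative stubs.
def ocr(doc):              return {**doc, "text": "extracted text"}
def extract_entities(doc): return {**doc, "entities": ["Contoso"]}
def sentiment(doc):        return {**doc, "sentiment": "neutral"}
def safety_check(doc):     return {**doc, "safe": True}
def index_results(doc):    return {**doc, "indexed": True}

PIPELINE = [ocr, extract_entities, sentiment, safety_check, index_results]

def run_pipeline(document: dict) -> dict:
    """Pass the document through each AI step in order."""
    for step in PIPELINE:
        document = step(document)
    return document

result = run_pipeline({"id": "doc-1"})
```

In a real deployment the orchestrator would be an Azure Function (or a Logic App), and independent steps like entity extraction and sentiment analysis could run in parallel rather than in sequence.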
Pattern 3: RAG (Retrieval-Augmented Generation)
Combines search and generative AI to produce grounded responses from enterprise data:
[User Query]
→ [Azure AI Search] (retrieve relevant documents)
→ [Prompt Construction] (query + retrieved context + system message)
→ [Azure OpenAI Service] (generate grounded response)
→ [Content Safety] (filter harmful content)
→ [Response to User]
When to use: Enterprise chatbots, knowledge bases, customer support, document Q&A. This is the most tested architecture pattern on the AI-102 exam.
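The prompt-construction step in the flow above is worth seeing concretely. A minimal sketch, where `retrieve` is a stub for an Azure AI Search query and the passage text is invented for illustration:

```python
# Sketch of RAG prompt construction: retrieved passages are folded into the
# system message so the model answers only from enterprise data.
def retrieve(query: str) -> list[str]:
    # stub for an Azure AI Search call returning relevant passages
    return ["Passage about vacation policy.", "Passage about sick leave."]

def build_rag_prompt(query: str, passages: list[str]) -> list[dict]:
    context = "\n\n".join(passages)
    system = (
        "Answer using ONLY the sources below. "
        "If the answer is not in the sources, say you don't know.\n\n"
        f"Sources:\n{context}"
    )
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": query},
    ]

query = "How many vacation days do I get?"
messages = build_rag_prompt(query, retrieve(query))
```

The resulting `messages` list is what gets sent to Azure OpenAI Service; grounding comes entirely from the retrieved context, not from the model's training data.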
Pattern 4: Edge Deployment
Deploy AI inference at the edge using containerized models for low-latency, offline-capable scenarios:
[IoT Device / Camera]
→ [Azure IoT Edge]
→ [Custom Vision Container] (local inference)
→ [Speech Container] (local STT/TTS)
→ [Azure IoT Hub] (sync results to cloud)
When to use: Manufacturing quality inspection, retail analytics, remote locations with limited connectivity.
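The key behavior in this pattern is that inference never leaves the device, while results are buffered and synced to IoT Hub opportunistically. A sketch under that assumption, with `infer_local` and `send_to_iot_hub` as illustrative stubs:

```python
# Sketch of the edge loop: score frames locally in the container, buffer
# results on the device, and flush to IoT Hub only when connectivity allows.
pending: list = []   # results awaiting sync
synced:  list = []   # results delivered to the cloud

def infer_local(frame: bytes) -> dict:
    # stands in for a Custom Vision container scoring call on the device
    return {"frame_bytes": len(frame), "defect": False}

def send_to_iot_hub(result: dict) -> None:
    synced.append(result)  # stands in for an IoT Hub device-to-cloud message

def on_frame(frame: bytes, online: bool) -> None:
    pending.append(infer_local(frame))       # inference never needs the cloud
    if online:
        while pending:
            send_to_iot_hub(pending.pop(0))  # flush any buffered results

on_frame(b"\x00" * 10, online=False)  # offline: result stays buffered
on_frame(b"\x00" * 10, online=True)   # back online: both results sync
```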
Container Deployment for Azure AI Services
Several Azure AI services can be deployed as Docker containers for on-premises or edge scenarios:
| Service | Container Available | Use Case |
|---|---|---|
| Azure AI Language | Yes | Sentiment, NER, key phrases on-premises |
| Azure AI Speech | Yes | STT/TTS without internet dependency |
| Azure AI Vision (Read/OCR) | Yes | OCR processing at the edge |
| Azure AI Document Intelligence | Yes | Form processing on-premises |
| Azure AI Translator | Yes | Offline translation |
Container Deployment Requirements
- Containers still require periodic connectivity to Azure for billing (metering)
- EULA acceptance is required via container environment variables
- Endpoint and API key must be configured for billing verification
- Containers do NOT support all features available in the cloud version
On the Exam: Container deployment questions typically test whether you know that: (1) containers require internet for billing, (2) not all features are available in containers, and (3) containers are used for latency, compliance, or connectivity requirements.
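The three required settings from the list above (EULA acceptance, billing endpoint, API key) are passed to the container as environment variables. An illustrative helper that assembles the `docker run` arguments; the image name, endpoint, and key values are placeholders:

```python
# Builds the `docker run` argument list with the three environment
# variables every Azure AI service container requires.
def container_run_args(image: str, endpoint: str, api_key: str) -> list[str]:
    return [
        "docker", "run", "--rm", "-p", "5000:5000",
        "-e", "Eula=accept",            # required: accept the license
        "-e", f"Billing={endpoint}",    # required: Azure resource endpoint for metering
        "-e", f"ApiKey={api_key}",      # required: key for billing verification
        image,
    ]

args = container_run_args(
    "<container-image>",                               # placeholder image name
    "https://<resource>.cognitiveservices.azure.com",  # placeholder endpoint
    "<api-key>",                                       # placeholder key
)
```

If any of the three values is missing or invalid, the container refuses to start; this is the mechanism behind the "containers still need periodic connectivity for billing" exam point.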
Cost Optimization Strategies
| Strategy | Description | Savings |
|---|---|---|
| Commitment tier pricing | Pre-purchase usage at discounted rates | 15-30% |
| Right-size deployments | Match provisioned throughput to actual demand | Variable |
| Batch processing | Use batch APIs instead of real-time for non-urgent tasks | 40-60% |
| Caching | Cache frequently requested results to reduce API calls | Variable |
| Free tier for development | Use F0 tier for development and testing | 100% (dev only) |
| Regional pricing differences | Deploy in lower-cost regions where compliance permits | 5-20% |
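The caching strategy from the table is often the cheapest win: since most Azure AI services bill per transaction, identical inputs can be served from a local cache instead of re-invoking the API. A minimal sketch using the standard library, with the sentiment call stubbed out:

```python
import functools

# Cache repeated requests so identical inputs don't trigger billable calls.
@functools.lru_cache(maxsize=1024)
def analyze_sentiment(text: str) -> str:
    # stub standing in for a billable Azure AI Language request
    return "positive" if "great" in text.lower() else "neutral"

analyze_sentiment("Great product!")           # cache miss: would call the API
analyze_sentiment("Great product!")           # cache hit: no API call
hits = analyze_sentiment.cache_info().hits    # 1
```

In production the cache would typically live in a shared store such as Azure Cache for Redis rather than in process memory, and cache keys should account for any request parameters that change the result.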
High Availability and Disaster Recovery
Multi-Region Deployment
- Deploy AI services in two or more Azure regions
- Use Azure Traffic Manager or Azure Front Door for traffic routing
- Implement automatic failover when primary region is unavailable
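Beyond DNS-level routing with Traffic Manager or Front Door, clients can also implement failover themselves by retrying against a secondary region. A sketch of that client-side pattern, with both regional endpoints represented as stub callables:

```python
# Sketch of client-side regional failover: try each regional endpoint in
# priority order and return the first successful response.
def call_with_failover(payload: dict, endpoints: list) -> dict:
    last_error = None
    for endpoint in endpoints:
        try:
            return endpoint(payload)
        except ConnectionError as err:
            last_error = err  # remember the failure, try the next region
    raise last_error

def primary(payload: dict) -> dict:
    raise ConnectionError("primary region unavailable")  # simulated outage

def secondary(payload: dict) -> dict:
    return {"result": "ok", "region": "secondary"}

response = call_with_failover({"text": "hello"}, [primary, secondary])
```

Note that failover only helps if the secondary region hosts the same custom models and configuration, which is why the data-replication practices below matter.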
Data Replication
- Store training data and custom models in geo-redundant storage (GRS)
- Regularly export and backup custom model configurations
- Document model training parameters for reproducibility
Review Questions
1. Which architecture pattern combines Azure AI Search with Azure OpenAI Service to generate responses grounded in enterprise data?
2. Azure AI service containers deployed on-premises still require internet connectivity. Why?
3. A manufacturing company needs real-time defect detection on an assembly line with no internet connectivity. Which deployment pattern should they use?