A company needs to extract vendor names, invoice numbers, and total amounts from thousands of scanned invoices. Which approach should they use?

Azure AI Document Intelligence with the prebuilt-invoice model. The prebuilt-invoice model understands invoice structure and returns named fields (VendorName, InvoiceId, InvoiceTotal, line items) directly. Generic OCR returns only raw text with no field semantics, a generative prompt is non-deterministic and costlier per page, and Search is for retrieval, not field extraction.

A travel app must understand the utterance "Book a flight to Paris" and extract the destination as a structured slot. Which service fits best?

Conversational Language Understanding (CLU). CLU is purpose-built for intent classification plus entity extraction from conversational utterances; it would return the BookFlight intent and a Destination entity of "Paris". A generative model is non-structured overkill, Custom Question Answering only returns FAQ answers, and general NER would tag "Paris" as a location but without an intent.

Which deprecated service must be migrated to Custom Question Answering?

QnA Maker. QnA Maker was retired and its capability moved into Custom Question Answering inside Azure AI Language. LUIS migrates to CLU, Form Recognizer was renamed Document Intelligence, and Cognitive Search was renamed Azure AI Search.

Service Comparison and Selection Guide — Free Study Guide 2026

Quick Answer: Roughly one in five AI-102 questions is "which service?". Memorize the boundaries: Language for deterministic NLP, Azure OpenAI for generative text, Vision Read for general OCR, Document Intelligence for structured fields, Custom Vision for custom image classes, Face for identity, CLU for intents, Custom Question Answering for FAQs, and AI Search for retrieval.

Why "Select the Appropriate Service" Dominates

The December 2025 skills outline lists six service-selection bullets in the Plan and manage an Azure AI solution domain alone (20-25% of the exam). Microsoft writes these items as business scenarios, not API trivia. The trap is that Azure OpenAI can technically do almost anything — so it appears as a tempting distractor. The right answer is usually the purpose-built service because it is cheaper, deterministic, lower-latency, and does not need prompt engineering or grounding.

Text Processing Decision Matrix

Scenario	Correct Service	Why NOT the alternative
Sentiment of customer reviews	Azure AI Language (sentiment)	OpenAI works but Language is cheaper, faster, deterministic
Open-ended summary of a long brief	Azure OpenAI (chat completions)	Language abstractive summarization is extractive/limited
Detect people, places, orgs	Azure AI Language (NER)	OpenAI output is non-structured and varies run to run
Extract policy or case IDs	Azure AI Language custom NER	Trained model is consistent; a prompt drifts
Route support tickets to categories	Azure AI Language custom text classification	Trained on your labels, unlike a generic prompt
Generate marketing copy	Azure OpenAI	Language cannot author creative text
Translate text/documents	Azure AI Translator	Language has no translation feature
Detect and redact PII	Azure AI Language (PII)	Deterministic, comprehensive, returns offsets
Understand "Book a flight to Paris"	CLU	OpenAI is generative, not structured intent + entity
Answer questions from an FAQ KB	Custom Question Answering	Returns curated answers; CLU has no Q&A pairs

Image and Document Decision Matrix

Scenario	Correct Service	Why NOT the alternative
Read text off a street-sign photo	Azure AI Vision Read (OCR)	Document Intelligence is for structured forms
Pull vendor, total, line items from invoices	Document Intelligence prebuilt-invoice	Raw OCR has no field understanding
Classify product photos into custom classes	Azure AI Custom Vision	Vision tags are generic, not your categories
Verify a person's identity (1:1)	Azure AI Face	Vision detects people but cannot recognize
Count/track shoppers on camera	Azure AI Vision Spatial Analysis	Face does recognition, not movement tracking
Insights from a video file	Azure AI Video Indexer	Vision processes still images
Extract across docs, images, audio, video	Content Understanding (Foundry Tools)	New multimodal extraction service
Generate an image from text	Azure OpenAI (DALL-E)	No other Azure service generates images

Knowledge and Generative Decision Matrix

Full-text + vector search over enterprise content -> Azure AI Search (with skillsets for OCR/NER/embeddings during indexing).
Direct curated answers from FAQs -> Custom Question Answering.
Open-ended chat grounded in your data -> Azure OpenAI + Azure AI Search (RAG) — Search retrieves, OpenAI generates the grounded answer with citations.
High-volume forms at scale -> Document Intelligence (prebuilt or composed model).

Deprecated Services and Their Replacements

Deprecated	Current name	Migration
LUIS	CLU (Conversational Language Understanding)	Export LUIS app, import into CLU
QnA Maker	Custom Question Answering	Recreate KB as a Q&A project
Form Recognizer	Document Intelligence	Rename only; same models
Cognitive Search	Azure AI Search	Rename only; same API
Cognitive Services	Azure AI Services / Microsoft Foundry	Multi-service resource
Face emotion / age / gender	Retired	Removed for Responsible AI

On the Exam: If LUIS, QnA Maker, or Form Recognizer appear as options, they are almost always wrong unless the stem asks specifically about migration. Watch the new Foundry branding: "Microsoft Foundry Services", "Azure Vision in Foundry Tools", and "Content Understanding" are the current 2025-2026 names you must recognize.

Worked Example: Reading the Scenario for Disqualifiers

Microsoft writes selection questions so that the disqualifier is a single phrase. Train yourself to find it. Consider: "A retailer receives thousands of supplier invoices in different layouts and must extract totals and line items with the least development effort." The phrase "least development effort" rules out a custom-trained model and rules out an OpenAI prompt you would have to engineer and validate. The phrase "line items" rules out plain OCR, which returns unstructured text. That leaves Document Intelligence prebuilt-invoice — a model that ships ready to recognize invoice fields across layouts.

Now change one word: "...extract a non-standard field that appears only on this retailer's contracts." The phrase "non-standard field" now eliminates the prebuilt model and points to a custom extraction model you train on your own labeled samples. Single-word swaps like this flip the answer, so read every qualifier before scanning the options.

Cost and Latency: Why Purpose-Built Wins

When two services can both produce a result, the exam reward goes to the cheaper, faster, deterministic option. Pre-built Language operations are billed per 1,000 text records and return structured JSON with confidence scores in a single synchronous call. A generative model billed per token must read a system prompt plus your instructions plus the input on every request, can vary its output between runs, and may need a second groundedness or validation pass.

For high-volume, repeatable extraction or classification, Language or Document Intelligence is the defensible answer; reserve Azure OpenAI for tasks that genuinely require open-ended language generation, reasoning over free text, or multimodal understanding that no pre-built model exposes.

Speech, Translation, and Multimodal Edge Cases

A handful of scenarios trip candidates because the obvious service is wrong. Translating a Word or PDF document while preserving formatting is Translator's document translation feature, not Language and not OpenAI. Real-time captioning of a live meeting is Speech (speech-to-text), not Video Indexer, which works on recorded files. Identifying who is speaking across a recording is Speech speaker diarization. Extracting fields from a scanned form, a photo, and an audio note in one ingestion is Content Understanding, the new multimodal extractor in Foundry Tools.

Memorize these so the distractors that pair a plausible-but-wrong service with the right verb do not catch you.

Azure AI Engineer Associate

Azure AI-102

7.1 Service Comparison and Selection Guide

Key Takeaways

Why "Select the Appropriate Service" Dominates

Text Processing Decision Matrix

Image and Document Decision Matrix

Knowledge and Generative Decision Matrix

Deprecated Services and Their Replacements

Worked Example: Reading the Scenario for Disqualifiers

Cost and Latency: Why Purpose-Built Wins

Speech, Translation, and Multimodal Edge Cases

Azure AI Engineer Associate

1Introduction

2Domain 1: Plan and Manage an Azure AI Solution (20-25%)

3Content Safety and Moderation (within Plan and Manage, Domain 1)

4Domain 4: Implement Computer Vision Solutions (10-15%)

5Domain 5: Implement Natural Language Processing Solutions (15-20%)

6Domain 6: Implement Knowledge Mining and Information Extraction Solutions (15-20%)

7Domain 2: Implement Generative AI Solutions (15-20%)

8Domain 3: Implement an Agentic Solution (5-10%)

9Exam Review: Cross-Domain Topics and Advanced Practice

Azure AI-102

7.1 Service Comparison and Selection Guide

Key Takeaways

Why "Select the Appropriate Service" Dominates

Text Processing Decision Matrix

Image and Document Decision Matrix

Knowledge and Generative Decision Matrix

Deprecated Services and Their Replacements

Worked Example: Reading the Scenario for Disqualifiers

Cost and Latency: Why Purpose-Built Wins

Speech, Translation, and Multimodal Edge Cases