What was Azure AI Document Intelligence previously called?

Azure Form Recognizer. The service was renamed as part of the broader Azure AI services rebranding. The AI-102 exam uses the current name, "Document Intelligence", but you may still encounter the old name in documentation and in older SDK packages.

Which model should you use to extract vendor name, invoice total, and line items from an invoice?

prebuilt-invoice. The prebuilt-invoice model is specifically designed to extract invoice-specific fields including vendor name, customer name, invoice total, line items, tax amounts, and due dates. The layout model extracts text and tables but does not understand invoice-specific fields.

What is the purpose of a composed model in Document Intelligence?

To combine multiple custom models into a single endpoint that automatically routes documents to the correct model. A composed model combines multiple custom models (up to 100) into a single endpoint. When a document is submitted, the composed model automatically classifies the document type and routes it to the appropriate component model for field extraction. No pre-classification step is needed.

What is the key difference between template and neural custom models?

Template models work with fixed layouts; neural models handle varying layouts. Template models are designed for documents with consistent, fixed layouts (forms, standardized applications). Neural models use deep learning to handle documents with varying layouts and formats (contracts, letters). Neural models generalize better to new document variations.

Azure AI Document Intelligence

Quick Answer: Document Intelligence (formerly Form Recognizer) extracts structured data from documents. Use prebuilt models for common documents (invoices, receipts, IDs), custom models for domain-specific documents, and composed models to automatically route different document types. The Layout API extracts text, tables, and structure without training.

Document Intelligence Models

Prebuilt Models

Model	Document Type	Extracted Fields
Invoice	Invoices	Vendor, customer, amounts, line items, tax, total
Receipt	Receipts	Merchant, date, items, subtotal, tax, total, tip
ID Document	IDs, passports, driver licenses	Name, DOB, address, document number, expiration
Business Card	Business cards	Name, title, company, phone, email, address
W-2	US tax form W-2	Employee info, employer info, wages, taxes
Health Insurance	Health insurance cards	Insurer, member ID, group number, plan
Contract	Contracts	Parties, dates, terms
US Tax Forms	1040, 1098, 1099 variants	All relevant tax fields

General Models

Model	Purpose	Training Required
Read	Extract text and language from documents	No
Layout	Extract text, tables, selection marks, structure	No
General Document	Extract key-value pairs from any document	No
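
The two tables above boil down to one decision: use a prebuilt model when one matches the document type, otherwise fall back to a general model. The sketch below shows that decision as a lookup. The function name, the dictionary, and the short document-kind keys are hypothetical helpers for illustration; only the model ID strings are service identifiers.

```python
# Hypothetical lookup from a document kind to a prebuilt model ID.
# The keys are illustrative labels, not service values.
PREBUILT_MODEL_IDS = {
    "invoice": "prebuilt-invoice",
    "receipt": "prebuilt-receipt",
    "id": "prebuilt-idDocument",
    "w2": "prebuilt-tax.us.w2",
}


def choose_model_id(document_kind: str, text_only: bool = False) -> str:
    """Return a prebuilt model ID, falling back to a general model."""
    key = document_kind.strip().lower()
    if key in PREBUILT_MODEL_IDS:
        return PREBUILT_MODEL_IDS[key]
    # No matching prebuilt model: Read for plain text, Layout for structure.
    return "prebuilt-read" if text_only else "prebuilt-layout"


print(choose_model_id("Invoice"))
print(choose_model_id("memo"))
print(choose_model_id("memo", text_only=True))
```

The returned string is what you would pass as model_id to begin_analyze_document.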

Using Prebuilt Models

from azure.ai.documentintelligence import DocumentIntelligenceClient
from azure.core.credentials import AzureKeyCredential

client = DocumentIntelligenceClient(
    endpoint="https://my-doc-intel.cognitiveservices.azure.com/",
    credential=AzureKeyCredential("<your-key>")
)

# Analyze an invoice
with open("invoice.pdf", "rb") as f:
    poller = client.begin_analyze_document(
        model_id="prebuilt-invoice",
        body=f
    )
result = poller.result()

for document in result.documents:
    vendor = document.fields.get("VendorName")
    if vendor:
        print(f"Vendor: {vendor.content} "
              f"(confidence: {vendor.confidence:.2f})")

    invoice_total = document.fields.get("InvoiceTotal")
    if invoice_total:
        print(f"Total: {invoice_total.content} "
              f"(confidence: {invoice_total.confidence:.2f})")

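    # Access line items (an array field whose entries are object fields)
    items = document.fields.get("Items")
    if items and items.value_array:
        for item in items.value_array:
            description = item.value_object.get("Description")
            amount = item.value_object.get("Amount")
            # Either sub-field can be missing on a given line item
            if description and amount:
                print(f"  Item: {description.content} = {amount.content}")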
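
Each extracted field carries a confidence score, and a common follow-up is to send low-confidence values to a human reviewer. The sketch below illustrates that idea on plain Python data so it runs without a service call. ExtractedField, split_by_confidence, the 0.80 threshold, and the sample values are all assumptions made for illustration, not part of the SDK.

```python
from typing import Dict, NamedTuple, Tuple


class ExtractedField(NamedTuple):
    """Stand-in for an SDK field: just the content and its confidence."""
    content: str
    confidence: float


def split_by_confidence(
    fields: Dict[str, ExtractedField], threshold: float = 0.80
) -> Tuple[Dict[str, ExtractedField], Dict[str, ExtractedField]]:
    """Separate fields into (accepted, needs_review) by confidence."""
    accepted, needs_review = {}, {}
    for name, field in fields.items():
        target = accepted if field.confidence >= threshold else needs_review
        target[name] = field
    return accepted, needs_review


# Invented sample values, shaped like the invoice fields printed above.
sample = {
    "VendorName": ExtractedField("Contoso Ltd.", 0.97),
    "InvoiceTotal": ExtractedField("$1,250.00", 0.62),
}
accepted, needs_review = split_by_confidence(sample)
print("Accepted:", sorted(accepted))
print("Needs review:", sorted(needs_review))
```

The right threshold depends on the document set and on how costly an extraction error is, so treat 0.80 as a placeholder.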

Layout Model

The Layout model extracts document structure without any training:

# Extract layout (text, tables, structure)
with open("document.pdf", "rb") as f:
    poller = client.begin_analyze_document(
        model_id="prebuilt-layout",
        body=f
    )
result = poller.result()

# Extract text by page
for page in result.pages:
    print(f"Page {page.page_number}:")
    for line in page.lines or []:
        print(f"  Line: {line.content}")

# Extract tables (result.tables is None when no tables are found)
for table in result.tables or []:
    print(f"Table ({table.row_count} rows x {table.column_count} columns):")
    for cell in table.cells:
        print(f"  [{cell.row_index},{cell.column_index}]: {cell.content}")

# Extract selection marks (checkboxes); a page without any returns None
for page in result.pages:
    for mark in page.selection_marks or []:
        print(f"  Checkbox at ({mark.polygon}): {mark.state}")
        # state: "selected" or "unselected"
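
Layout returns table cells as a flat list, each tagged with a row and column index. To work with a table as rows, you can rebuild a grid from those indices. The sketch below assumes a minimal Cell stand-in with the same three attribute names used above; it ignores merged cells (row and column spans) to stay short.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Cell:
    """Stand-in for a table cell with the attributes used above."""
    row_index: int
    column_index: int
    content: str


def cells_to_grid(row_count: int, column_count: int, cells: List[Cell]) -> List[List[str]]:
    """Place each cell's content at [row_index][column_index]."""
    grid = [["" for _ in range(column_count)] for _ in range(row_count)]
    for cell in cells:
        grid[cell.row_index][cell.column_index] = cell.content
    return grid


# Invented 2 x 2 table, listed out of order on purpose.
cells = [
    Cell(1, 1, "2"),
    Cell(0, 0, "Item"),
    Cell(0, 1, "Qty"),
    Cell(1, 0, "Widget"),
]
grid = cells_to_grid(2, 2, cells)
for row in grid:
    print(" | ".join(row))
```

With a real result you would call cells_to_grid(table.row_count, table.column_count, table.cells).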

Custom Models

Template Models (Fixed Layout)

Train on documents with a consistent layout
Best for: Forms, applications, structured questionnaires
Minimum: 5 training documents

Neural Models (Varying Layout)

Handle documents with varying layouts and formats
Best for: Contracts, letters, documents with unpredictable structures
Minimum: 5 training documents (recommended: 15+)
Better generalization than template models
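
The choice between the two build modes can be written as a small rule: fixed layout means template, varying layout means neural, and both need at least five training documents. The helper below is a hypothetical sketch of that rule using the minimums stated above; it is not an SDK function.

```python
MIN_TRAINING_DOCS = 5  # minimum stated above for both model types


def choose_build_mode(fixed_layout: bool, training_doc_count: int) -> str:
    """Return "template" or "neural" based on layout consistency."""
    if training_doc_count < MIN_TRAINING_DOCS:
        raise ValueError(
            f"Need at least {MIN_TRAINING_DOCS} training documents, "
            f"got {training_doc_count}"
        )
    return "template" if fixed_layout else "neural"


print(choose_build_mode(fixed_layout=True, training_doc_count=5))
print(choose_build_mode(fixed_layout=False, training_doc_count=15))
```

The returned string matches the build_mode values used when training a custom model.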

Training a Custom Model

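from azure.ai.formrecognizer import DocumentModelAdministrationClient

# Model management uses an administration client, not the analysis client.
# This call signature comes from the azure-ai-formrecognizer package.
admin_client = DocumentModelAdministrationClient(
    endpoint="https://my-doc-intel.cognitiveservices.azure.com/",
    credential=AzureKeyCredential("<your-key>")
)

# Start custom model training
poller = admin_client.begin_build_document_model(
    build_mode="template",  # or "neural"
    blob_container_url="https://storage.blob.core.windows.net/training-data?<SAS>",
    description="Custom purchase order model"
)

model = poller.result()
print(f"Model ID: {model.model_id}")
print(f"Document types: {list(model.doc_types)}")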

Composed Models

Composed models combine multiple custom models into a single endpoint:

# Create a composed model from existing models. This call lives on the
# administration client (DocumentModelAdministrationClient in the
# azure-ai-formrecognizer package), not on the analysis client.
poller = admin_client.begin_compose_document_model(
    component_model_ids=["invoice-model", "receipt-model", "po-model"],
    description="Unified document processing model"
)

composed_model = poller.result()
print(f"Composed model ID: {composed_model.model_id}")

# When you analyze a document with the composed model,
# it automatically classifies and routes to the correct component model

How Composed Models Work

1. A document is submitted to the composed model endpoint
2. The composed model classifies the document type
3. The appropriate component model processes the document
4. Results include the document type and extracted fields
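
The steps above can be sketched as a classify-then-dispatch function. This is a toy illustration of the routing idea only, not how the service is implemented: the classifier, the component extractors, and every name below are hypothetical stand-ins that operate on plain strings.

```python
from typing import Callable, Dict


def analyze_with_composed_model(
    document_text: str,
    classify: Callable[[str], str],
    component_models: Dict[str, Callable[[str], dict]],
) -> dict:
    """Classify the document, then run the matching component model."""
    doc_type = classify(document_text)        # step 2: classify
    extract = component_models[doc_type]      # step 3: pick the component
    fields = extract(document_text)           # step 3: extract fields
    return {"doc_type": doc_type, "fields": fields}  # step 4: combined result


def toy_classify(text: str) -> str:
    """Keyword-based stand-in for the real classification step."""
    return "invoice" if "invoice" in text.lower() else "receipt"


component_models = {
    "invoice": lambda text: {"InvoiceId": text.split()[-1]},
    "receipt": lambda text: {"MerchantName": text.split()[0]},
}

result = analyze_with_composed_model(
    "Invoice number INV-001", toy_classify, component_models
)
print(result)
```

The caller sees a single entry point and reads the chosen document type from the result, which mirrors the single-endpoint behavior described above.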

On the Exam: Composed models are the answer when a scenario describes processing multiple document types through a single endpoint. The composed model handles routing automatically, so no pre-classification step is needed.

Document Classification

Document classification models categorize documents without extracting fields:

Feature	Description
Purpose	Sort documents into categories before processing
Training	Provide labeled document examples per category
Output	Document type classification with confidence score
Use case	Mail sorting, document routing, triage

# Classify a document
with open("document.pdf", "rb") as f:
    poller = client.begin_classify_document(
        classifier_id="my-document-classifier",
        body=f
    )
result = poller.result()

for document in result.documents:
    print(f"Type: {document.doc_type}")
    print(f"Confidence: {document.confidence:.2f}")
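
Because a classifier returns only a type and a confidence, the routing decision is left to your code. The sketch below shows one possible triage rule: send the document to the queue for its type, or to manual review when the confidence is low or the type is unknown. The queue names, the 0.70 threshold, and the function itself are assumptions for illustration.

```python
from typing import Dict


def route_document(
    doc_type: str,
    confidence: float,
    queues: Dict[str, str],
    threshold: float = 0.70,
) -> str:
    """Return the destination queue for a classified document."""
    if confidence < threshold or doc_type not in queues:
        return "manual-review"
    return queues[doc_type]


# Hypothetical mapping from classifier labels to processing queues.
queues = {"invoice": "accounts-payable", "contract": "legal"}

print(route_document("invoice", 0.93, queues))
print(route_document("invoice", 0.41, queues))
print(route_document("memo", 0.99, queues))
```

With a real result you would pass document.doc_type and document.confidence from the loop above.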

Azure AI Engineer Associate

5.3 Azure AI Document Intelligence
