Cheat sheet

Databricks GenAI Engineer Associate Cheat Sheet

Design Applications

14%of exam

Prompt designTask selectionChain componentsBusiness mappingSolution decomposition

Data Preparation

14%of exam

ChunkingEmbeddingsRetrieval qualityDocument extractionDelta LakeUnity Catalog

Application Development

30%of exam

RAG chainsAgentsTool callingLangChainLangGraphGuardrails

Assembling + Deploying

22%of exam

MLflowModel ServingVector SearchPyfuncCI/CDFoundation Model APIs

Governance

8%of exam

MaskingGuardrailsLicensingUnity CatalogAI GatewaySafe input

Evaluation + Monitoring

12%of exam

Offline evalOnline monitoringLLM judgeInference tablesRAGASCost control

Quick Facts

Exam
Databricks GenAI
Credential
GenAI Engineer Associate
Time
90 min
Pass
70% (32/45)
Questions
45 scored
Fee
$200
Blueprint
Mar 18 2026

Design Decisions

Problem decomposition
Split to subtasks
Task selection
Pick model type
Chain design
Sequence steps
Business mapping
Goal to behavior
Architecture choice
Pick pattern
Cost/latency
Tradeoff balance

Prompt Engineering

Zero-shot
No examples
Few-shot
Example-driven
Chain-of-thought
Step reasoning
System prompt
Set behavior
Role prompt
Persona
Template
Reusable scaffold
Structured output
JSON/schema

Semantic vs Keyword Search

Semantic

  • Vector similarity
  • Meaning match
  • Needs embeddings

Keyword

  • Term match
  • BM25/SQL
  • No embeddings

Meaning vs terms

Chunking Strategy Picker

  1. Uniform docsFixed-size(N tokens)
  2. Sentences matterSentence(Split by .)
  3. Topic shiftsSemantic(Meaning bounds)
  4. Nested structureRecursive(Hierarchy)
  5. Short docsWhole doc(No split)
  6. Need contextOverlap(Share text)
  7. Retrieve precisionParent-child(Child index)

Chunking Strategies

Fixed-size
N tokens each
Sentence
Split by sentence
Semantic
Topic boundaries
Recursive
Hierarchical split
Overlap
Adjacent share text
Parent-child
Index child; retrieve parent
Sliding window
Fixed stride

Data Preparation Pipeline

Extraction
Parse documents
Filtering
Drop noise
Cleaning
Normalize text
Delta Lake
Versioned tables
Unity Catalog
Data governance
Indexing
Build vector index

RAG Pipeline

Ingest | Chunk | Embed | Index | Retrieve | Rerank | Generate

Ingest: parseChunk: splitEmbed: vectorIndex: storeRetrieve: fetchRerank: reorderGenerate: answer

RAG vs Fine-tuning

RAG

  • Inject knowledge
  • No training
  • Updates easy

Fine-tuning

  • Bake behavior
  • Needs training
  • Updates costly

Knowledge vs behavior

RAG vs Fine-tune vs Prompt

  1. Need latest infoRAG(Inject knowledge)
  2. Need private dataRAG(Vector Search)
  3. Need new skillsFine-tune(Bake behavior)
  4. Need behavior shiftFine-tune(LoRA/QLoRA)
  5. Need format onlyPrompt(No training)
  6. Quick one-shot taskPrompt(Zero-shot)
  7. Multi-step taskAgent(Tool calling)
  8. Need external toolsAgent(MCP server)

RAG Components

Vector store
Semantic index
Embedding
Vector representation
Chunking
Split documents
Retrieval
Fetch relevant chunks
Re-ranking
Reorder top results
Context window
Max input tokens
Grounding
Cite source facts
Augmentation
Inject context

LLM Params

Temp=creativity | Top-p=nucleus | Top-k=pool

Temp: randomTop-p: nucleusTop-k: candidates

LoRA vs Full Fine-tune

LoRA

  • Low-rank adapters
  • Few params
  • Cheap train

Full

  • All weights
  • Costly train
  • Max control

Adapters vs all weights

LLM Concepts

Token
Subword unit
Context window
Max input tokens
Temperature
Randomness control
Top-p
Nucleus sampling
Top-k
Top candidates
Hallucination
Unfounded output
Sampling
Probabilistic decode

Agent Bricks vs Framework

Agent Bricks

  • Managed
  • Low code
  • Databricks-hosted

Framework

  • Custom code
  • LangChain
  • Full control

Managed vs custom

Fine-tuning Methods

Full fine-tune
Update all weights
LoRA
Low-rank adapters
QLoRA
Quantized LoRA
Instruction-tune
Follow directions
PEFT
Partial tuning
RLHF
Human feedback
DPO
Preference optimize

Agent Patterns

ReAct
Reason + act
Tool calling
Invoke functions
Planning
Decompose tasks
Reflection
Self-critique
Multi-agent
Coordinate roles
MCP server
Tool protocol
Memory
Persist context

Deploy Gates

Package | Register | Serve | Monitor

Package: pyfuncRegister: MLflowServe: endpointMonitor: drift

Playground vs Model Serving

Playground

  • Interactive test
  • Prompt iterate
  • No endpoint

Model Serving

  • Deploy endpoint
  • Production API
  • Scale traffic

Test vs deploy

Deployment Picker

  1. Track experimentMLflow(Log params)
  2. Register modelMLflow register(Version)
  3. Real-time APIModel Serving(Endpoint)
  4. Async scoringBatch inference(No endpoint)
  5. Semantic searchVector Search(Index)
  6. Usage meteringAI Gateway(Track)
  7. Managed agentsAgent Bricks(Low code)

Databricks Tools

Vector Search
Semantic index
AI Playground
Test prompts
Foundation Model APIs
Hosted base models
MLflow
Track experiments
Unity Catalog
Governance hub
Model Serving
Deploy endpoints
AI Gateway
Usage tracking
Agent Bricks
Managed agents

MLflow vs Unity Catalog

MLflow

  • Track experiments
  • Log params
  • Version models

Unity Catalog

  • Govern data
  • Access control
  • Lineage

Track vs govern

Deployment Patterns

Pyfunc
Python packaging
MLflow register
Version models
Model Serving
Real-time endpoint
Batch inference
Async scoring
CI/CD
Automate release
Persistent state
Save context
Streaming
Real-time input

Governance Stack

UC | Masking | Guardrails | AI Gateway

UC: governMasking: PIIGuardrails: filterGateway: track

Governance Controls

Masking
Hide PII
Guardrails
Input/output filters
Licensing
Model terms
Unity Catalog
Central governance
AI Gateway
Track usage
PII detection
Find sensitive
Audit logging
Track actions

Eval Dimensions

Faithfulness | Relevance | Context | Answer

Faithfulness: groundedRelevance: matchedContext: recallAnswer: quality

Offline vs Online Eval

Offline

  • Pre-deploy
  • Ground truth
  • RAGAS/BLEU

Online

  • Production
  • Live metrics
  • Drift detect

Test vs monitor

Evaluation Metrics

BLEU
N-gram overlap
ROUGE
Recall overlap
RAGAS
RAG eval suite
Human eval
Gold standard
LLM judge
Auto evaluate
Faithfulness
Source-grounded
Groundedness
No hallucination

Common Traps

RAG vs Fine-tune vs Prompt

RAG adds knowledge Fine-tune shifts behavior

Temperature vs Top-p

Temp scales randomness Top-p truncates nucleus

Vector vs Keyword Search

Semantic matches meaning Keyword matches terms

Offline vs Online Eval

Offline needs ground truth Online detects drift

MLflow vs Model Serving

MLflow tracks experiments Serving deploys endpoints

Unity Catalog vs Delta Lake

UC governs access Delta stores tables

Agent Bricks vs Framework

Bricks is managed Framework is custom code

Last Minute

  1. 1.14/14/30/22/8/12
  2. 2.45 Q / 90 min / $200
  3. 3.Pass: 70% (32/45)
  4. 4.RAG = retrieve + augment
  5. 5.LoRA = adapters; QLoRA = quantized
  6. 6.Vector Search = semantic index
  7. 7.MLflow = track; Serving = deploy
  8. 8.Unity Catalog = governance
  9. 9.AI Gateway = usage tracking
  10. 10.Agent Bricks = managed agents
  11. 11.MCP = tool protocol
  12. 12.March 18 2026 = blueprint update
  13. 13.14-day retake; full fee
Same family resources

Explore More Databricks Certifications

Continue into nearby exams from the same family. Each card keeps practice questions, study guides, flashcards, videos, and articles in one place.