2.1 Core Machine Learning Concepts
Key Takeaways
- Machine learning is a subset of AI where models learn patterns from data — the model improves with more data, not more explicit programming.
- Features are the input variables used for prediction (e.g., square footage, number of bedrooms); labels are the output values the model predicts (e.g., house price).
- Training data teaches the model patterns; validation data tunes the model; test data evaluates final performance on unseen data.
- Supervised learning uses labeled data (features + known labels); unsupervised learning discovers patterns in unlabeled data.
- The machine learning workflow follows: collect data → prepare data → train model → evaluate model → deploy model → monitor model.
Core Machine Learning Concepts
Quick Answer: Machine learning trains models on data to learn patterns and make predictions without explicit programming. Key concepts include features (inputs), labels (outputs), training data (for learning), validation data (for tuning), and test data (for evaluation). Supervised learning uses labeled data; unsupervised learning finds patterns in unlabeled data.
What Is Machine Learning?
Machine learning (ML) is a subset of artificial intelligence where computer systems learn from data and improve their performance over time without being explicitly programmed for every possible scenario. Instead of writing rules like "if temperature > 100 then alert", you provide examples of normal and abnormal temperatures and let the model learn the boundary.
Traditional Programming vs. Machine Learning
| Approach | Input | Process | Output |
|---|---|---|---|
| Traditional programming | Data + Rules | Execute rules on data | Results |
| Machine learning | Data + Expected Results | Learn rules from data | Model (rules) |
In traditional programming, you write the rules. In machine learning, the algorithm discovers the rules by analyzing patterns in the data.
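The contrast above can be sketched in a few lines of Python. This is a hypothetical toy, not a real ML algorithm: instead of hand-writing "if temperature > 100 then alert", we let the program pick the alert threshold that best separates labeled examples.

```python
# Labeled training examples: (temperature, is_abnormal)
readings = [(72, False), (85, False), (98, False),
            (101, True), (110, True), (120, True)]

def learn_threshold(examples):
    """Pick the candidate threshold that classifies the training data best."""
    best_threshold, best_correct = None, -1
    for candidate, _ in examples:
        # Count how many examples the rule "temp > candidate" gets right
        correct = sum((temp > candidate) == label for temp, label in examples)
        if correct > best_correct:
            best_threshold, best_correct = candidate, correct
    return best_threshold

threshold = learn_threshold(readings)
print(threshold)  # → 98: the boundary is learned from data, not hand-written
```

With more (or different) labeled readings, the learned boundary shifts automatically — no one has to rewrite the rule.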
Features and Labels
Understanding features and labels is essential for the AI-900 exam:

Features (also called attributes or input variables) are the characteristics of the data that the model uses to make predictions. Think of features as the "questions" the model considers.
Labels (also called target variables or output variables) are the values the model tries to predict. Think of labels as the "answers" the model produces.
| Scenario | Features (Inputs) | Label (Output) |
|---|---|---|
| House price prediction | Square footage, bedrooms, location, age | Price ($) |
| Email spam detection | Subject line, sender, body text, links | Spam / Not Spam |
| Customer churn prediction | Account age, usage, complaints, payments | Churn / Stay |
| Medical diagnosis | Symptoms, test results, age, medical history | Disease / No Disease |
| Temperature forecasting | Date, location, humidity, wind speed | Temperature (°F) |
On the Exam: When a question describes a dataset with input columns and an output column, the input columns are features and the output column is the label. If the question says "predict the price based on size, location, and age" — size, location, and age are features; price is the label.
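The feature/label split can be made concrete with a small sketch. The column names and values here are hypothetical; the point is that every column except the one being predicted is a feature.

```python
# Hypothetical house-price rows: input columns are features, "price" is the label
rows = [
    {"sqft": 1500, "bedrooms": 3, "age": 10, "price": 300000},
    {"sqft": 2200, "bedrooms": 4, "age": 5,  "price": 450000},
]

label_column = "price"

# Everything that is not the label column is a feature
features = [{k: v for k, v in row.items() if k != label_column} for row in rows]
labels = [row[label_column] for row in rows]

print(features[0])  # → {'sqft': 1500, 'bedrooms': 3, 'age': 10}
print(labels)       # → [300000, 450000]
```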
Training, Validation, and Test Data
Machine learning projects typically split the data into three subsets:
Training Data (typically 60-80% of all data)
- Used to teach the model patterns
- The model adjusts its internal parameters based on this data
- Larger training sets generally produce better models
Validation Data (typically 10-20% of all data)
- Used to tune the model during training
- Helps prevent overfitting by evaluating the model on data it has not trained on
- Used to select the best model configuration (hyperparameters)
Test Data (typically 10-20% of all data)
- Used to evaluate final model performance on completely unseen data
- Provides an unbiased estimate of how the model will perform in production
- Only used AFTER the model is fully trained and tuned
Total Dataset
├── Training Data (70%) → Model learns patterns here
├── Validation Data (15%) → Model is tuned here
└── Test Data (15%) → Final evaluation here
On the Exam: The key distinction is that training data teaches the model, validation data tunes it, and test data provides the final unbiased evaluation. A question might ask which dataset is used to prevent overfitting (validation) or which provides an unbiased performance estimate (test).
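The 70/15/15 split above can be sketched as a short function. The fractions and seed are illustrative choices, not fixed rules — real projects pick proportions to suit their data size.

```python
import random

def split_dataset(data, train_frac=0.70, val_frac=0.15, seed=42):
    """Shuffle and split data into training, validation, and test subsets."""
    shuffled = data[:]
    random.Random(seed).shuffle(shuffled)  # shuffle a copy, reproducibly
    n_train = int(len(shuffled) * train_frac)
    n_val = int(len(shuffled) * val_frac)
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # the remainder
    return train, val, test

train, val, test = split_dataset(list(range(100)))
print(len(train), len(val), len(test))  # → 70 15 15
```

Shuffling before splitting matters: if the data is sorted (say, by date or price), an unshuffled split would give the model a biased view of the problem.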
Overfitting and Underfitting
| Problem | Description | Symptom | Solution |
|---|---|---|---|
| Overfitting | Model memorizes training data instead of learning general patterns | Great on training data, poor on new data | More data, simpler model, regularization |
| Underfitting | Model is too simple to capture patterns in the data | Poor on both training and new data | More complex model, more features, more training |
| Good fit | Model learns general patterns that apply to new data | Good performance on both training and new data | The goal of ML |
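Overfitting can be caricatured with a deliberately extreme toy: a "model" that memorizes its training pairs exactly versus one that learns a single general rule. The data (y = 2x) is made up for illustration.

```python
train_data = [(1, 2), (2, 4), (3, 6)]
test_data = [(4, 8), (5, 10)]

# "Overfit" model: pure memorization — perfect on training data only
memorized = dict(train_data)
overfit = lambda x: memorized.get(x)  # returns None for unseen inputs

# Simpler model: learn one slope from the training pairs
slope = sum(y / x for x, y in train_data) / len(train_data)  # 2.0
general = lambda x: slope * x

print([overfit(x) for x, _ in test_data])  # → [None, None]: memorization fails on new data
print([general(x) for x, _ in test_data])  # → [8.0, 10.0]: the general pattern transfers
```

Real overfitting is subtler than a lookup table, but the symptom in the table above is the same: excellent training performance that does not carry over to new data.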
Supervised vs. Unsupervised Learning
Supervised Learning
The model learns from labeled data — data where both features and labels are known. The "supervision" comes from the labeled examples that guide the learning process.
Types of supervised learning:
- Regression — predict continuous numerical values (price, temperature, quantity)
- Classification — predict discrete categories (spam/not spam, disease/no disease)
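The regression/classification distinction is about the label type, not the algorithm. As a hedged sketch, the same nearest-neighbor idea (a real but simplified technique — here 1-nearest-neighbor on a single made-up feature) serves both: regression returns a number, classification returns a category.

```python
def nearest(train, x):
    """Return the label of the training example whose feature is closest to x."""
    return min(train, key=lambda pair: abs(pair[0] - x))[1]

# Regression: the label is a continuous value (hypothetical price in $1000s, by sqft)
price_data = [(1000, 200), (1500, 300), (2000, 410)]
print(nearest(price_data, 1600))  # → 300, a numeric prediction

# Classification: the label is a discrete category (hypothetical link count → spam)
spam_data = [(0, "not spam"), (5, "spam"), (8, "spam")]
print(nearest(spam_data, 6))  # → 'spam', a category prediction
```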
Unsupervised Learning
The model discovers patterns in unlabeled data — data where only features are known, with no predefined labels. The model finds natural groupings or structures in the data.
Types of unsupervised learning:
- Clustering — group similar items together (customer segments, document topics)
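Clustering can be sketched with a minimal 1-D k-means loop (k-means is one common clustering algorithm, chosen here for brevity; the starting centers and spending values are made up). Note that no labels appear anywhere — the algorithm discovers the groups on its own.

```python
def kmeans_1d(values, centers, iterations=10):
    """Tiny 1-D k-means: assign each value to its nearest center, then recenter."""
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for v in values:
            closest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[closest].append(v)
        # Move each center to the mean of its cluster (keep it if the cluster is empty)
        centers = [sum(c) / len(c) if c else centers[i] for i, c in enumerate(clusters)]
    return centers, clusters

# Unlabeled monthly spending amounts: two customer segments are hidden in the data
spend = [10, 12, 11, 95, 100, 98]
centers, clusters = kmeans_1d(spend, centers=[0.0, 50.0])
print(sorted(round(c, 1) for c in centers))  # → [11.0, 97.7]: two segments found
```

Because there are no known labels, evaluation asks whether the clusters are compact and well separated — the "Evaluation" row in the comparison table below.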
Comparison
| Aspect | Supervised Learning | Unsupervised Learning |
|---|---|---|
| Data | Labeled (features + labels) | Unlabeled (features only) |
| Goal | Predict known output categories | Discover hidden patterns |
| Types | Regression, Classification | Clustering |
| Example | Predict house price from features | Group customers by behavior |
| Evaluation | Compare predictions to known labels | Assess cluster quality and separation |
The Machine Learning Workflow
The end-to-end ML process follows these steps:
1. Define the problem — What business question are you trying to answer?
2. Collect data — Gather relevant data from databases, APIs, files
3. Prepare data — Clean, transform, handle missing values, select features
4. Split data — Divide into training, validation, and test sets
5. Choose an algorithm — Select the appropriate ML algorithm for the task
6. Train the model — Feed training data to the algorithm
7. Evaluate the model — Test performance on validation and test data
8. Tune the model — Adjust hyperparameters to improve performance
9. Deploy the model — Publish as an endpoint for applications to consume
10. Monitor the model — Track performance and retrain when accuracy degrades
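The early steps of this workflow can be walked through end to end with a deliberately tiny model — a line through the origin fit by least squares. The data is synthetic (an assumption for illustration), and validation, tuning, deployment, and monitoring are noted but omitted for brevity.

```python
import random

# Define the problem and collect data: predict y from x (true pattern: y ≈ 2x plus noise)
data = [(x, 2 * x + random.Random(x).uniform(-1, 1)) for x in range(100)]

# Prepare and split the data: 80% training, 20% held-out test
random.Random(0).shuffle(data)
train, test = data[:80], data[80:]

# Choose an algorithm and train it: least-squares slope through the origin
slope = sum(x * y for x, y in train) / sum(x * x for x, _ in train)

# Evaluate on the unseen test data: mean absolute error
mae = sum(abs(y - slope * x) for x, y in test) / len(test)
print(round(slope, 2))  # the learned slope should land close to 2

# In a real pipeline, tuning (on validation data), deployment as an endpoint,
# and ongoing monitoring/retraining would follow.
```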
In a machine learning model that predicts house prices based on square footage, number of bedrooms, and location, what are the "features"?
Which dataset is used to provide an unbiased evaluation of a fully trained machine learning model?
A machine learning model performs very well on training data but poorly on new, unseen data. What problem is this called?
Which type of learning uses labeled data where both features and outcomes are known?
Match each machine learning concept to its definition: