3.6 AI/ML and Analytics Services

Key Takeaways

  • Amazon SageMaker is a fully managed platform for building, training, and deploying machine learning models at scale.
  • Amazon Bedrock provides access to foundation models (Claude, Llama, Mistral) for building generative AI applications without managing infrastructure.
  • Amazon Rekognition uses deep learning for image and video analysis — identify objects, faces, text, and activities.
  • Amazon Athena is a serverless interactive query service that lets you analyze data in S3 using standard SQL.
  • Amazon Kinesis processes real-time streaming data from sources like IoT devices, application logs, and clickstreams.
Last updated: March 2026

AI/ML and Analytics Services

The CLF-C02 exam includes questions on AWS AI/ML services and analytics. You need to know what each service does and when to use it — no deep technical knowledge required.

Machine Learning Platform

Amazon SageMaker

A fully managed platform for building, training, and deploying ML models at scale.

FeatureDescription
SageMaker StudioIntegrated development environment for ML
Built-in AlgorithmsPre-built algorithms for common ML tasks
TrainingManaged training infrastructure with automatic model tuning
DeploymentOne-click model deployment to production endpoints
Ground TruthData labeling service with human + ML labeling

When to use: Organizations building custom ML models that need full control over the training and deployment pipeline.

Amazon Bedrock

A fully managed service for building generative AI applications using foundation models from leading AI providers.

FeatureDescription
Foundation ModelsAccess to Claude (Anthropic), Llama (Meta), Mistral, Cohere, Stable Diffusion, and Amazon Titan
CustomizationFine-tune models with your own data
AgentsBuild AI agents that can take actions and interact with your systems
GuardrailsSafety filters and responsible AI controls
Knowledge BasesRAG (Retrieval-Augmented Generation) with your data sources

When to use: Building generative AI applications (chatbots, content generation, summarization) using pre-trained foundation models.

Amazon Q

AWS's AI assistant for business and development:

  • Amazon Q Business — AI assistant that answers questions about your company data
  • Amazon Q Developer — AI coding assistant (code generation, debugging, optimization)

AI Services (Pre-Trained, No ML Expertise Needed)

These services use pre-trained models — no ML expertise required:

ServiceWhat It DoesUse Case
Amazon RekognitionImage and video analysisFace detection, object recognition, content moderation
Amazon TranscribeSpeech to textMeeting transcription, subtitles, call center analytics
Amazon PollyText to speechRead content aloud, accessibility, IVR systems
Amazon TranslateLanguage translationReal-time translation, website localization
Amazon ComprehendNatural language processing (NLP)Sentiment analysis, entity extraction, topic modeling
Amazon LexConversational interfaces (chatbots)Virtual agents, chatbots (powers Alexa)
Amazon TextractExtract text from documentsInvoice processing, form digitization
Amazon KendraIntelligent enterprise searchSearch across documents, FAQs, knowledge bases
Amazon PersonalizeReal-time personalizationProduct recommendations, personalized content
Amazon ForecastTime-series forecastingDemand forecasting, financial planning

On the Exam: Know the one-line description for each AI service. Common pattern: "A company wants to [add subtitles / detect faces / translate text / build a chatbot]" — match to the right service.


Analytics Services

Amazon Athena

A serverless interactive query service that analyzes data directly in Amazon S3 using standard SQL.

  • No infrastructure to manage — just point at your S3 data and query
  • Pay per query — charged based on amount of data scanned
  • Supports CSV, JSON, Parquet, ORC formats
  • Use case: Ad-hoc queries, log analysis, quick data exploration

Amazon Kinesis

Real-time data streaming platform with four services:

ServicePurpose
Kinesis Data StreamsCollect and process real-time streaming data
Kinesis Data FirehoseLoad streaming data into S3, Redshift, etc.
Kinesis Data AnalyticsAnalyze streaming data with SQL or Apache Flink
Kinesis Video StreamsProcess and analyze video streams

Amazon EMR (Elastic MapReduce)

A managed cluster platform for running big data frameworks like Apache Hadoop, Apache Spark, and Presto at scale.

Amazon QuickSight

A serverless business intelligence (BI) service that creates interactive dashboards and visualizations.

AWS Glue

A serverless data integration service for ETL (Extract, Transform, Load). Discovers, prepares, and combines data for analytics.

FeatureDescription
Glue Data CatalogCentral metadata repository for all data assets
Glue CrawlersAutomatically discover data and populate the catalog
Glue ETL JobsTransform data between formats and destinations

AWS Lake Formation

Simplifies the creation and management of a data lake on S3 with fine-grained security controls.

Analytics Quick Reference

NeedService
Query data in S3 with SQLAthena
Real-time streaming dataKinesis
Big data processing (Hadoop/Spark)EMR
Business intelligence dashboardsQuickSight
ETL / data preparationGlue
Data warehouse queriesRedshift
Create a data lakeLake Formation
Test Your Knowledge

Which AWS service provides access to foundation models from multiple AI providers for building generative AI applications?

A
B
C
D
Test Your Knowledge

A company wants to automatically add subtitles to their video content. Which AWS service should they use?

A
B
C
D
Test Your Knowledge

Which AWS service lets you run SQL queries directly against data stored in Amazon S3 without loading it into a database first?

A
B
C
D
Test Your Knowledge

A company needs to process and analyze data from thousands of IoT sensors in real time. Which AWS service is best suited?

A
B
C
D
Test Your Knowledge

Which AWS AI service uses deep learning to analyze images and detect objects, faces, and text in photos and videos?

A
B
C
D