7.3 Analytics Services — Athena, OpenSearch, and QuickSight

Key Takeaways

  • Amazon Athena queries S3 data directly using SQL without loading data into a database — serverless, pay per query (\$5 per TB scanned).
  • Amazon OpenSearch (successor to Elasticsearch) provides full-text search, log analytics, and real-time application monitoring with dashboards.
  • Amazon QuickSight is a serverless BI tool that creates interactive dashboards and visualizations from multiple data sources.
  • Use Athena for ad-hoc SQL queries on S3 data lakes; OpenSearch for full-text search and log analytics; QuickSight for business intelligence dashboards.
  • Athena performance improves with columnar formats (Parquet, ORC) and partitioning — reducing scanned data reduces cost.
Last updated: March 2026

Analytics Services — Athena, OpenSearch, and QuickSight

Quick Answer: Athena = SQL queries on S3 (serverless, pay per scan). OpenSearch = full-text search + log analytics + dashboards. QuickSight = serverless BI dashboards. Use Athena for ad-hoc S3 queries, OpenSearch for log/search use cases, QuickSight for business reporting.

Amazon Athena

Athena is a serverless, interactive query service that analyzes data in S3 using standard SQL.

FeatureDetail
ServerlessNo infrastructure to manage
Query languageStandard SQL (Presto engine)
Data sourceS3 (CSV, JSON, Parquet, ORC, Avro)
Cost$5 per TB of data scanned
IntegrationUses Glue Data Catalog for schemas
Federated queryQuery data in RDS, DynamoDB, and other sources

Athena Cost Optimization

TechniqueImpact
Use columnar formats (Parquet, ORC)30-90% less data scanned
Partition dataSkip scanning irrelevant partitions (e.g., by date)
Compress dataLess data = less scanning = less cost
Use LIMITLimits data processed in simple queries

When to Use Athena

  • Ad-hoc queries on data in S3
  • Query CloudTrail logs, VPC Flow Logs, ELB access logs
  • Data exploration in a data lake
  • One-time analysis without setting up a database

Amazon OpenSearch Service

OpenSearch (successor to Amazon Elasticsearch Service) provides search, log analytics, and real-time application monitoring.

FeatureDetail
SearchFull-text search with relevance scoring
AnalyticsLog analytics, trace analytics, metrics
DashboardsBuilt-in OpenSearch Dashboards (Kibana successor)
ManagedAutomated patching, backups, monitoring
ScalingUp to 3 PB of data, horizontal scaling
ServerlessOpenSearch Serverless option (auto-scaling)

Common OpenSearch Architecture

CloudWatch Logs → Subscription Filter → Lambda → OpenSearch
VPC Flow Logs → Kinesis Data Firehose → OpenSearch
Application Logs → Kinesis Data Firehose → OpenSearch

On the Exam: "Search through application logs and create dashboards" → OpenSearch. "Run SQL queries on data in S3" → Athena.

Amazon QuickSight

QuickSight is a serverless business intelligence (BI) service for creating visualizations and dashboards.

FeatureDetail
ServerlessNo servers to manage
Data sourcesS3, RDS, Redshift, Athena, DynamoDB, on-premises databases
SPICEIn-memory engine for fast dashboard performance
ML InsightsAnomaly detection, forecasting, natural language queries
EmbeddingEmbed dashboards in applications
Per-session pricingPay per user session (cost-effective for occasional use)

Analytics Service Comparison

ServiceTypeBest For
AthenaAd-hoc SQL on S3Data lake queries, log analysis
OpenSearchFull-text search + analyticsLog analytics, search applications
QuickSightBI dashboardsBusiness reporting, visualizations
RedshiftData warehouseComplex analytics on structured data
EMRBig data processingSpark, Hadoop, large-scale ETL
Kinesis Data AnalyticsReal-time SQL/FlinkStreaming data analytics
Test Your Knowledge

A data analyst needs to run SQL queries on CSV files stored in S3 without setting up any infrastructure. Which service should they use?

A
B
C
D
Test Your Knowledge

How can you reduce the cost of Amazon Athena queries?

A
B
C
D