Skip to main content

In-scope AWS services and features

The following list contains AWS services and features that are in scope for the exam. This list is non-exhaustive and is subject to change. AWS offerings appear in categories that align with the offerings’ primary functions:

Machine Learning

Amazon Textract (OCR)

What it is:
Amazon Textract extracts printed and handwritten text, tables, and forms from scanned documents using OCR (Optical Character Recognition).

Typical Use Cases:

  • Automating form processing (e.g., tax, insurance)
  • Digitizing PDFs and scanned documents
  • Extracting structured data for analysis

Learn more

Amazon Comprehend

What it is:
Amazon Comprehend is a Natural Language Processing (NLP) service that uses ML to uncover insights from text — like identifying entities, language, sentiment, and key phrases.

Typical Use Cases:

  • Analyzing customer feedback
  • Tagging documents automatically
  • Detecting personally identifiable information (PII)

Learn more

Amazon Transcribe (STT)

What it is:
Amazon Transcribe converts audio into accurate, readable text using ASR (Automatic Speech Recognition). It supports real-time and batch transcription.

Typical Use Cases:

  • Meeting transcriptions
  • Voice command logging
  • Subtitles for audio/video content

Learn more

Amazon Polly (TTS)

What it is:
Amazon Polly converts text into natural-sounding human speech using advanced deep learning technologies. It supports dozens of languages and voice styles.

Typical Use Cases:

  • Reading text aloud for accessibility
  • Creating voice responses for chatbots
  • Generating audio for training content or news

Learn more

Amazon Translate

What it is:
Amazon Translate is a neural machine translation service that allows real-time and batch translation between dozens of languages.

Typical Use Cases:

  • Multilingual chat applications
  • Document localization
  • Translating user-generated content

Learn more

Amazon Lex (ASR & NLU)

What it is:
Amazon Lex is a service for building conversational interfaces using voice and text — similar to how Alexa works. It combines Automatic Speech Recognition (ASR) and Natural Language Understanding (NLU).

Typical Use Cases:

  • Customer support chatbots
  • Voice-enabled apps and IVRs
  • Automated service desks

Learn more

Amazon Fraud Detector

What it is:
Amazon Fraud Detector helps detect potentially fraudulent activities in real time using pre-built ML models tailored to fraud detection scenarios.

Typical Use Cases:

  • Identifying suspicious online account signups
  • Flagging fraudulent payment attempts
  • Detecting identity theft in transactions

Learn more

Amazon Personalize (Recommendation)

What it is:
Amazon Personalize is a real-time recommendation engine that creates personalized user experiences using your own data — no ML experience required.

Typical Use Cases:

  • Personalized product recommendations
  • Video or music streaming suggestions
  • Content ranking based on user behavior

Learn more

Amazon Rekognition (Computer Vision)

What it is:
Amazon Rekognition is a computer vision service that uses deep learning to analyze images and videos. It can detect objects, scenes, faces, text, and inappropriate content, and also supports facial analysis and facial recognition.

Typical Use Cases:
  • Facial recognition for user verification or security
  • Content moderation for images and videos
  • Detecting objects and scenes in media assets
  • Analyzing sentiment or demographics from facial attributes

Learn more

Amazon SageMaker

What it is:
Amazon SageMaker is a comprehensive platform to build, train, tune, and deploy custom machine learning models. It supports everything from data prep to production deployment.

Typical Use Cases:

  • Training deep learning models (e.g., NLP, vision)
  • Hosting and serving models at scale
  • Creating MLOps pipelines
What is it?

Amazon SageMaker Canvas is a no-code tool that enables users to build accurate ML models without any ML expertise.

Key Features:

  • Drag-and-drop interface with no coding required.
  • Access ready-to-use foundation models from Amazon Bedrock and SageMaker JumpStart.
  • Build custom ML models using AutoML powered by SageMaker AutoPilot.

Typical Use Cases:

  • Empower business analysts to create predictive models.
  • Rapidly prototype and test ML use cases without engineering help.
  • Automate model building and deployment workflows.

Why it matters:
It democratizes ML by making model creation accessible to non-technical users.

Learn more

Amazon Q

What it is:
Amazon Q is a generative AI assistant embedded within the AWS ecosystem. It helps developers and IT teams understand AWS services, generate code, and troubleshoot infrastructure using natural language.

Typical Use Cases:

  • Explaining AWS concepts and CLI commands
  • Generating infrastructure-as-code (e.g., CloudFormation)
  • Helping users navigate AWS Console faster
What is it?
Amazon Q Business allows employees to ask natural language questions and receive accurate answers based on internal company data.

Key Features:

  • Connects to data sources such as SharePoint, Confluence, Salesforce, Slack, S3, and more.
  • Uses Retrieval-Augmented Generation (RAG) to ground answers in your organization’s documents.
  • Maintains enterprise security by respecting identity and access permissions.

Typical Use Cases:

  • Ask: “What is our company’s refund policy?” and get a direct answer from internal PDFs or wikis.
  • Help HR, finance, and operations teams self-serve without IT intervention.
  • Analyze and summarize knowledge spread across internal systems.

Why it matters:
It enables secure, company-specific knowledge access for non-technical employees without needing custom AI development.

Learn more

Amazon Kendra (Intelligent Search Engine)
  • What it is: An intelligent enterprise search engine with natural language support.
  • Use Case: Enterprise document search, FAQ chatbots.

Learn more

Amazon A2I (Augmented AI)

What it is:
Amazon A2I (Augmented AI) helps you build workflows that include human review of ML predictions. It’s especially useful when ML confidence is low or when regulatory compliance requires human checks.

Typical Use Cases:

  • Reviewing document processing results (e.g., from Textract)
  • Moderating sensitive content flagged by Rekognition
  • Validating NLP classification outputs

Learn more

Amazon Bedrock

What it is:
Amazon Bedrock is a serverless platform that allows you to build and scale generative AI applications using foundation models (FMs) from leading providers (Anthropic, Meta, Cohere, etc.) — all without managing infrastructure.

Typical Use Cases:

  • Building chatbots, text summarizers, or content generators
  • Retrieval-Augmented Generation (RAG) via Knowledge Bases
  • Language translation, classification, and embedding generation

Learn more

Database

Amazon DocumentDB (NoSQL document-oriented database)

Type: NoSQL (document-oriented), MongoDB-compatible.

Use Cases: Content management, catalogs, user profiles, flexible JSON data storage.

AI Context: Useful for storing semi-structured data (JSON) used by AI applications.

Vector DB Suitability:Not ideal

Learn more

Amazon DynamoDB (NoSQL key-value store)

Type: Serverless NoSQL (key-value store), providing high availability and low latency.

Use Cases: High-traffic apps, session management, metadata storage.

AI Context: Suitable for real-time AI workloads like metadata management or recommendation engines.

Vector DB Suitability:Not ideal

Learn more

Amazon ElastiCache (in-memory data store)

Type: Managed in-memory data store (Redis/Memcached).

Use Cases: Real-time caching, session storage, leaderboards, real-time analytics.

AI Context: Ideal for caching model inference outputs and providing rapid data retrieval.

Vector DB Suitability:Not ideal

Learn more

Amazon MemoryDB (in-memory Redis-compatible database)

Type: Durable, in-memory Redis-compatible database.

Use Cases: Real-time transactional workloads requiring high performance and durability.

AI Context: Excellent for real-time AI applications with stringent speed and durability requirements.

Vector DB Suitability:Not ideal

Learn more

Amazon Neptune (Managed graph database) ✅ Support VectorDB

Type: Managed graph database (supports Gremlin, SPARQL).

Use Cases: Fraud detection, recommendation systems, knowledge graphs, relationship analysis.

AI Context: Powerful for AI scenarios involving connected data, semantic searches, and graph analytics.

Vector DB Suitability:Possible (Graph-based vector search).

Learn more

Amazon RDS (Managed relational databases) ✅ Support VectorDB

Type: Managed relational databases (MySQL, PostgreSQL, Oracle, SQL Server).

Use Cases: Traditional applications, structured transactional systems (ERP, CRM), structured data analytics.

AI Context: Ideal for structured, relational data usage in AI contexts.

Vector DB Suitability:Possible (via PostgreSQL with pgvector extension, moderate-scale recommended).

Learn more

Amazon Aurora (Managed relational databases) ✅ Support VectorDB

Type: High-performance managed relational database compatible with MySQL and PostgreSQL.

Use Cases: Enterprise-grade applications, highly scalable transactional workloads, analytics.

AI Context: Good choice for structured relational data requiring high throughput, performance, and reliability in AI workloads.

Vector DB Suitability:Possible (using PostgreSQL-compatible Aurora with the pgvector extension), suitable for moderate vector-search scenarios but not optimized for extensive vector workloads.

Learn more

Storage

Amazon S3

What it is:
Amazon S3 (Simple Storage Service) is AWS’s object storage service designed to store and retrieve any amount of data from anywhere on the web.

Why it matters:

  • It's scalable, durable (99.999999999%), and cost-effective
  • Frequently used to store training datasets, model outputs, logs, and documents
  • Integrates seamlessly with services like SageMaker, Bedrock, and Lambda

Typical Use Cases:

  • Storing datasets for AI/ML model training
  • Hosting website files or media assets
  • Saving logs and predictions from AI pipelines
  • Backup and recovery of application data

Learn more

Amazon S3 Glacier

What it is:
Amazon S3 Glacier is a low-cost storage service for data archiving and long-term backup. It is designed for data that is infrequently accessed but must be retained securely for years.

Why it matters:

  • Ideal for archiving training datasets or compliance logs
  • Offers different retrieval speeds (minutes to hours)
  • Cost-effective for storing AI/ML data not actively used

Typical Use Cases:

  • Archiving large ML datasets not currently in use
  • Storing compliance and audit data for AI projects
  • Backing up AI-generated reports, logs, and checkpoints

Learn more

Management and Governance

AWS CloudTrail

What it is:
CloudTrail is a service that records all API calls and actions made in your AWS account, including who made the call, what services were affected, and when.

Why it matters:

  • Provides an audit trail for all changes and activities
  • Helps you detect suspicious behavior or unauthorized access
  • Useful for compliance reporting and forensic analysis

Typical Use Cases:

  • Investigating security incidents (e.g., who deleted a resource?)
  • Monitoring access to sensitive services (e.g., S3, IAM, SageMaker)
  • Setting up alarms on critical changes

Learn more

AWS CloudWatch

What it is:
CloudWatch is AWS’s central monitoring service for metrics, logs, and alarms. It collects and tracks data from AWS services and custom sources.

Why it matters:

  • Helps you visualize performance (CPU, memory, latency, etc.)
  • Allows you to set alarms and get notified when something goes wrong
  • Enables automated actions (e.g., restarting instances)

Typical Use Cases:

  • Monitoring model performance or resource usage in SageMaker
  • Setting alerts on Lambda failures or high error rates
  • Creating dashboards for your application’s health

Learn more

AWS Config

What it is:
AWS Config is a resource compliance and configuration tracking service. It monitors changes to AWS resources and evaluates them against predefined rules.

Why it matters:

  • Provides a timeline of resource changes
  • Ensures your environment adheres to security and compliance policies
  • Supports automatic remediation of non-compliant resources

Typical Use Cases:

  • Checking if S3 buckets are publicly accessible
  • Tracking IAM policy changes
  • Auditing the history of ML model versions or endpoints

Learn more

AWS Trusted Advisor

What it is:
Trusted Advisor is a service that scans your AWS environment and gives recommendations to help improve performance, security, fault tolerance, and cost optimization.

Why it matters:

  • Highlights security vulnerabilities (e.g., open ports, weak IAM policies)
  • Identifies unused resources to reduce cost
  • Suggests best-practice improvements

Typical Use Cases:

  • Checking for over-provisioned EC2/SageMaker instances
  • Ensuring MFA is enabled for root accounts
  • Finding unused EBS volumes or idle load balancers

Learn more

AWS Well-Architected Tool

What it is:
This is a self-assessment tool that helps you review and improve your architecture based on the AWS Well-Architected Framework, which includes 6 pillars (Operational Excellence, Security,Reliability, Cost Optimization, Performance Efficiency, Sustainability).

Why it matters:

  • Provides a structured review of your architecture
  • Helps you identify risks and improvement areas
  • Guides you in building resilient and efficient applications

Typical Use Cases:

  • Assessing your ML/AI solution before production
  • Aligning your architecture with AWS best practices
  • Comparing designs across multiple workloads or teams

Learn more

Security, Identity, and Compliance

AWS Artifact

What it is:
AWS Artifact is your central hub for AWS compliance reports and certifications, such as SOC, ISO, and PCI. It provides downloadable documents to help with audits and legal assessments.

Why it matters:

  • Gives easy access to AWS compliance documentation
  • Helps meet regulatory and customer requirements
  • Supports internal and external audit processes

Typical Use Cases:

  • Sharing SOC 2 reports with auditors
  • Collecting evidence for compliance assessments
  • Validating AWS compliance for your organization

Learn more

AWS Audit Manager

What it is:
AWS Audit Manager helps automate the collection of audit evidence by mapping AWS usage data to compliance frameworks such as GDPR, HIPAA, and ISO.

Why it matters:

  • Reduces the manual effort in audit preparation
  • Continuously tracks compliance posture
  • Helps demonstrate control effectiveness

Typical Use Cases:

  • Automating SOC 2 evidence collection
  • Mapping AWS usage to GDPR controls
  • Monitoring compliance for AI/ML pipelines

Learn more

AWS IAM (Identity and Access Management)

What it is:
IAM is AWS's core access control service, enabling you to create users, groups, roles, and policies to securely manage access to AWS services and resources.

Why it matters:

  • Enforces least privilege across your organization
  • Provides fine-grained access controls for AI/ML services
  • Supports secure role-based delegation

Typical Use Cases:

  • Allowing SageMaker to read data from S3
  • Creating service roles for Bedrock or Lambda
  • Enforcing MFA and managing user permissions

Learn more

Amazon Inspector

What it is:
Amazon Inspector is an automated vulnerability scanning tool for EC2, container images, and Lambda functions. It continuously checks for known security issues.

Why it matters:

  • Helps protect applications from known vulnerabilities
  • Automates security checks in DevSecOps pipelines
  • Sends real-time findings to Security Hub or CloudWatch

Typical Use Cases:

  • Scanning inference EC2s or Lambda functions
  • Securing SageMaker endpoints
  • Identifying CVEs in Docker images

Learn more

AWS KMS (Key Management Service)

What it is:
AWS KMS is a managed service for creating and controlling encryption keys used to secure your data across AWS services.

Why it matters:

  • Enables encryption-at-rest and in-transit
  • Supports customer-managed key (CMK) creation
  • Logs key usage via CloudTrail for auditing

Typical Use Cases:

  • Encrypting training datasets in S3
  • Managing key rotation for AI/ML environments
  • Protecting secrets and database credentials

Learn more

Amazon Macie

What it is:
Amazon Macie is a data security and privacy service that uses ML to discover, classify, and protect sensitive data such as personally identifiable information (PII) stored in Amazon S3.

Why it matters:

  • Identifies sensitive data like names, addresses, and credit card numbers
  • Alerts you to publicly accessible or misconfigured S3 buckets
  • Helps meet privacy regulations like GDPR and HIPAA

Typical Use Cases:

  • Scanning training datasets for sensitive content
  • Auditing AI data lakes for PII
  • Automatically flagging non-compliant storage configurations

Learn more

AWS Secrets Manager

What it is:
Secrets Manager helps you store, retrieve, and rotate secrets (e.g., database credentials, API keys, tokens) securely in your applications.

Why it matters:

  • Keeps secrets out of code
  • Supports automatic rotation of credentials
  • Provides fine-grained IAM access to secrets

Typical Use Cases:

  • Managing API keys for AI services
  • Storing database credentials used in SageMaker pipelines
  • Rotating secrets used in Lambda or Bedrock functions

Learn more

Cloud Financial Management

AWS Budgets

What it is:
AWS Budgets is a cost management tool that allows you to set custom budgets and receive alerts when your usage or spending exceeds thresholds.

Why it matters:

  • Helps prevent unexpected cost overruns in AI/ML workloads
  • Supports budgeting for specific services, linked accounts, or projects
  • Allows email or SNS alerts when nearing or exceeding your budget

Typical Use Cases:

  • Monitoring SageMaker or Bedrock cost usage
  • Creating budgets per department or project
  • Alerting finance or DevOps teams when limits are exceeded

Learn more

AWS Cost Explorer

What it is:
AWS Cost Explorer is a visual tool for analyzing and tracking your AWS spending over time. It lets you explore usage by service, region, account, or tag.

Why it matters:

  • Enables granular analysis of cloud costs
  • Identifies trends, anomalies, and opportunities for savings
  • Helps teams understand the cost impact of AI/ML workloads

Typical Use Cases:

  • Visualizing SageMaker and GPU usage cost trends
  • Identifying unused resources contributing to waste
  • Tagging AI/ML workloads to understand specific project spend

Learn more

Compute

Amazon EC2 (Elastic Compute Cloud)

What it is:
Amazon EC2 provides resizable virtual machines (instances) in the cloud. You can choose from a wide range of instance types optimized for compute, memory, storage, or GPU-based workloads.

Why it matters:

  • Offers full control over compute resources
  • Supports AI/ML training and inference using GPU instances
  • Scales from small experiments to high-performance distributed training

Typical Use Cases:

  • Running deep learning frameworks like TensorFlow or PyTorch on GPU instances
  • Hosting custom-trained ML models for inference
  • Performing large-scale simulations or model training
What is it?

AWS Trainium instances use a custom-designed machine learning chip engineered for high performance with low power consumption, reducing the carbon footprint of training large-scale models.

Key Features:

  • Up to 25% more energy efficient than comparable accelerated computing EC2 instances.
  • Specifically designed for optimal performance per watt for deep learning workloads.
  • Lowers environmental impact compared to other instance types.

Typical Use Cases:

  • Large-scale deep learning training.
  • Organizations prioritizing sustainability and energy efficiency in AI workloads.

Why it matters:
They are the most environmentally friendly choice, helping companies meet sustainability goals while training complex models.

Learn more

Networking and Content Delivery

Containers

Amazon ECS (Elastic Container Service)

What it is:
Amazon ECS is a fully managed container orchestration service that allows you to run and scale Docker containers on AWS without needing to manage your own servers.

Why it matters:

  • Simplifies containerized application deployment
  • Supports Fargate for serverless containers (no instance management)
  • Integrates with SageMaker, Lambda, and other AI pipelines

Typical Use Cases:

  • Running microservices for AI model inference
  • Hosting REST APIs that wrap around ML models
  • Scaling backend services that preprocess ML input data

Learn more

Amazon EKS (Elastic Kubernetes Service)

What it is:
Amazon EKS is a fully managed Kubernetes service that lets you run Kubernetes clusters on AWS without manually configuring the control plane.

Why it matters:

  • Provides more flexibility and portability than ECS
  • Allows you to run AI/ML workloads using K8s-native tools (e.g., Kubeflow, MLflow)
  • Scales AI model serving and training pipelines using Kubernetes best practices

Typical Use Cases:

  • Running ML pipelines with Kubeflow on Kubernetes
  • Managing multi-step model training and deployment workflows
  • Hosting AI microservices using containers

Learn more

Analytics

AWS Data Exchange

What it is:
AWS Data Exchange makes it easy to find, subscribe to, and use third-party datasets in the cloud, such as demographics, weather, or financial data.

Why it matters:

  • Enables external data integration for AI/ML models
  • Automates data subscription, delivery, and updates
  • Helps enhance model accuracy with premium datasets

Typical Use Cases:

  • Enriching ML models with weather or location data
  • Using healthcare or financial datasets from third parties
  • Automating ingestion of licensed datasets into S3 or Redshift

Learn more

Amazon EMR

What it is:
Amazon EMR is a managed cluster platform that runs big data frameworks like Apache Spark, Hive, and Hadoop for data processing and transformation at scale.

Why it matters:

  • Supports large-scale data preprocessing for ML
  • Easily processes petabytes of structured or unstructured data
  • Integrates with S3, HDFS, Redshift, and more

Typical Use Cases:

  • Preprocessing datasets for ML models
  • Running Spark ML jobs at scale
  • Performing distributed feature engineering

Learn more

AWS Glue

What it is:
AWS Glue is a serverless data integration service that discovers, prepares, and combines data for analytics and ML, using ETL pipelines.

Why it matters:

  • Automates data cataloging, cleaning, and transformation
  • Integrates directly with S3, Redshift, and RDS
  • Supports Python- and Spark-based ETL jobs

Typical Use Cases:

  • Cleaning and joining ML training data
  • Building ETL pipelines for AI dashboards
  • Creating feature pipelines for SageMaker models

Learn more

AWS Glue DataBrew

What it is:
Glue DataBrew is a visual data preparation tool for users who want to clean and normalize data without writing code.

Why it matters:

  • Enables non-developers to explore and prepare datasets
  • Provides 250+ built-in transformations (e.g., deduplication, joins)
  • Accelerates data prep for ML pipelines and dashboards

Typical Use Cases:

  • Exploring AI/ML datasets visually
  • Removing outliers, fixing nulls before model training
  • Generating reusable transformations with no code

Learn more

AWS Lake Formation

What it is:
Lake Formation helps you build, secure, and manage data lakes on AWS. It simplifies ingesting, cataloging, and securing data from various sources into S3.

Why it matters:

  • Makes it easier to create a centralized data lake for AI
  • Provides fine-grained data access control
  • Integrates with Glue, Athena, Redshift, and SageMaker

Typical Use Cases:

  • Creating data lakes for AI training and analysis
  • Managing data access permissions for teams
  • Curating and tagging ML training datasets

Learn more

Amazon OpenSearch Service

What it is:
OpenSearch Service is a managed search and analytics engine that supports full-text search, log analytics, and vector search for AI use cases.

Why it matters:

  • Supports semantic search and RAG (Retrieval-Augmented Generation)
  • Integrates with Bedrock Knowledge Bases
  • Includes k-NN vector indexing for similarity search

Typical Use Cases:

  • Powering AI chatbots with semantic search
  • Storing and retrieving embeddings for vector search
  • Building analytics dashboards from log data

Learn more

Amazon QuickSight

What it is:
QuickSight is AWS’s business intelligence and data visualization tool that helps create dashboards, reports, and charts from various data sources.

Why it matters:

  • Allows real-time visualization of AI/ML results
  • Supports embedded dashboards in apps
  • Uses ML-powered insights (e.g., anomaly detection, forecasting)

Typical Use Cases:

  • Visualizing model predictions or performance metrics
  • Creating dashboards for business stakeholders
  • Monitoring usage and accuracy trends for ML solutions

Learn more

Amazon Redshift

What it is:
Amazon Redshift is a fully managed cloud data warehouse that lets you analyze structured and semi-structured data at scale using SQL.

Why it matters:

  • Integrates with SageMaker for in-database ML
  • Supports Redshift ML to run models directly in the warehouse
  • Handles petabyte-scale analytics

Typical Use Cases:

  • Running AI inference directly in SQL queries
  • Building AI-powered dashboards from transactional data
  • Training ML models on aggregated data

Learn more