EMR PNG and SVG Icon
Amazon EMR (Elastic MapReduce) is a cloud big data platform for processing massive amounts of data using open-source tools like Apache Spark, Hive, and Hadoop.
Last Modified: August 10, 2025

16px
32px
48px
64px
Details
Key Features
- Managed big data processing service using open-source frameworks.
- Supports Apache Spark, Hive, HBase, and more.
- Automatic scaling and cost optimization options.
- Processes large datasets efficiently on AWS.
Common Use Cases
- Processing big data using Apache Spark and Hadoop
- Running large-scale log analysis and ETL jobs
- Training machine learning models on distributed data
Explore More Icons
Cost and Usage Report
AWS Cost and Usage Report (CUR) provides the most detailed information available about your AWS costs and usage, exported to Amazon S3 for advanced analysis.
Elastic Fabric Adapter
Elastic Fabric Adapter (EFA) is a network interface for EC2 instances that enables low-latency, high-throughput communication for HPC and ML workloads.
DeepLens
AWS DeepLens is a deep learning-enabled video camera for developers to run ML models locally on edge devices in real time.
Telco Network Builder
AWS Telco Network Builder simplifies the deployment and management of telecom networks on AWS using standard telecom models.
SageMaker
Amazon SageMaker is a fully managed service that provides tools to build, train, and deploy machine learning models at scale.
rePost Private
AWS re:Post Private offers a secure, private version of the re:Post community within an organization, enabling internal knowledge sharing and collaboration around AWS topics.
Pinpoint
Amazon Pinpoint is a flexible and scalable outbound and inbound marketing communications service for sending targeted messages to customers across multiple channels.
Elemental MediaStore
AWS Elemental MediaStore is a storage service optimized for media that offers the performance, consistency, and low latency required for video workloads.
FSx for NetApp ONTAP
Amazon FSx for NetApp ONTAP offers fully managed NetApp file systems on AWS with familiar features like snapshots, clones, and data tiering.
FSx
Amazon FSx provides fully managed third-party file systems optimized for a range of workloads including Windows File Server, Lustre, NetApp, and OpenZFS.
Red Hat OpenShift Service on AWS
Red Hat OpenShift Service on AWS (ROSA) is a fully managed service that enables you to run Red Hat OpenShift, a Kubernetes-based container platform, directly on AWS.
SageMaker Studio Lab
Amazon SageMaker Studio Lab is a free ML development environment that provides Jupyter-based tools for experimenting with models and datasets.
DevOps Guru
Amazon DevOps Guru uses ML to detect operational issues and anomalies in applications, providing insights to improve reliability and performance.
Braket
Amazon Braket is a fully managed service that helps researchers and developers explore and design quantum computing algorithms on simulators and quantum hardware.
Automation
AWS Systems Manager Automation simplifies common maintenance and deployment tasks using predefined or custom workflows.
Vault
Vault typically refers to Amazon S3 Glacier Vaults, containers for managing archives and controlling access to long-term stored data.
Nova
Amazon Nova refers to internal AI infrastructure or services (if announced); details may vary as it's not yet publicly defined.
Private Certificate Authority
AWS Private Certificate Authority (CA) is a managed private CA service that helps you issue and manage private SSL/TLS certificates for internal applications.
HealthLake
Amazon HealthLake is a HIPAA-eligible service that stores, transforms, and analyzes health data in the FHIR format for advanced analytics and ML.
Reserved Instance Reporting
AWS Reserved Instance Reporting helps you monitor and optimize the utilization and coverage of your purchased Reserved Instances for cost savings.
Translate
Amazon Translate is a neural machine translation service that delivers fast, high-quality, and customizable language translation.
Lake Formation
AWS Lake Formation is a service that simplifies setting up a secure data lake by automating data ingestion, cleaning, cataloging, and access control.
Command Line Interface
AWS Command Line Interface (CLI) is a tool that enables you to manage AWS services and resources through commands in your terminal.
SQS Message
Amazon SQS Message refers to an individual data unit sent between distributed system components via Amazon Simple Queue Service.