EMR Cluster
Big data processing on Spark / Hive / Hadoop / Presto.
Configuration
Section titled “Configuration”| Setting | Type | Required | Default |
|---|---|---|---|
| Cluster name | Text | Yes | — |
| EMR release | Text | — | emr-7.0.0 |
| Applications | List | — | — |
| Master instance type Options: m5.xlarge — 4v · 16 GiB, m5.2xlarge — 8v · 32 GiB, m5.4xlarge — 16v · 64 GiB, m5.8xlarge — 32v · 128 GiB, m5.12xlarge — 48v · 192 GiB, m5.16xlarge — 64v · 256 GiB, m5.24xlarge — 96v · 384 GiB, m6g.xlarge — 4v · 16 GiB, m6g.2xlarge — 8v · 32 GiB, m6g.4xlarge — 16v · 64 GiB, m6g.8xlarge — 32v · 128 GiB, m6g.12xlarge — 48v · 192 GiB, m6g.16xlarge — 64v · 256 GiB, c5.xlarge — 4v · 8 GiB, c5.2xlarge — 8v · 16 GiB, c5.4xlarge — 16v · 32 GiB, c5.9xlarge — 36v · 72 GiB, c5.12xlarge — 48v · 96 GiB, c5.18xlarge — 72v · 144 GiB, c5.24xlarge — 96v · 192 GiB, r5.xlarge — 4v · 32 GiB, r5.2xlarge — 8v · 64 GiB, r5.4xlarge — 16v · 128 GiB, r5.8xlarge — 32v · 256 GiB, r5.16xlarge — 64v · 512 GiB, r5.24xlarge — 96v · 768 GiB | Choice | — | m5.xlarge |
| Core instance type Options: m5.xlarge — 4v · 16 GiB, m5.2xlarge — 8v · 32 GiB, m5.4xlarge — 16v · 64 GiB, m5.8xlarge — 32v · 128 GiB, m5.12xlarge — 48v · 192 GiB, m5.16xlarge — 64v · 256 GiB, m5.24xlarge — 96v · 384 GiB, m6g.xlarge — 4v · 16 GiB, m6g.2xlarge — 8v · 32 GiB, m6g.4xlarge — 16v · 64 GiB, m6g.8xlarge — 32v · 128 GiB, m6g.12xlarge — 48v · 192 GiB, m6g.16xlarge — 64v · 256 GiB, c5.xlarge — 4v · 8 GiB, c5.2xlarge — 8v · 16 GiB, c5.4xlarge — 16v · 32 GiB, c5.9xlarge — 36v · 72 GiB, c5.12xlarge — 48v · 96 GiB, c5.18xlarge — 72v · 144 GiB, c5.24xlarge — 96v · 192 GiB, r5.xlarge — 4v · 32 GiB, r5.2xlarge — 8v · 64 GiB, r5.4xlarge — 16v · 128 GiB, r5.8xlarge — 32v · 256 GiB, r5.16xlarge — 64v · 512 GiB, r5.24xlarge — 96v · 768 GiB | Choice | — | m5.xlarge |
| Core node count | Number | — | 2 |
| Task instance type Options: m5.xlarge — 4v · 16 GiB, m5.2xlarge — 8v · 32 GiB, m5.4xlarge — 16v · 64 GiB, m5.8xlarge — 32v · 128 GiB, m5.12xlarge — 48v · 192 GiB, m5.16xlarge — 64v · 256 GiB, m5.24xlarge — 96v · 384 GiB, m6g.xlarge — 4v · 16 GiB, m6g.2xlarge — 8v · 32 GiB, m6g.4xlarge — 16v · 64 GiB, m6g.8xlarge — 32v · 128 GiB, m6g.12xlarge — 48v · 192 GiB, m6g.16xlarge — 64v · 256 GiB, c5.xlarge — 4v · 8 GiB, c5.2xlarge — 8v · 16 GiB, c5.4xlarge — 16v · 32 GiB, c5.9xlarge — 36v · 72 GiB, c5.12xlarge — 48v · 96 GiB, c5.18xlarge — 72v · 144 GiB, c5.24xlarge — 96v · 192 GiB, r5.xlarge — 4v · 32 GiB, r5.2xlarge — 8v · 64 GiB, r5.4xlarge — 16v · 128 GiB, r5.8xlarge — 32v · 256 GiB, r5.16xlarge — 64v · 512 GiB, r5.24xlarge — 96v · 768 GiB | Choice | — | — |
| Task node count | Number | — | 0 |
| Task nodes use Spot | Toggle | — | false |
| Spot bid price ($) | Number | — | — |
| Scale-down behavior Options: At instance hour, At task completion | Choice | — | TERMINATE_AT_TASK_COMPLETION |
| Visible to all users | Toggle | — | true |
| Termination protection | Toggle | — | false |
| Keep alive when no steps | Toggle | — | true |
| Log URI (S3) | Text | — | — |
| EC2 instance profile | Text | — | — |
| EMR service role | Text | — | — |
| Kerberos KDC admin password | Text | — | — |
| Kerberos realm | Text | — | — |
| Configurations (JSON) | Text | — | — |
| Step concurrency level | Text | — | 1 |
| Tags | Key–value | — | — |
Connections
Section titled “Connections”| Socket | Direction | Accepts | Terraform arg |
|---|---|---|---|
| Subnet | Input | aws.subnet | ec2_attributes.subnet_id |
| Security groupsmulti | Input | aws.security-group | ec2_attributes.additional_master_security_groups |