Amazon EKS

Kubernetes-based orchestration for distributed training jobs

View on GitHub

Overview

Amazon EKS provides Kubernetes-based orchestration for distributed training, enabling containerized workloads with GPU scheduling, EFA networking, and integration with Training Operator.