Hands-On Workshops
Step-by-step guided workshops to deploy and operate distributed training on AWS.
🎓
AI on SageMaker HyperPod (Slurm)
Deploy and train on managed GPU clusters with HyperPod and Slurm scheduling
HyperPodSlurmGPUDistributed Training
🎓
AI on SageMaker HyperPod (EKS)
Run distributed training on HyperPod with Kubernetes/EKS orchestration
HyperPodEKSKubernetesGPU
🎓
ML on AWS PCS
Use the managed Parallel Computing Service with Slurm for ML training
PCSManaged SlurmHPCML Training