Megatron / NeMo

4 test cases

NVIDIA's Megatron-LM and NeMo frameworks for large-scale LLM pre-training, with tensor, pipeline, expert, and sequence parallelism.
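
The parallelism modes listed above combine to partition a fixed pool of GPUs. Below is a minimal sketch of that arithmetic, assuming the commonly described layout in which the world size factors into tensor-parallel × pipeline-parallel × data-parallel ranks, expert parallelism subdivides the data-parallel ranks, and sequence parallelism reuses the tensor-parallel group and so adds no factor of its own. The helper `parallel_layout` is hypothetical and is not part of Megatron-LM or NeMo; check the framework documentation for the exact constraints of the version in use.

```python
def parallel_layout(world_size, tensor_parallel, pipeline_parallel, expert_parallel=1):
    """Split world_size GPUs into tensor-, pipeline-, and data-parallel groups.

    Hypothetical helper for illustration only; not a Megatron-LM or NeMo API.
    """
    for name, value in (
        ("world_size", world_size),
        ("tensor_parallel", tensor_parallel),
        ("pipeline_parallel", pipeline_parallel),
        ("expert_parallel", expert_parallel),
    ):
        if value < 1:
            raise ValueError(f"{name} must be a positive integer, got {value}")

    # Tensor and pipeline parallelism together shard one model replica.
    model_parallel = tensor_parallel * pipeline_parallel
    if world_size % model_parallel != 0:
        raise ValueError(
            f"world_size={world_size} is not divisible by "
            f"tensor_parallel * pipeline_parallel = {model_parallel}"
        )

    # Whatever is left over becomes data-parallel replicas of the model.
    data_parallel = world_size // model_parallel

    # Assumption: expert-parallel groups are carved out of the data-parallel
    # ranks, so the expert-parallel size must divide the data-parallel size.
    if data_parallel % expert_parallel != 0:
        raise ValueError(
            f"expert_parallel={expert_parallel} does not divide "
            f"data_parallel={data_parallel}"
        )

    return {
        "tensor_parallel": tensor_parallel,
        "pipeline_parallel": pipeline_parallel,
        "data_parallel": data_parallel,
        "expert_parallel": expert_parallel,
    }
```

For example, `parallel_layout(64, 4, 2, expert_parallel=4)` leaves 8 data-parallel replicas, each expert-parallel group spanning 4 of them, while `parallel_layout(64, 4, 3)` raises because 64 GPUs cannot be divided evenly into groups of 12.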