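https://github.com/pytorch/torchtitan/blob/d442743fed7980392a00eecd464b6db8522d8116/torchtitan/parallelisms/__init__.py#L46 The mesh dimension order here should be PP, DP, TP (outermost to innermost). This matters because communication cost is non-uniform (NUMA) across nodes: the innermost mesh dimension maps to contiguous ranks, which typically keeps TP traffic inside a node and leaves PP to cross node boundaries.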
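To illustrate why the order matters, here is a minimal pure-Python sketch (not torchtitan code). It assumes ranks are laid out row-major over the mesh shape, 8 GPUs per node, and ranks assigned to nodes contiguously; `groups_along` and `nodes_spanned` are hypothetical helper names, and the sizes (pp=2, dp=2, tp=4 on 16 ranks) are made up for the example.

```python
from itertools import product


def groups_along(dim_sizes, dim_names, target):
    """Rank groups that communicate along `target` in a row-major mesh."""
    idx = dim_names.index(target)
    # Row-major strides: the last (innermost) dimension has stride 1.
    strides = [1] * len(dim_sizes)
    for i in range(len(dim_sizes) - 2, -1, -1):
        strides[i] = strides[i + 1] * dim_sizes[i + 1]
    # Fix every other coordinate, then vary only the target coordinate.
    others = [range(s) for i, s in enumerate(dim_sizes) if i != idx]
    groups = []
    for coords in product(*others):
        coords = list(coords)
        group = []
        for k in range(dim_sizes[idx]):
            full = coords[:idx] + [k] + coords[idx:]
            group.append(sum(c * s for c, s in zip(full, strides)))
        groups.append(group)
    return groups


def nodes_spanned(group, gpus_per_node=8):
    """Number of distinct nodes a group touches (assumes contiguous ranks per node)."""
    return len({rank // gpus_per_node for rank in group})


# Assumed example: 2 nodes x 8 GPUs, pp=2, dp=2, tp=4.
good = groups_along([2, 2, 4], ["pp", "dp", "tp"], "tp")  # TP innermost
bad = groups_along([4, 2, 2], ["tp", "dp", "pp"], "tp")   # TP outermost

print(good[0], nodes_spanned(good[0]))
print(bad[0], nodes_spanned(bad[0]))
```

With PP, DP, TP order, each TP group in this example is a run of adjacent ranks on one node. With TP outermost, the same TP group strides across both nodes, so the most communication-heavy collectives go over the inter-node link.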