Distributed Training Engineer

periodic-labs· Bits: Research, LLMs, machine learning, infra
Apply Now ↗
📍 Menlo Park, RemoteFullTime

About this role

About Periodic Labs

We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identity and solve problems without boundaries or bureaucracy. We eagerly learn new tools and new science to push forward our mission.

About the role

You will optimize, operate and develop large-scale distributed LLM training systems that power AI scientific research. You will work closely with researchers to bring up, debug, and maintain mid-training and reinforcement learning workflows. You will build tools and directly support frontier-scale experiments to make Periodic Labs the world’s best AI + science lab for physicists, computational materials scientists, AI researchers, and engineers. You will contribute open-source large scale LLM training frameworks.

You might thrive in this role if you have experience with:

  • Training on clusters with ≥5,000 GPUs

  • 5D parallel LLM training

  • Distributed training frameworks such as Megatron-LM, FSDP, DeepSpeed, TorchTitan

  • Optimizing training throughput for large scale Mixture-of-Expert models

Frequently Asked Questions

Is the salary disclosed for the Distributed Training Engineer position at periodic-labs?
The salary for this Distributed Training Engineer role at periodic-labs is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Where is the Distributed Training Engineer position at periodic-labs located?
This Distributed Training Engineer role at periodic-labs is based in Menlo Park, Remote. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Is the Distributed Training Engineer role at periodic-labs full-time or part-time?
This is listed as a FullTime position. It is posted as a Distributed Training Engineer role in the Bits: Research, LLMs, machine learning, infra department at periodic-labs.
Which team or department does the Distributed Training Engineer at periodic-labs belong to?
This Distributed Training Engineer position is part of the Bits: Research, LLMs, machine learning, infra department at periodic-labs. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Distributed Training Engineer position at periodic-labs?
Click the "Apply Now" button on this page. You will be redirected to periodic-labs's official application portal hosted on ashby where you can submit your application directly.
When was the Distributed Training Engineer job at periodic-labs posted?
This Distributed Training Engineer position at periodic-labs was posted on Sep 24, 2025. Apply as soon as possible — early applications are often reviewed first.
Distributed Training Engineer
periodic-labs
Apply for this role ↗

You'll be redirected to periodic-labs's official application page on Ashby ATS.