Member of Technical Staff - ML Performance

modalΒ· Engineering
Apply Now β†—
πŸ“ New YorkπŸ“ San FranciscoFullTimeπŸ’° USD 150K–350K/yr

About this role

About Us:

AI needs a new infrastructure layer. We're building it at Modal.

Every era of computing brought new workloads that previous infrastructure couldn't support: mainframes, databases, and the cloud. Each time, the company that rebuilt the layer underneath defined the decade. AI is no different, except it touches everything instead of one slice, and the window to build the layer underneath it is open right now.

Our customers include category-defining companies like Lovable, Ramp, Cognition, DoorDash, and Suno. They rely on Modal for instant GPU access, sub-second container starts, and native storage, so it's simple to serve low-latency inference, fine-tune models, and access production-ready sandboxes at scale.

We recently raised a $355M Series C at a $4.65B valuation, led by General Catalyst and Redpoint Ventures. We've crossed $300M+ ARR and grown fivefold since September.

Our team includes creators of popular open-source projects (e.g.,Seaborn,Luigi), academic researchers, international olympiad medalists, and experienced engineering and product leaders with decades of experience.

The Role:

We are looking for strong engineers with experience in making ML systems performant at scale. If you are interested in contributing to open-source projects and Modal’s container runtime to push language and diffusion models towards higher throughput and lower latency, we’d love to hear from you!

Requirements:

  • 5+ years of experience writing high-quality, high-performance code.

  • Experience working with torch, high-level ML frameworks, and inference engines (vLLM or TensorRT).

  • Familiarity with Nvidia GPU architecture and CUDA.

  • Experience with ML performance engineering (tell us a story about boosting GPU performance β€” debugging SM occupancy issues, rewriting an algorithm to be compute-bound, eliminating host overhead, etc).

  • Nice-to-have: familiarity with low-level operating system foundations (Linux kernel, file systems, containers, etc).

Frequently Asked Questions

What is the salary for the Member of Technical Staff - ML Performance role at modal?
The listed salary for this Member of Technical Staff - ML Performance position at modal is USD 150K–350K/yr. This is an FullTime role.
Where is the Member of Technical Staff - ML Performance position at modal located?
This Member of Technical Staff - ML Performance role at modal is based in New York, San Francisco. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Is the Member of Technical Staff - ML Performance role at modal full-time or part-time?
This is listed as a FullTime position. It is posted as a Member of Technical Staff - ML Performance role in the Engineering department at modal.
Which team or department does the Member of Technical Staff - ML Performance at modal belong to?
This Member of Technical Staff - ML Performance position is part of the Engineering department at modal. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Member of Technical Staff - ML Performance position at modal?
Click the "Apply Now" button on this page. You will be redirected to modal's official application portal hosted on ashby where you can submit your application directly.
When was the Member of Technical Staff - ML Performance job at modal posted?
This Member of Technical Staff - ML Performance position at modal was posted on Apr 21, 2026. Apply as soon as possible β€” early applications are often reviewed first.
Member of Technical Staff - ML Performance
modal Β· πŸ’° USD 150K–350K/yr
Apply for this role β†—

You'll be redirected to modal's official application page on Ashby ATS.