Member of Technical Staff - Efficient ML

embedding-vc· Moonlake
Apply Now ↗
📍 San Francisco Bay AreaFullTime

About this role

Introducing Moonlake, AI for creating world simulations.

Scope of Work

Training efficiency

  • Dataloaders, fusion, activation remat, gradient checkpointing.

  • FSDP/ZeRO/tensor+pipeline parallel; NCCL tuning.

GPU + kernel performance

  • Nsight profiling, Triton/CUDA kernels, fused ops.

  • Flash-attention–style speedups, sequence packing, KV-cache tricks.

Inference optimization

  • Low-latency serving, continuous batching, speculative decoding.

  • Quantization (GPTQ/AWQ), distillation, pruning.

Infra + reliability

  • SLURM/K8s multi-node jobs, checkpoint hygiene.

  • Determinism, env pinning, GPU failure handling.

We are committed to being an on-site, in-person team currently based in San Mateo

Frequently Asked Questions

Is the salary disclosed for the Member of Technical Staff - Efficient ML position at embedding-vc?
The salary for this Member of Technical Staff - Efficient ML role at embedding-vc is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Where is the Member of Technical Staff - Efficient ML position at embedding-vc located?
This Member of Technical Staff - Efficient ML role at embedding-vc is based in San Francisco Bay Area. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Is the Member of Technical Staff - Efficient ML role at embedding-vc full-time or part-time?
This is listed as a FullTime position. It is posted as a Member of Technical Staff - Efficient ML role in the Moonlake department at embedding-vc.
Which team or department does the Member of Technical Staff - Efficient ML at embedding-vc belong to?
This Member of Technical Staff - Efficient ML position is part of the Moonlake department at embedding-vc. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Member of Technical Staff - Efficient ML position at embedding-vc?
Click the "Apply Now" button on this page. You will be redirected to embedding-vc's official application portal hosted on ashby where you can submit your application directly.
When was the Member of Technical Staff - Efficient ML job at embedding-vc posted?
This Member of Technical Staff - Efficient ML position at embedding-vc was posted on Jan 15, 2026. Apply as soon as possible — early applications are often reviewed first.
Member of Technical Staff - Efficient ML
embedding-vc
Apply for this role ↗

You'll be redirected to embedding-vc's official application page on Ashby ATS.