Member of Technical Staff - Kernels & GPU Performance

gimletยท Engineering
Apply Now โ†—
๐Ÿ“ San FranciscoFullTime๐Ÿ’ฐ USD 150Kโ€“350K/yr

About this role

About Us

Gimlet is building the next generation of AI infrastructure: large-scale AI datacenters and the orchestration platform that coordinates them.

The future of AI will require vastly more compute than exists today. But as AI workloads become more complex and new hardware architectures emerge, simply deploying more GPUs isn't enough. The challenge is making increasingly diverse compute work together.

Gimlet's platform intelligently partitions and routes workloads across heterogeneous hardware, enabling step-function improvements in performance and efficiency. Customers deploy through production-grade APIs without needing to think about hardware selection, placement, or optimization.

We work with foundation labs, hyperscalers, and AI-native companies to power production workloads at massive scale and help define the infrastructure layer for the future of AI.

About the role

Gimlet Labs is seeking a Member of Technical Staff focused on kernels and GPU performance. In this role, you will work close to accelerators and execution hardware to extract maximum performance from AI workloads across diverse and rapidly evolving platforms. You will analyze low-level execution behavior, design and optimize kernels, and ensure performance is reliable across both established and emerging hardware.

This role is ideal for engineers who enjoy deep performance work, reasoning about hardware tradeoffs, and turning theoretical peak performance into real-world results.

What you will work on

  • Design, implement, and optimize GPU and accelerator kernels for AI workloads

  • Analyze and tune performance across the GPU execution stack, including memory access patterns, synchronization, and instruction scheduling

  • Work with compilers and runtimes to ensure kernels integrate cleanly and perform well in end-to-end systems

  • Bring up and optimize execution on new or emerging accelerators

  • Profile, benchmark, and debug performance issues across kernels, runtimes, and hardware

  • Ensure performance optimizations are robust, correct, and production-ready at scale

You may be a good fit if

  • Strong software engineering fundamentals

  • Experience working on performance-critical systems close to hardware

  • Comfort reasoning about low-level execution behavior, memory hierarchies, and performance tradeoffs

Strong candidates may also have

  • Experience with CUDA, Triton, CUTLASS, or other accelerator programming models

  • Deep understanding of GPU execution models (warps/wavefronts, blocks, grids)

  • Experience optimizing memory access patterns (coalescing, shared memory, cache behavior)

  • Familiarity with occupancy, latency hiding, and instruction-level parallelism

  • Experience using profiling and performance analysis tools

  • Familiarity with multi-GPU or distributed execution is a plus

What Makes Gimlet Different

At Gimlet, you will work on infrastructure problems that span the full stack of modern AI systems. Our team operates across datacenters, networking, distributed systems, compilers, runtimes, orchestration, and performance engineering to build the foundation for the next generation of AI infrastructure.

As an early member of the team, you will have significant ownership, work alongside highly technical engineers, and help shape both the systems we build and how we scale the company.

We value people who are excited to work across domains, take ownership of meaningful problems, and build technology that enables the next generation of AI.

Frequently Asked Questions

What is the salary for the Member of Technical Staff - Kernels & GPU Performance role at gimlet?
The listed salary for this Member of Technical Staff - Kernels & GPU Performance position at gimlet is USD 150Kโ€“350K/yr. This is an FullTime role.
Where is the Member of Technical Staff - Kernels & GPU Performance position at gimlet located?
This Member of Technical Staff - Kernels & GPU Performance role at gimlet is based in San Francisco. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Is the Member of Technical Staff - Kernels & GPU Performance role at gimlet full-time or part-time?
This is listed as a FullTime position. It is posted as a Member of Technical Staff - Kernels & GPU Performance role in the Engineering department at gimlet.
Which team or department does the Member of Technical Staff - Kernels & GPU Performance at gimlet belong to?
This Member of Technical Staff - Kernels & GPU Performance position is part of the Engineering department at gimlet. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Member of Technical Staff - Kernels & GPU Performance position at gimlet?
Click the "Apply Now" button on this page. You will be redirected to gimlet's official application portal hosted on ashby where you can submit your application directly.
When was the Member of Technical Staff - Kernels & GPU Performance job at gimlet posted?
This Member of Technical Staff - Kernels & GPU Performance position at gimlet was posted on Mar 10, 2026. Apply as soon as possible โ€” early applications are often reviewed first.
Member of Technical Staff - Kernels & GPU Performance
gimlet ยท ๐Ÿ’ฐ USD 150Kโ€“350K/yr
Apply for this role โ†—

You'll be redirected to gimlet's official application page on Ashby ATS.