SDE IV - GPU Engineer

Glance · Glance AI - Tech
📍 Bangalore

About Glance AI

Glance AI is an AI commerce platform shaping the next wave of e-commerce with inspiration-led shopping: less about searching for what you want and more about discovering who you could be. Operating in 140 countries, Glance AI transforms every screen into a stage for instant, personal, and joyful discovery, where inspiration becomes something you can explore, feel, and shop in the moment.

Its proprietary models are integrated with Google’s most advanced AI platforms, Gemini and Imagen on Vertex AI, and deliver hyper-realistic, deeply personal shopping experiences across categories such as fashion, beauty, travel, accessories, home décor, and pets. Designed to fit into everyday consumer technology, Glance AI reimagines the future of e-commerce with inspiration-led discovery and shopping.

With an open architecture built for effortless adoption across hardware and software ecosystems, Glance AI is creating a platform that can become a staple in everyday consumer technology. It partners with the world’s leading smartphone makers, connected TV manufacturers, telecom providers, and global brands — meeting people where they are: on mobile, smart TVs, and brand websites.

Glance AI combines its rich first-party data and broad consumer access with InMobi’s global scale, insights, and targeting capabilities to create high-impact, performance-driven shopping journeys for brands worldwide. Part of the InMobi Group, a global technology and advertising leader reaching over 2 billion devices and serving more than 30,000 enterprise brands worldwide, Glance AI is backed by Google, Jio Platforms, and Mithril Capital.

About the Role

As a GPU Systems Engineer, you’ll lead design and optimization efforts across our GPU inference stack.
You will architect the libraries and runtime systems that enable Stable Diffusion, multimodal transformers, and emerging video generation models to run efficiently at scale.

You’ll guide cross-functional teams, influence hardware selection, and set the technical vision for GPU optimization practices across the company.

Key Responsibilities

  • Architect high-performance inference runtimes, kernel dispatchers, and memory planners for large diffusion and transformer workloads.
  • Lead investigations into cross-GPU performance bottlenecks, communication overheads, and scheduling inefficiencies.
  • Drive multi-GPU parallelism strategies: model, pipeline, and tensor parallelism.
  • Establish company-wide GPU optimization standards, tooling, and SLIs.
  • Collaborate with research to design scalable implementations of novel architectures.
  • Mentor engineers in profiling, tuning, and low-level optimization.
  • Partner with hardware vendors and infra teams to maximize cluster utilization.

Required Qualifications

  • 5+ years in high-performance computing, GPU runtime systems, or ML infrastructure.
  • Proven expertise in CUDA / Triton / C++, with deep understanding of GPU scheduling, occupancy, register usage, and tensor cores.
  • Experience building and maintaining distributed inference or training systems.
  • Ability to design abstractions balancing flexibility and performance.
  • Strong knowledge of NCCL, NVLink, PCIe, and interconnects.
  • Familiarity with profiling automation and performance dashboards.
  • Excellent technical leadership and mentoring capabilities.

Preferred Qualifications

  • Background in compiler-aided optimization (TVM, XLA, MLIR, Triton).
  • Experience tuning Stable Diffusion or transformer inference pipelines.
  • Exposure to heterogeneous compute backends (AMD ROCm, TPU, ASICs).
  • Experience working with hardware–software co-design initiatives.
  • Open-source or research contributions in GPU optimization.

"Glance collects and processes personal data such as your name, contact details, resume and other information that may contain personal data for the purpose of processing your application. Glance utilizes Greenhouse, a third-party platform. Please review Greenhouse's Privacy Policy to understand how the data collected from you is processed and managed. By clicking on 'Submit Application', you acknowledge and agree to the above privacy terms. Should you have any privacy concerns, you may contact us through the details mentioned in your application confirmation email."

Frequently Asked Questions

Is the salary disclosed for the SDE IV - GPU Engineer position at Glance?
The salary for this SDE IV - GPU Engineer role at Glance is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.

Where is the SDE IV - GPU Engineer position at Glance located?
This SDE IV - GPU Engineer role at Glance is based in Bangalore. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.

Which team or department does the SDE IV - GPU Engineer at Glance belong to?
This SDE IV - GPU Engineer position is part of the Glance AI - Tech department at Glance. See the full job description for more information about the team structure and responsibilities.

How do I apply for the SDE IV - GPU Engineer position at Glance?
Click the "Apply Now" button on this page. You will be redirected to Glance's official application portal, hosted on Greenhouse, where you can submit your application directly.

When was the SDE IV - GPU Engineer job at Glance posted?
This SDE IV - GPU Engineer position at Glance was posted on Oct 9, 2025. Apply as soon as possible, as early applications are often reviewed first.