CUDA Developer (AI/LLM & GPU Optimization)

Gramian Consulting Group · Talent Solutions
Remote · Colombia · Telecommute · Contract

About this role

Gramian Consultancy is a boutique consultancy specializing in IT professional services and engineering talent solutions. With a strong background in software engineering and leadership, we help companies build high-performing teams by matching them with professionals who truly fit their needs.

Role Overview

We are looking for experienced CUDA Developers to work on advanced AI and machine learning initiatives focused on improving the capabilities of large language models (LLMs). In this role, you will solve complex GPU programming challenges, optimize high-performance CUDA workloads, review AI-generated code, and contribute to the development of more capable AI systems.

Duration: 3 months

Commitment: 40 hours/week, with 4 hours/day of overlap with PST

Model: Contract, time and materials

Location: 100% Remote: Bangladesh, Brazil, Colombia, Egypt, Ghana, India, Pakistan, Indonesia, Kenya, Nigeria, Turkey, Vietnam

Interview: 1 technical interview

Key Responsibilities

  • Solve advanced CUDA and GPU programming problems involving parallel computing and performance optimization
  • Review, evaluate, and improve AI-generated CUDA, C++, and Python code
  • Optimize GPU kernels for throughput, latency, memory efficiency, and resource utilization
  • Work with CUDA libraries and frameworks such as Thrust, cuBLAS, and cuDNN
  • Debug and resolve issues related to CUDA kernels, synchronization, and memory management
  • Develop high-quality technical prompts, solutions, explanations, and evaluations for AI model training
  • Collaborate with AI researchers, engineers, and evaluation teams
  • Stay up to date with the latest developments in CUDA, GPU architectures, and performance optimization techniques

Requirements

  • 5+ years of professional software development experience with a strong focus on CUDA development
  • Strong proficiency in C/C++
  • Strong hands-on experience with Python and scientific computing ecosystems
  • Experience working with PyTorch and NumPy
  • Experience with CUDA 12.3 or newer
  • Strong understanding of GPU programming, parallel computing, and performance optimization
  • Experience optimizing workloads for high-performance execution and efficient resource utilization
  • Experience with CUDA libraries such as Thrust, cuBLAS, and cuDNN

Frequently Asked Questions

Is the salary disclosed for the CUDA Developer (AI/LLM & GPU Optimization) position at Gramian Consulting Group?
The salary for this CUDA Developer (AI/LLM & GPU Optimization) role at Gramian Consulting Group is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Is the CUDA Developer (AI/LLM & GPU Optimization) job at Gramian Consulting Group remote?
Yes, this CUDA Developer (AI/LLM & GPU Optimization) position at Gramian Consulting Group is fully remote (telecommute) and is listed under Colombia. You can work from home or from anywhere in the supported regions.
Is the CUDA Developer (AI/LLM & GPU Optimization) role at Gramian Consulting Group full-time or part-time?
This is listed as a Contract position with a commitment of 40 hours per week for 3 months. It is posted in the Talent Solutions department at Gramian Consulting Group.
Which team or department does the CUDA Developer (AI/LLM & GPU Optimization) at Gramian Consulting Group belong to?
This CUDA Developer (AI/LLM & GPU Optimization) position is part of the Talent Solutions department at Gramian Consulting Group. See the full job description for more information about the team structure and responsibilities.
How do I apply for the CUDA Developer (AI/LLM & GPU Optimization) position at Gramian Consulting Group?
Click the "Apply Now" button on this page. You will be redirected to Gramian Consulting Group's official application portal, hosted on Workable, where you can submit your application directly.
When was the CUDA Developer (AI/LLM & GPU Optimization) job at Gramian Consulting Group posted?
This CUDA Developer (AI/LLM & GPU Optimization) position at Gramian Consulting Group was posted on May 22, 2026. Apply as soon as possible; early applications are often reviewed first.