Machine Learning Infrastructure Engineer

characterΒ· Technical Staff - ML
Apply Now β†—
🌍 RemoteπŸ“ Redwood City, CAFullTimeπŸ’° USD 150K–350K/yr

About this role

About the role

We’re looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving infrastructure for ML research.

Responsibilities:

  • Provide infrastructure support to our ML research and product

  • Build tooling to diagnose cluster issues and hardware failures

  • Monitor deployments, manage experiments, and generally support our research

  • Maximize GPU allocation and utilization for both serving and training

Requirements:

  • 4+ years of experience supporting the infrastructure within an ML environment

  • Experience in developing tools used to diagnose ML infrastructure problems and failures

  • Experience with cloud platforms (e.g., Compute Engine, Kubernetes, Cloud Storage)

  • Experience working with GPUs

Nice to have

  • Experience with large GPU clusters and high-performance computing/networking

  • Experience with supporting large language model training

  • Experience with ML frameworks like Pytorch/TensorFlow/JAX

  • Experience with GPU kernel development

About Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.


In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Yearβ€”a testament to our innovative technology and visionary approach.


Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

Frequently Asked Questions

What is the salary for the Machine Learning Infrastructure Engineer role at character?
The listed salary for this Machine Learning Infrastructure Engineer position at character is USD 150K–350K/yr. This is a remote FullTime role.
Is the Machine Learning Infrastructure Engineer job at character remote?
Yes, this Machine Learning Infrastructure Engineer position at character is remote, with team members based in Redwood City, CA. You can work from home or anywhere in the supported regions.
Is the Machine Learning Infrastructure Engineer role at character full-time or part-time?
This is listed as a FullTime position. It is posted as a Machine Learning Infrastructure Engineer role in the Technical Staff - ML department at character.
Which team or department does the Machine Learning Infrastructure Engineer at character belong to?
This Machine Learning Infrastructure Engineer position is part of the Technical Staff - ML department at character. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Machine Learning Infrastructure Engineer position at character?
Click the "Apply Now" button on this page. You will be redirected to character's official application portal hosted on ashby where you can submit your application directly.
When was the Machine Learning Infrastructure Engineer job at character posted?
This Machine Learning Infrastructure Engineer position at character was posted on Apr 24, 2025. Apply as soon as possible β€” early applications are often reviewed first.
Machine Learning Infrastructure Engineer
character Β· πŸ’° USD 150K–350K/yr
Apply for this role β†—

You'll be redirected to character's official application page on Ashby ATS.