Media Software Engineer, Speech (All Levels)

Cantina · Engineering
Remote · San Francisco · Full-time

About this role

About Cantina:

Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning.

If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

About the Role:

The Media Team at Cantina is building the real-time infrastructure powering live conversations between people and AI. Our goal is simple but technically challenging: make interacting with AI feel fast, natural, and truly conversational.

We’re looking for a Software Engineer to help improve the speech, audio, and media systems at the heart of the Cantina experience. A major focus of this role is reducing latency and improving responsiveness so AI bots can hear users, process intent, and respond in real time — without awkward pauses or delays.

This team works across everything from low-level media pipelines and WebRTC frameworks to globally distributed infrastructure supporting real-time voice and video interactions across iOS, Android, and web.

If you’re excited by high-performance C++, real-time systems, speech technologies, and building the future of conversational AI, we’d love to talk.

What You’ll Do:

  • Improve the real-time speech and media systems powering live AI conversations.

  • Reduce latency and optimize responsiveness across audio streaming and speech pipelines.

  • Build new voice and video capabilities that enable more immersive interactions between users and AI bots.

  • Improve and extend our custom WebRTC infrastructure across iOS, Android, and web.

  • Work closely with product and platform teams to shape the future of conversational AI experiences.

What You’ll Bring:

We welcome applicants across a wide range of experience levels, from new graduates to senior engineers. Responsibilities and leveling will be tailored to match the candidate’s background.

These are the minimum qualifications:

  • BS or MS in Computer Science, Computer Engineering, or a related field; or equivalent experience.

  • Excellent communication skills.

  • Experience with C or C++.

  • Strong computer science fundamentals, including familiarity with data structures and concurrent / multithreaded programming.

  • Exposure to system programming concepts, including network protocols; memory management; and distributed systems fundamentals.

  • Object-oriented programming and design skills.

  • Interest in solving challenging, subtle engineering problems.

These are the preferred qualifications:

  • Previous experience with WebRTC, streaming protocols, or other media-related technologies.

  • Familiarity with audio or video processing techniques and algorithms.

  • Experience creating backend server infrastructure.

  • Experience developing software for iOS and Android.

  • Familiarity with building services using Node.js.

  • Familiarity with artificial intelligence and machine learning techniques, particularly in relation to speech recognition and synthesis.

Location:

While we offer fully remote and hybrid employment opportunities, our Media Engineering team strongly prefers candidates who are available (or willing to relocate) to work in the Bay Area. For reference, 95% of the Media Engineering team works from the Bay Area.

Compensation:

The anticipated annual base salary range for this role is $120,000 to $180,000. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.

Benefits:

  • Competitive salary and generous company equity

  • Medical, dental, and vision insurance – 99.99% of premiums covered by Cantina

  • 42 days of paid time off, including:

    • 15 PTO days

    • 10 sick days

    • 15 company holidays

    • 2 floating holidays

  • Generous parental leave & fertility support

  • 401(k) retirement savings plan

  • Lifestyle spending account – $500/month to use however you’d like

  • Complimentary lunch and snacks for in-office employees

  • One Medical membership, and more!

Frequently Asked Questions

Is the salary disclosed for the Media Software Engineer, Speech (All Levels) position at Cantina?
Yes. The posting lists an anticipated annual base salary range of $120,000 to $180,000. Final compensation depends on factors including skills, experience, job scope, location, and competitive market data.
Is the Media Software Engineer, Speech (All Levels) job at Cantina remote?
Yes. The position is listed as remote, and Cantina offers fully remote and hybrid arrangements. The Media Engineering team strongly prefers candidates who can work in the Bay Area, where 95% of the team is based.
Is the Media Software Engineer, Speech (All Levels) role at Cantina full-time or part-time?
This is a full-time position in the Engineering department at Cantina.
Which team or department does the Media Software Engineer, Speech (All Levels) at Cantina belong to?
This position is part of the Media Team within the Engineering department at Cantina. See the full job description for more information about the team's responsibilities.
How do I apply for the Media Software Engineer, Speech (All Levels) position at Cantina?
Apply through Cantina's official application portal, hosted on Ashby, where you can submit your application directly.
When was the Media Software Engineer, Speech (All Levels) job at Cantina posted?
This position was posted on Nov 24, 2025. Apply as soon as possible; early applications are often reviewed first.