Staff ML Research Engineer, Marengo
About this role
Who we are
μμ μ΄ν΄ AIμ κΈλ‘λ² κΈ°μ€μ ν¨κ» λ§λ€μ΄ κ° μΈμ¬λ₯Ό μ°Ύμ΅λλ€!
νΈμ°λΈλ©μ€λ λ°©λν μμ λ°μ΄ν°λ₯Ό ν¨κ³Όμ μΌλ‘ μ²λ¦¬νμ¬, μμμ νΉνλ κ²μ, λΆμ, μμ½, μΈμ¬μ΄νΈ μμ± κΈ°λ₯μ μ 곡νλ μΈκ³ μ΅κ³ μμ€μ μμ νΉν AI λͺ¨λΈμ λ§λ€κ³ μμ΅λλ€.
μΈκ³ μ΅λ μ€ν¬μΈ 리그μμλ νΈμ°λΈλ©μ€ λͺ¨λΈμ νμ©ν΄ λ°©λν κ²½κΈ° μμ μμμ λΉ λ₯΄κ³ μ ννκ² νμ΄λΌμ΄νΈλ₯Ό μ λ³νμ¬ μ΄κ°μΈνλ μμ² κ²½νμ μ 곡νκ³ μμ΅λλ€. κ΅λ΄ ν΅ν©κ΄μ μΌν°μμλ μκΈ° μν©μ μ μν λμνκΈ° μν΄ νΈμ°λΈλ©μ€μ ν¨κ» CCTV μμμ ν¨μ¨μ μΌλ‘ νμνκ³ μμΌλ©°, μ μΈκ³ μ£Όμ λ°©μ‘μ¬μ μ€νλμ€λ€μ μμμ΅ λͺ μ μμ²μλ₯Ό μν μ½ν μΈ μ μμ νΈμ°λΈλ©μ€ λͺ¨λΈμ νμ©νκ³ μμ΅λλ€.
νΈμ°λΈλ©μ€λ μνλμμ€μ½μ μμΈμ μ€νΌμ€λ₯Ό λ Deep Tech μ€ννΈμ μΌλ‘, 4λ μ°μ CB Insights μ μ μΈκ³ 100λ AI μ€ννΈμ μ μ΄λ¦μ μ¬λ Έμ΅λλ€. NVIDIA, NEA, Index Ventures, Databricks, Snowflake λ± μΈκ³μ μΈ VCμ κΈ°μ λ€λ‘λΆν° μ΄ 1μ΅ 1μ²λ§ λ¬λ¬ μ΄μμ ν¬μλ₯Ό μ μΉνμΌλ©°, νκ΅μμ κ°λ°λ AI λͺ¨λΈ μ€ μ μΌνκ² Amazon Bedrockμ ν΅ν΄ μλΉμ€λ©λλ€. μ°λ¦¬λ νμν λλ£λ€κ³Ό νμ μ μΈ μ νμ λ§λ€κ³ μ μΈκ³ κ³ κ°λ€κ³Ό ν¨κ» μ±μ₯νκ³ μμ΅λλ€.
νΈμ°λΈλ©μ€λ λ€μκ³Ό κ°μ ν΅μ¬ κ°μΉλ₯Ό μ€μ¬μΌλ‘ μΌν©λλ€.
λμ νμ λν΄ μ μ§νκ³ μ±μ°°ν μ μλ νλ
μ€ν¨μ νΌλλ°±μ λλ €μνμ§ μλ λκΈ°μ κ²Έμ
λμμλ νμ΅μ ν΅ν΄ νμ μλμ ν¨κ» λμ¬ κ°λ μμΈ
λμ μ μΈ λ¬Έμ λ₯Ό ν¨κ» ν΄κ²°νλ©° μ±μ₯νλ κ³Όμ μ μ¦κΈ°λ λΆμ΄λΌλ©΄, κ·Έ κΈ°νκ° μ¬κΈ° νΈμ°λΈλ©μ€μ μμ΅λλ€.
About the Team
νΈμ°λΈλ©μ€μ λ©ν°λͺ¨λ¬ μλ² λ© λͺ¨λΈ Marengoμ μ°κ΅¬κ°λ°μ λ΄λΉνλ νμ λλ€. λΉλμ€, μ€λμ€, ν μ€νΈ λ± λ€μν λͺ¨λ¬λ¦¬ν°λ₯Ό νλμ μλ² λ© κ³΅κ°(Embedding Space)μ ν΅ν©νλ λͺ¨λΈμ μ°κ΅¬νκ³ κ°λ°ν©λλ€.
Contrastive learning, temporal video understanding, multimodal representation learning λ± λ€μν μ°κ΅¬ μ£Όμ λ₯Ό λ€λ£¨λ©°, λκ·λͺ¨ νμ΅ λ°μ΄ν° νμ΄νλΌμΈ ꡬμΆλΆν° λͺ¨λΈ μν€ν μ² μ€κ³, λΆμ° νμ΅ μ΅μ ν, νκ° μ²΄κ³ μ€κ³κΉμ§ λͺ¨λΈ κ°λ°μ μ κ³Όμ μ μ± μμ§λλ€. NVIDIA B300 λ± μΈκ³ μ΅κ³ μμ€μ GPU 리μμ€μ λν μ κ·Ό κΆνμ λ°νμΌλ‘ λκ·λͺ¨ μ€νμ λΉ λ₯΄κ² μνν©λλ€.
μ°κ΅¬μμ νλ‘λμ κΉμ§μ κ°κ·Ήμ΄ λ§€μ° μ§§μ νκ²½μμ, Search, Product, Infrastructure νκ³Ό κΈ΄λ°ν νμ νλ©° μ μΈκ³ μμ² κ³ κ°μ΄ μ¬μ©νλ λͺ¨λΈμ νμ§μ μ§μμ μΌλ‘ ν₯μμν΅λλ€.
About the Role
As a Staff ML Research Engineer on the Marengo team, you will set the technical direction for TwelveLabs' next-generation multimodal embedding models and own the end-to-end model development process, from research strategy and data architecture to training infrastructure and evaluation frameworks.
This is a high-autonomy role at the intersection of multimodal representation learning, large-scale systems design, and cross-team technical leadership. We're looking for someone who thrives in ambiguity: someone who can identify the highest-impact research problems, define the technical approach, and drive cross-team execution to deliver models that serve customers worldwide.
In this role, you will
Set the technical direction for next-generation multimodal embedding model architecture, training methodology, and data strategy
Own end-to-end model development from research planning through large-scale distributed training to production evaluation
Architect and optimize training infrastructure: distributed training pipelines, data processing systems, experiment workflows, and GPU utilization across the team's compute fleet
Drive data strategy: design large-scale data curation, filtering, and quality frameworks that systematically improve model performance
Define evaluation methodology and quality standards for embedding models, ensuring rigorous benchmarking that captures what matters
Co-design embedding architectures with the search team, optimizing for end-to-end retrieval quality rather than isolated benchmarks
Drive cross-functional alignment with search, product, and infrastructure teams on model integration and performance requirements
Raise the research engineering bar through design review, experiment review, and technical mentorship
Even if you don't check every box, we encourage you to apply.
If you're a zero-to-one achiever, a ferocious learner, and a kind team player who motivates others, you'll find a home at TwelveLabs.
You may be a good fit if you have
7+ years of industry experience in computer vision, NLP, or multimodal learning, with a track record of owning and shipping ML systems end-to-end
Demonstrated ability to take ambiguous, loosely-defined research problems and drive them to concrete, impactful solutions, from problem identification through delivery
Deep expertise in large-scale distributed model training (Kernel optimization, FSDP, or similar)
Strong experience in contrastive learning, representation learning, or foundation model training
Proven end-to-end ownership: not just running experiments, but defining what to build, building it, deploying it, and iterating on it in production
Strong proficiency in Python and PyTorch
Evidence of both research depth and engineering impact: publications paired with shipped products, not one or the other
We evaluate based on relevant technical skills and sustained industry impact. This role is typically a strong fit for engineers with an MS and deep industry experience who have evolved from individual contributor to technical leader in production ML environments.
Preferred Qualifications
Experience training models at billion-parameter scale
Experience with training operations: pipeline reliability, monitoring, fault tolerance, cost optimization
Experience with large-scale data curation and data quality systems
Experience with temporal video understanding or multimodal video modeling
Deep experience with training infrastructure optimization (GPU utilization, mixed precision, communication optimization)
Track record of technical leadership: driving architectural decisions that shaped team or product direction
What makes this role unique
The gap between research and production is remarkably short here. Models you build will be used by thousands of companies worldwide within months. We work as a unified team toward the broader goal of video understanding, rather than solving isolated problems. Our research philosophy balances rigorous experimentation with real-world application: we aim to build multimodal systems that are powerful, trustworthy, and genuinely useful.
Others
Work Location: Seoul Itaewon office + Pangyo satellite office
Hiring Process
Application Review β Recruiter Interview (λΉλλ©΄/30λΆ) β Loop Interview [Hiring Manager Interview&Live Coding Test Interview] (λλ©΄/μ½ 90λΆ) β System Design Interview(λλ©΄/μ½ 60λΆ) β Final Round Interview (λΉλλ©΄/μ½ 30λΆ) β Reference Check β Offer
Benefits and Perks
Growth & Tools
κΈλ‘λ² B2B κ³ κ°κ³Ό ν¨κ» μ±μ₯νλ Global Team
μμ¨μ±κ³Ό νμ μ λͺ¨λ κ°μΆ νμ΄λΈλ¦¬λ 근무
μ΅μ λ§₯λΆ λ° 70λ§ μ μλΉ μ¬ν근무 μ₯λΉ μ§μ, 3λ μ£ΌκΈ°λ‘ μ΅μ μ₯λΉ κ΅μ²΄
Tokens never sleep - Tech μ§κ΅° LLM ν ν° λ¬΄μ ν μ§μ
κ°μ, 컨νΌλ°μ€, λ©€λ²μ λ±μ μ¬μ© κ°λ₯ν μ° 140λ§μ μλΉ μκΈ°κ°λ°λΉ μ§μ
μμ΄ κ΅μ‘ νλ‘κ·Έλ¨ λ° κΈλ‘λ² λ²λ νλ‘κ·Έλ¨ μ΄μ
μΌκ° λ° μ£Όλ§ μΆν΄κ·Ό νμλΉ μ§μ
Meal & Snack
μλΉΒ·κ΅ν΅λΉ λ± μμ λ‘κ² μ¬μ©ν μ μλ μ° 720λ§μ μλΉ λ²μΈμΉ΄λ μ 곡
μ¬λ¬΄μ€ λ΄ μ€λ΅λ° μ΄μ (κ°μ, 컀νΌ, μ μ² κ³ΌμΌ λ±)
μ¬λ¬΄μ€ 근무 μ, μ€ν 7μ μ΄ν μ λ μλ μ 곡
Wellness & Family
μ° 1ν λ³ΈμΈ λ° κ°μ‘± 1μΈμ 건κ°κ²μ§ μ 곡
λ¨μ²΄λ³΄ν κ°μ (μν΄λ³΄ν/μΉμ보ν/κ°μ‘± μν΄λ³΄ν μ€ ν 1)
λ κ° μλ°©μ μ’ λΉ μ§μ
μ°λ§ 2μ£Όκ° μ κΈ Holiday Break μ΄μ
Frequently Asked Questions
Is the salary disclosed for the Staff ML Research Engineer, Marengo position at twelve-labs?
Is the Staff ML Research Engineer, Marengo job at twelve-labs remote?
Is the Staff ML Research Engineer, Marengo role at twelve-labs full-time or part-time?
Which team or department does the Staff ML Research Engineer, Marengo at twelve-labs belong to?
How do I apply for the Staff ML Research Engineer, Marengo position at twelve-labs?
When was the Staff ML Research Engineer, Marengo job at twelve-labs posted?
You'll be redirected to twelve-labs's official application page on Ashby ATS.