AI Researcher – Multilingual Data

featherlessai · Research
Apply Now ↗
🌍 Remote · 📍 Remote (world) · Full-time

About the Role

We’re looking for an AI Researcher focused on multilingual data to help us build and scale next-generation language models across diverse languages and domains. You’ll own research and execution around data sourcing, curation, evaluation, and training strategies for multilingual and low-resource languages, with a strong emphasis on publishing high-quality research and translating it into production systems.

This role is ideal for someone who enjoys working close to the frontier: balancing papers, prototypes, and real-world impact in a fast-moving startup environment.

What You’ll Do

  • Design and execute research on multilingual datasets, including data collection, filtering, deduplication, and quality measurement

  • Develop strategies for low-resource and long-tail languages (sampling, augmentation, curriculum design)

  • Research and improve cross-lingual transfer, alignment, and robustness in large language models

  • Build and maintain evaluation benchmarks for multilingual performance

  • Collaborate with engineers and researchers on training pipelines and model architecture decisions

  • Publish research at top venues (e.g., ACL, EMNLP, NeurIPS, ICML, ICLR) and contribute to open-source when appropriate

  • Translate research insights into practical improvements in production models

What We’re Looking For

  • Strong background in NLP / ML research, with a focus on multilingual or cross-lingual modeling

  • Publication record at respected conferences or journals (ACL, EMNLP, NeurIPS, ICML, ICLR, etc.)

  • Experience working with large-scale text datasets across multiple languages

  • Solid understanding of:

    • Tokenization and vocabulary design for multilingual models

    • Data quality metrics, filtering, and dataset bias

    • Transfer learning and multilingual representation learning

  • Comfort prototyping in Python with modern ML frameworks (PyTorch, JAX, etc.)

  • Ability to operate independently and ship research at startup pace

Nice to Have

  • Experience with low-resource languages or non-Latin scripts

  • Open-source contributions in NLP or data tooling

  • Experience training or evaluating large language models

  • Familiarity with multilingual benchmarks (e.g., XTREME, FLORES, TyDi QA)

Why Join Us

  • Real ownership over research direction and impact

  • A team that values papers and production

  • Access to meaningful scale: large datasets, modern infrastructure, and fast iteration

  • Competitive compensation and meaningful equity at an early stage

Frequently Asked Questions

Is the salary disclosed for the AI Researcher – Multilingual Data position at featherlessai?
The salary for this AI Researcher – Multilingual Data role at featherlessai is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Is the AI Researcher – Multilingual Data job at featherlessai remote?
Yes, this AI Researcher – Multilingual Data position at featherlessai is remote, with the location listed as Remote (world). You can work from home or anywhere in the supported regions.
Is the AI Researcher – Multilingual Data role at featherlessai full-time or part-time?
This is listed as a full-time position. It is posted as an AI Researcher – Multilingual Data role in the Research department at featherlessai.
Which team or department does the AI Researcher – Multilingual Data at featherlessai belong to?
This AI Researcher – Multilingual Data position is part of the Research department at featherlessai. See the full job description for more information about the team structure and responsibilities.
How do I apply for the AI Researcher – Multilingual Data position at featherlessai?
Click the "Apply Now" button on this page. You will be redirected to featherlessai's official application portal, hosted on Ashby, where you can submit your application directly.
When was the AI Researcher – Multilingual Data job at featherlessai posted?
This AI Researcher – Multilingual Data position at featherlessai was posted on Jan 23, 2026. Apply as soon as possible — early applications are often reviewed first.