R&D-029 AI Engineer (VLAs)
About this role
â»æ¥æ¬èªçãç¶ããŸãã
About AIRoA
The AI Robot Association (AIRoA) is launching a groundbreaking initiative: collecting one million hours of humanoid robot operation data with hundreds of robots, and leveraging it to train the worldâs most powerful Vision-Language-Action (VLA) models.
What makes AIRoA unique is not only the unprecedented scale of real-world data and humanoid platforms, but also our commitment to making everything open and accessible. We are building a shared ârobot data ecosystemâ where datasets, trained models, and benchmarks are available to everyone. Researchers around the world will be able to evaluate their models on standardized humanoid robots through our open evaluation platform.
Job Description
- Develop Vision-Language models, or equivalent multimodal models, with a view toward applications in the robotics domain
- Fine-tune existing models, conduct evaluations, perform error analysis, and improve performance
- Build training pipelines using real-world data, design evaluation metrics, and operate iterative improvement cycles
- Prepare and preprocess data, and build training environments for image, video, language, and action data
- Research the latest trends in technologies and academic studies, select appropriate technologies, and incorporate findings into model improvements
- Establish training infrastructure, inference infrastructure, and experimental environments for real-world model deployment
- Collaborate with related teams such as software engineers and robotics engineers to define requirements, design validation plans, and drive development
AIRoAã«ã€ããŠ
AI Robot AssociationïŒAIRoAïŒã¯ãç»æçãªåãçµã¿ãéå§ããŸããæ°çŸå°ã®ãã¥ãŒããã€ããããããçšããŠããã¥ãŒããã€ãããããã®æäœããŒã¿ã100äžæéååéãããããæŽ»çšããŠVision-Language-ActionïŒVLAïŒã¢ãã«ãåŠç¿ãããŸãã
ç§ãã¡ã®ç¬èªæ§ã¯ãå®äžçããŒã¿ãšãã¥ãŒããã€ããã©ãããã©ãŒã ã®åäŸã®ãªãèŠæš¡ã ãã§ã¯ãªãããããããã®ããªãŒãã³ã§ã¢ã¯ã»ã¹å¯èœã«ãããšããã³ãããã¡ã³ãã«ããããŸããAIRoAã¯ããŒã¿ã»ãããåŠç¿æžã¿ã¢ãã«ããã³ãããŒã¯ã誰ããå©çšã§ããå ±æã®ãããããã»ããŒã¿ã»ãšã³ã·ã¹ãã ãã®æ§ç¯ãç®æããŠããŸããå®çŸãæåããã°ãäžçäžã®ç ç©¶è ããç§ãã¡ã®ãªãŒãã³è©äŸ¡ãã©ãããã©ãŒã ãéããŠãæšæºåããããã¥ãŒããã€ãããããäžã§èªãã®ã¢ãã«ãè©äŸ¡ã§ããããã«ãªãããšãæåŸ ããŠããŸãã
æ¥åå 容
- ãããã£ã¯ã¹é åãžã®å¿çšãèŠæ®ãã Vision-Languageã¢ãã«ããŸãã¯ããã«æºãããã«ãã¢ãŒãã«ã¢ãã«ã®éçº
- æ¢åã¢ãã«ã® fine-tuningãè©äŸ¡ããšã©ãŒåæãæ§èœæ¹å
- å®ããŒã¿ãçšããåŠç¿ãã€ãã©ã€ã³æ§ç¯ãè©äŸ¡ææšèšèšãæ¹åãµã€ã¯ã«éçš
- ç»åã»åç»ã»èšèªã»è¡åããŒã¿çã察象ãšããããŒã¿æŽåãååŠçãåŠç¿ç°å¢æ§ç¯
- ææ°ã®ç ç©¶ã»æè¡ååã®èª¿æ»ãæè¡éžå®ãããã³ã¢ãã«æ¹åãžã®åæ
- ã¢ãã«ã®å®éçšãèŠæ®ããåŠç¿åºç€ã»æšè«åºç€ã»å®éšç°å¢ã®æŽå
- ãœãããŠã§ã¢ãšã³ãžãã¢ããããã£ã¯ã¹ãšã³ãžãã¢çã®é¢é£ããŒã ãšé£æºããèŠä»¶æŽçãæ€èšŒèšèšãéçºæšé²
â»æ¥æ¬èªçãç¶ããŸãã
Required Qualifications
- Experience leading machine learning models from deployment to improvement and operation in a production service environment
- Experience implementing, training, and evaluating machine learning models using Python and PyTorch
- Hands-on experience fine-tuning Vision-Language models, or equivalent multimodal models
- Experience building training pipelines with real-world data, designing evaluations, conducting error analysis, and operating improvement loops
- Ability to understand the latest research and technology trends and translate them into model improvements and practical product applications
Preferred Qualifications
- Experience developing Vision-Language-Action (VLA) models or multimodal models for robotics
- Experience with robot control, ROS / ROS 2, C++, and real-world hardware evaluation
- Knowledge of or experience in sensor integration, actuator control, action generation, and low-level control
- Familiarity with training and evaluation using simulators, Sim2Real, and domain adaptation
- Experience building training and inference infrastructure in cloud environments such as AWS or GCP
- Experience with reproducible and operationally robust development practices such as Docker, CI/CD, and MLOps
å¿ é èŠä»¶
- å®ãµãŒãã¹ç°å¢ã«ãããŠãæ©æ¢°åŠç¿ã¢ãã«ã®å°å ¥ããæ¹åã»éçšãŸã§æºãã£ãçµéš
- Python / PyTorch ãçšãã MLã¢ãã«ã®å®è£ ã»åŠç¿ã»è©äŸ¡ ã®çµéš
- Vision-Languageã¢ãã«ããŸãã¯ããã«æºãããã«ãã¢ãŒãã«ã¢ãã« ã® fine-tuning ã®å®åçµéš
- å®ããŒã¿ãçšããåŠç¿ãã€ãã©ã€ã³æ§ç¯ãè©äŸ¡èšèšããšã©ãŒåæãæ¹åã«ãŒãéçšã®çµéš
- ææ°ã®ç ç©¶ã»æè¡ååãçè§£ããã¢ãã«æ¹åãå®ãããã¯ããžã®å¿çšã«èœãšã蟌ããèœå
æè¿èŠä»¶
- Vision-Language-ActionïŒVLAïŒã¢ãã«ããŸãã¯ãããã£ã¯ã¹åããã«ãã¢ãŒãã«ã¢ãã«ã®éçºçµéš
- ããããå¶åŸ¡ãROS / ROS 2ãC++ã宿©è©äŸ¡ã®çµéš
- ã»ã³ãµçµ±åãã¢ã¯ãã¥ãšãŒã¿å¶åŸ¡ãè¡åçæãäœã¬ãã«å¶åŸ¡ã«é¢ããç¥èãŸãã¯çµéš
- ã·ãã¥ã¬ãŒã¿ãçšããåŠç¿ã»è©äŸ¡ãSim2RealãDomain Adaptationãªã©ã«é¢ããç¥èŠ
- AWS / GCP çã®ã¯ã©ãŠãç°å¢ãçšããåŠç¿ã»æšè«åºç€ã®æ§ç¯çµéš
- DockerãCI/CDãMLOps ãªã©åçŸæ§ã»éçšæ§ãæèããéçºçµéš
âWork location
Tokyo Ryutsu Center A Bldg. AW4-5, 6-1-1 Heiwajima, Ota-ku, Tokyo 143-0006, Japan
Frequently Asked Questions
Is the salary disclosed for the R&D-029 AI Engineer (VLAs) position at AI Robot Association?
Where is the R&D-029 AI Engineer (VLAs) position at AI Robot Association located?
Is the R&D-029 AI Engineer (VLAs) role at AI Robot Association full-time or part-time?
Which team or department does the R&D-029 AI Engineer (VLAs) at AI Robot Association belong to?
How do I apply for the R&D-029 AI Engineer (VLAs) position at AI Robot Association?
When was the R&D-029 AI Engineer (VLAs) job at AI Robot Association posted?
You'll be redirected to AI Robot Association's official application page on workable.