Member of Technical Staff - Large Scale Data Infrastructure

blackforestlabs· Engineering
Apply Now ↗

About this role

About Black Forest Labs

We’re the team behind Latent Diffusion, Stable Diffusion, and FLUX—foundational technologies that changed how the world creates images and video. We’re creating the generative models that power how people make images and video—tools used by millions of creators, developers, and businesses worldwide. Our FLUX models are among the most advanced in the world, and we’re just getting started.

Headquartered in Freiburg, Germany with a growing presence in San Francisco, we’re scaling fast while staying true to what makes us different: research excellence, open science, and building technology that expands human creativity.

Why This Role

We're looking for infrastructure engineers who want to work at peta-to-exabyte scale. You'll build the data systems behind the largest training runs on thousands of GPUs, where fixing one bottleneck lets researchers train the next breakthrough model.

What You’ll Work On

  • Scalable data loaders for training runs across thousands of GPUs
  • Efficient storage and retrieval systems for petabyte-scale datasets
  • Multi-cloud object storage abstraction
  • Execute large-scale data migrations across storage systems and providers
  • Debug and resolve performance bottlenecks in distributed data loading

Technical Focus

  • Python, PyTorch DataLoader internals
  • Object storage (e.g. S3, Azure Blob, GCS)
  • Parquet for metadata
  • Video: ffmpeg, PyAV, codec fundamentals

What We’re Looking For

  • Built and operated data pipelines at petabyte scale
  • Optimized data loading
  • Worked with petabyte-scale video and image datasets
  • Written processing jobs operating on millions of files
  • Debugged distributed system bottlenecks across large fleets of machines

Nice to have:

  • Experience streaming dataset formats (e.g. WebDataset)
  • Video codec internals and frame-accurate seeking
  • Distributed systems experience
  • Slurm and Kubernetes for job orchestration
  • Experience with object storage performance tuning across providers

How We Work Together

We’re a distributed team with real offices that people actually use. Depending on your role, you’ll either join us in Freiburg or SF at least 2 days a week (or one full week every other week), or work remotely with a monthly in-person week to stay connected. We’ll cover reasonable travel costs to make this possible. We think in-person time matters, and we’ve structured things to make it accessible to all. We’ll discuss what this will look like for the role during our interview process.

Everything we do is grounded in four values:

  • Obsessed. We are a frontier research lab. The science has to be right, the understanding deep, the product beautiful.
  • Low Ego. The work speaks. The best idea wins, no matter who said it. Credit is shared. Nobody is above any task.
  • Bold. We take the ambitious bet. We ship, we do not wait for conditions to be perfect.
  • Kind. People over politics. We treat each other with genuine warmth. Agency without empathy creates chaos.

If this sounds like work you’d enjoy, we’d love to hear from you.

Base Annual Salary:

EU €100,000 - €320,000 + Equity

US $150,000 - $400,000 + Equity

Frequently Asked Questions

Is the salary disclosed for the Member of Technical Staff - Large Scale Data Infrastructure position at blackforestlabs?
The salary for this Member of Technical Staff - Large Scale Data Infrastructure role at blackforestlabs is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Where is the Member of Technical Staff - Large Scale Data Infrastructure position at blackforestlabs located?
This Member of Technical Staff - Large Scale Data Infrastructure role at blackforestlabs is based in Freiburg (Germany), San Francisco (USA). The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Which team or department does the Member of Technical Staff - Large Scale Data Infrastructure at blackforestlabs belong to?
This Member of Technical Staff - Large Scale Data Infrastructure position is part of the Engineering department at blackforestlabs. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Member of Technical Staff - Large Scale Data Infrastructure position at blackforestlabs?
Click the "Apply Now" button on this page. You will be redirected to blackforestlabs's official application portal hosted on greenhouse where you can submit your application directly.
When was the Member of Technical Staff - Large Scale Data Infrastructure job at blackforestlabs posted?
This Member of Technical Staff - Large Scale Data Infrastructure position at blackforestlabs was posted on Dec 4, 2025. Apply as soon as possible — early applications are often reviewed first.
Member of Technical Staff - Large Scale Data Infrastructure
blackforestlabs
Apply for this role ↗

You'll be redirected to blackforestlabs's official application page on Greenhouse.