Signal Engineer

maincodeยท Data
Apply Now โ†—
๐Ÿ“ MelbourneFullTime

About this role

About the role

Matilda is Australia's LLM. What ends up in the corpus is what the model learns, so the quality of the data sets the ceiling on the quality of the model.

We're hiring a Signal Engineer to own that ceiling. You will build the pipelines that turn massive, messy, raw data into the dataset Matilda trains on. The work is part engineering, part editorial judgment, done in code.

A lot of the real gains in frontier models come from the data, and most of that work is underinvested in across the field. It is one of the highest-leverage places you can spend your time as an engineer.


What you'll work on

- Pipelines that ingest, clean, dedupe, filter, and score training data at TB to PB scale

- Quality classifiers and heuristics that separate useful data from the rest

- Dataset mixture design and experiments on what actually improves the model

- Tools to explore, sample, and audit what's in the corpus

- Close work with researchers and training engineers so data choices connect to model behaviour

What we're looking for

- Strong engineer. Python, data tooling, distributed processing, clean pipelines.

- High attention to detail. Small errors compound fast at this scale.

- Taste and judgment about what good training data looks like.

- Comfort working with very large, very messy datasets.

- Curiosity about how data shapes model behaviour.

- High learning velocity. You don't need a PhD or prior LLM experience.

Nice to have

- Experience with web-scale corpora or pretraining data pipelines

- Experience working with unstructured text data

- Familiarity with distributed data frameworks (Spark, Ray, or similar)

- Exposure to deduplication, quality classification, or tokenisation

Note

Full-time role based in Melbourne, working closely with our in-person team. At this time we are not able to offer visa sponsorship, so applicants must have existing and unrestricted work rights in Australia.

Frequently Asked Questions

Is the salary disclosed for the Signal Engineer position at maincode?
The salary for this Signal Engineer role at maincode is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Where is the Signal Engineer position at maincode located?
This Signal Engineer role at maincode is based in Melbourne. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Is the Signal Engineer role at maincode full-time or part-time?
This is listed as a FullTime position. It is posted as a Signal Engineer role in the Data department at maincode.
Which team or department does the Signal Engineer at maincode belong to?
This Signal Engineer position is part of the Data department at maincode. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Signal Engineer position at maincode?
Click the "Apply Now" button on this page. You will be redirected to maincode's official application portal hosted on ashby where you can submit your application directly.
When was the Signal Engineer job at maincode posted?
This Signal Engineer position at maincode was posted on Apr 23, 2026. Apply as soon as possible โ€” early applications are often reviewed first.
Signal Engineer
maincode
Apply for this role โ†—

You'll be redirected to maincode's official application page on Ashby ATS.