Network Reliability Engineer
margo-group · Polska Team
About this role
#HPC #AI #GPU #CLUSTERS
YOUR DAILY ROUTINE
- Build a large AI infrastructure with monitoring, diagnosis, and remediation of production incidents
- Troubleshoot high-impact production issues in collaboration with other engineering teams
- Participate in an on-call rotation to handle incidents and ensure service continuity
- Implement and maintain observability solutions to monitor AI infrastructure and application health
- Contribute to AI infrastructure lifecycle management across different environments and countries
- Promote and apply best practices for stability, resiliency, scalability, and security
- Maintain clear technical documentation for tools and procedures
- Contribute to system and tool evolution based on production feedback
- Collaborate closely with development teams to ensure infrastructure readiness
- Participate in team rituals and knowledge-sharing initiatives
ABOUT YOU
🎯 SOFT SKILLS:
- Proactive and solution-oriented mindset
- Passion for automation and continuous improvement
- Strong collaboration and communication skills
- Ability to work independently and in a team
- Willingness to mentor and share knowledge
💻 HARD SKILLS:
- Experience with Go or Python
- Strong scripting skills (Bash, Python)
- Hands-on experience with Linux systems (Ubuntu/Debian)
- Hands-on experience with GPU & HPC infrastructure (preferred)
- Knowledge of networking (VLAN/LAN, TCP/IP, DNS, BGP, load-balancing, IPv6, etc.)
- Familiarity with monitoring and logging tools (Prometheus, Grafana, Elastic, etc.)
- Comfortable with Infrastructure-as-Code (Ansible, Salt, AWX, etc.)
- Experience managing relational databases (MariaDB)
- Understanding of CI/CD pipelines (GitLab)
- Comfortable with English (written and spoken)
Frequently Asked Questions
What is the salary for the Network Reliability Engineer role at margo-group?
The listed salary for this Network Reliability Engineer position at margo-group is PLN 200–250. This is a Permanent contract & B2B role.
Where is the Network Reliability Engineer position at margo-group located?
This Network Reliability Engineer role at margo-group is based in Warsaw. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Is the Network Reliability Engineer role at margo-group full-time or part-time?
This is listed as a Permanent contract & B2B position. It is posted as a Network Reliability Engineer role in the Polska Team department at margo-group.
Which team or department does the Network Reliability Engineer at margo-group belong to?
This Network Reliability Engineer position is part of the Polska Team department at margo-group. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Network Reliability Engineer position at margo-group?
Click the "Apply Now" button on this page. You will be redirected to margo-group's official application portal, hosted on Lever, where you can submit your application directly.
When was the Network Reliability Engineer job at margo-group posted?
This Network Reliability Engineer position at margo-group was posted on Jun 9, 2026. Apply as soon as possible — early applications are often reviewed first.