Lead Systems Engineer
About this role
Together we make breakthroughs possible.Â
At OCLC, we build technology with a purpose: to connect libraries and make knowledge accessible worldwide, because we believe that what is known must be shared. Our teams work with complex global datasets, AI and machine learning, hybrid cloud solutions, and other technologies that connect people and organizations to the information they need. We value the power of unique perspectives and experiences to unlock innovation. At OCLC, your ideas matter, whether you have two years of experience or 20. Youâll learn, create, and problem-solve with technologists, product developers, librarians, researchers, marketing pros, and support teams around the world.Â
Why join OCLC?Â
OCLC is consistently recognized as a best place to work by several independent programs. We recognize and reward people and results with a comprehensive Total Rewards package. This means competitive compensation that reflects your unique contributionsâperformance, experience, and skillsâalong with exceptional benefits, including best-in-class health coverage, retirement plans with generous company contributions, and a commitment to your overall well-being.
We know the best ideas donât always happen at a desk. Take a walking meeting around our 100-acre campus or enjoy lunch on the patio. Weâre committed to your successâboth personally and professionally. Hybrid work environment: For many roles, three days a week on-site, with occasional additional days based on business needs.Â
Free use of our on-site ïŹtness center, gym sports, group exercise classes, and game roomÂ
Onsite catering and cafeteria subsidized by OCLCÂ
Health and wellness eventsÂ
Work environments with individual and team spaces and the latest technology toolsÂ
Paid parental leave and adoption assistanceÂ
Tuition reimbursement and Public Service Loan Forgiveness eligibilityÂ
Company-subsidized pricing on local tickets and membershipsÂ
Join us in transforming how people everywhere access information and be part of a mission-driven team that makes a global impact.Â
The job details are as follows:
Weâre hiring a Lead DevOps Engineer to raise the standard of how we build, test, deploy, and operate software. This is a hands-on role with strong technical ownership and a developer enablement mindset: youâll reduce deployment friction, improve environment reliability and quality, strengthen observability, and lead incident resolution through to completion.Youâll lead through standards and influenceâbuilding reusable automation, mentoring others, and driving improvements across teams (including coordination with EMEA DevOps counterparts).
What youâll do
- Lead automation initiatives that eliminate repetitive tasks and reduce operational toil.
- Build and maintain Ansible automation to provision new environments and keep existing environments up to date.
- Propose and lead platform improvement projects using tools such as Ansible, Rundeck, and CI/CD systems.
- Design and improve CI/CD pipelines and deployment automation with safe rollout/rollback strategies and clear environment promotion.
- Enable developers through reusable âpaved roadâ tooling: templates, golden pipelines, self-service workflows, and guardrails that reduce manual work and tribal knowledge.
- Partner with engineering teams to improve delivery quality through:
- automated integration and regression testing,
- deployment validation and smoke testing,
- reliable and repeatable test/pre-production environments,
- quality gates that catch issues earlier.
- Improve observability across services and infrastructure (monitoring, logging, alerting, tracing), including visibility into deployment outcomes and failures.
- Lead analysis and resolution of production incidents across infrastructure, application, database, and network layers; drive RCAs and prevention work.
- Oversee platform patching and upgrades; plan, schedule, and monitor maintenance tasks.
- Coordinate and implement server/platform changes required by customers and internal teams.
- Document systems and processes, transfer knowledge, and mentor engineers to raise technical standards across the organization.
- Communicate proactively with stakeholders, manage multiple requests, and prioritize work effectively.
What success looks like
- Manual operational work is automated or removed; fewer repetitive tasks and fewer âonly one person knowsâ processes.
- Faster, safer releases with stronger validation, clearer rollback paths, and improved release confidence.
- More reliable environments and improved readiness for new customer onboarding.
- Better visibility into platform health and incidents: higher signal, less alert noise, faster diagnosis and recovery.
- Clear standards and reusable tooling adopted across teams; improved developer experience and reduced deployment friction.
Required qualifications
- Bachelorâs degree in Computer Science (or equivalent)Â or equivalent professional experience.
- 6+ years of RedHat Linux server administration experience, including production troubleshooting and log triage.
- 6+ years of extensive Ansible scripting and automation experience (or equivalent configuration management).
- 6+ years of scripting experience in Bash and Python.
- Experience building and operating CI/CD pipelines and deployment automation.
- Strong troubleshooting skills across distributed systems (infrastructure, application, database, and network layers).
- Strong working knowledge of MySQL (query language required); Postgres experience is a plus.
- Excellent communication skills and the ability to lead through planning, prioritization, and influence.
- Proven ability to context switch, manage multiple stakeholder requests, and deliver reliably under deadlines.
Preferred qualifications
- Container infrastructure design and implementation; experience with Docker; Kubernetes/Helm a plus.
- Experience with Rundeck, Jenkins, SOLR, and/or ETCD.
- Experience with monitoring/logging/alerting and modern observability practices (SLOs/SLIs, change correlation, incident reduction).
- Networking fundamentals (DNS, routing, connectivity troubleshooting).
- Familiarity with standard change management practices (e.g., ITIL).
- General programming knowledge/structure; Java familiarity is a plus.
- Experience with progressive delivery (canary/blue-green), feature flags, IaC (Terraform/CloudFormation), and secrets management (Vault or equivalent).
- Interest in extending observability to automated workflows and AI/agent activity (execution tracing, failures, permissions, cost visibility).
Working style
This role requires strong ownership, organization, attention to detail, proactive stakeholder communication, and a bias toward automation and repeatability. Youâll be expected to take ambiguous, high-impact problems through to resolution and leave the platform better than you found it.
The Wise Lead System Engineer functions as an embedded subject matter expert and technical project leader working from within the OCLC Wise development team. OCLC practices a hybrid work location model allowing at least 3 days a week in the office and 2 days remote.
Frequently Asked Questions
Is the salary disclosed for the Lead Systems Engineer position at oclc?
Where is the Lead Systems Engineer position at oclc located?
Is the Lead Systems Engineer role at oclc full-time or part-time?
Which team or department does the Lead Systems Engineer at oclc belong to?
How do I apply for the Lead Systems Engineer position at oclc?
You'll be redirected to oclc's official application page on Workday.