Cloud Operations Engineer - Infrastructure

qZcsEs6SLAUmDv4AbsPYFC· Cloud & Backend
Apply Now ↗
📍 Irvine, California, United StatesFull time

About this role

ABOUT US:

Headquartered in the United States, TP-Link Systems Inc. is a global provider of reliable networking devices and smart home products, consistently ranked as the world’s top provider of Wi-Fi devices. The company is committed to delivering innovative products that enhance people’s lives through faster, more reliable connectivity. With a commitment to excellence, TP-Link serves customers in over 170 countries and continues to grow its global footprint.

We believe technology changes the world for the better! At TP-Link Systems Inc, we are committed to crafting dependable, high-performance products to connect users worldwide with the wonders of technology. 

Embracing professionalism, innovation, excellence, and simplicity, we aim to assist our clients in achieving remarkable global performance and enable consumers to enjoy a seamless, effortless lifestyle. 

KEY RESPONSIBILITIES

  • Design, build, and maintain reliable, scalable, and secure cloud-native infrastructure platforms supporting large-scale production workloads.
  • Operate and optimize multi-account AWS environments, ensuring infrastructure is secure, repeatable, and auditable through Infrastructure as Code tools such as Terraform.
  • Manage production Kubernetes clusters, including provisioning, upgrades, autoscaling, networking, observability, capacity planning, and day-to-day operations.
  • Build and operate Kubernetes ecosystem components such as CRDs, Helm, HPA, Cluster Autoscaler, CoreDNS, and Cluster API.
  • Operate and improve GitOps-based deployment workflows using tools such as FluxCD or ArgoCD.
  • Manage and enhance Istio service mesh capabilities, including traffic routing, service discovery, resilience, security, and service-to-service communication.
  • Define and improve reliability practices, including SLOs, Error Budgets, monitoring, alerting, incident response, and post-mortems.
  • Participate in a scheduled on-call rotation to support production cloud infrastructure and Kubernetes platforms.
  • Troubleshoot complex production issues across cloud infrastructure, Kubernetes, Linux systems, networking, and distributed services.
  • Drive automation for infrastructure provisioning, configuration management, CI/CD pipelines, observability, and operational workflows using Terraform, Go, Python, or similar technologies.
  • Collaborate with application engineering, architecture, security, and platform teams to improve infrastructure reliability, scalability, and operational efficiency.

REQUIRED QUALIFICATIONS

  • Bachelor’s degree or above in Computer Science, Software Engineering, Information Technology, or a related field.
  • 2+ years of hands-on experience in cloud infrastructure, Kubernetes operations, platform engineering, SRE, or related areas.
  • Strong knowledge of AWS services, including EKS, IAM, VPC, EC2, S3, and related networking and security capabilities.
  • Hands-on experience operating Kubernetes in production environments, including cluster architecture, workload orchestration, networking, autoscaling, and troubleshooting.
  • Familiarity with Kubernetes ecosystem tools such as CRDs, Helm, Cluster API, HPA, Cluster Autoscaler, and CoreDNS.
  • Experience with GitOps tools such as FluxCD or ArgoCD.
  • Solid Linux administration and troubleshooting skills, including systemd, networking, and performance analysis.
  • Experience with CI/CD pipelines and infrastructure automation using Terraform, Go, Python, or similar tools.
  • Good understanding of reliability engineering practices, including SLOs, incident response, monitoring, alerting, and post-mortems.
  • Strong problem-solving skills and ability to diagnose and resolve complex infrastructure issues in distributed systems.
  • Good communication skills and ability to collaborate effectively with cross-functional engineering teams.
  • Willingness to participate in a scheduled on-call rotation.

PREFERRED QUALIFICATIONS

  • Experience with NVIDIA device plugins, GPU scheduling, or GPU workload operations in Kubernetes environments.
  • Experience with additional public cloud platforms such as Azure or Alibaba Cloud.
  • Kubernetes certifications such as CKA, CKAD, or CKS are a plus.

Salary range: TBD

  • Free snacks and drinks
  • Fully paid medical, dental, and vision insurance (partial coverage for dependents)
  • Contributions to 401k funds
  • Bi-annual reviews, and annual pay increases
  • Health and wellness benefits, including free gym membership
  • Quarterly team-building events

At TP-Link Systems Inc., we are continually searching for ambitious individuals who are passionate about their work. We believe that diversity fuels innovation, collaboration, and drives our entrepreneurial spirit. As a global company, we highly value diverse perspectives and are committed to cultivating an environment where all voices are heard, respected, and valued. We are dedicated to providing equal employment opportunities to all employees and applicants, and we prohibit discrimination and harassment of any kind based on race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. Beyond compliance, we strive to create a supportive and growth-oriented workplace for everyone. If you share our passion and connection to this mission, we welcome you to apply and join us in building a vibrant and inclusive team at TP-Link Systems Inc.

Please, no third-party agency inquiries, and we are unable to offer visa sponsorships at this time.

Frequently Asked Questions

Is the salary disclosed for the Cloud Operations Engineer - Infrastructure position at qZcsEs6SLAUmDv4AbsPYFC?
The salary for this Cloud Operations Engineer - Infrastructure role at qZcsEs6SLAUmDv4AbsPYFC is not publicly listed. Click "Apply Now" to learn more about the compensation package on their official careers page.
Where is the Cloud Operations Engineer - Infrastructure position at qZcsEs6SLAUmDv4AbsPYFC located?
This Cloud Operations Engineer - Infrastructure role at qZcsEs6SLAUmDv4AbsPYFC is based in Irvine, California, United States. The position is listed as on-site or hybrid. Check the full job description or apply directly to confirm the work arrangement.
Is the Cloud Operations Engineer - Infrastructure role at qZcsEs6SLAUmDv4AbsPYFC full-time or part-time?
This is listed as a Full time position. It is posted as a Cloud Operations Engineer - Infrastructure role in the Cloud & Backend department at qZcsEs6SLAUmDv4AbsPYFC.
Which team or department does the Cloud Operations Engineer - Infrastructure at qZcsEs6SLAUmDv4AbsPYFC belong to?
This Cloud Operations Engineer - Infrastructure position is part of the Cloud & Backend department at qZcsEs6SLAUmDv4AbsPYFC. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Cloud Operations Engineer - Infrastructure position at qZcsEs6SLAUmDv4AbsPYFC?
Click the "Apply Now" button on this page. You will be redirected to qZcsEs6SLAUmDv4AbsPYFC's official application portal hosted on workable where you can submit your application directly.
When was the Cloud Operations Engineer - Infrastructure job at qZcsEs6SLAUmDv4AbsPYFC posted?
This Cloud Operations Engineer - Infrastructure position at qZcsEs6SLAUmDv4AbsPYFC was posted on Jun 4, 2026. Apply as soon as possible — early applications are often reviewed first.
Cloud Operations Engineer - Infrastructure
qZcsEs6SLAUmDv4AbsPYFC
Apply for this role ↗

You'll be redirected to qZcsEs6SLAUmDv4AbsPYFC's official application page on workable.