Senior Cloud Operations Engineer – Plex
Company: Rockwell Automation
Location: Madison, WI (Remote)
Type: Full-time
Level: Senior
Remote: Yes
Posted: 2026-03-05
About this role
Rockwell Automation is a global technology leader focused on helping the world’s manufacturers be more productive, sustainable, and agile. With more than 28,000 employees who make the world better every day, we know we have something special. Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale, and focus on clean water and green mobility - our people are energized problem solvers that take pride in how the work we do changes the world for the better.
We welcome all makers, forward thinkers, and problem solvers who are looking for a place to do their best work. And if that’s you we would love to have you join us!
Job Description
Position Summary:
We are looking for a Senior Cloud Operations Engineer with a focus on Kubernetes and Automation to join our Plex Cloud Operations team. You will support the application tier both in our private and public cloud data centers. You will maintain and assist scaling our Kubernetes-based platform to ensure high availability, security, and performance. You will work closely with platform, development, security, and infrastructure teams to automate operations and improve multi-cluster management. You will also participate in an on-call rotation to support critical operations. You will report to the Cloud Operations Manager.
Your Responsibilities
- Maintain and improve our Kubernetes platform, ensuring high availability and scalability.
- Implement infrastructure/configuration as code to automate operations. (Terraform, Ansible, Helm, Flux, Kustomize)
- Enhance observability and logging using OpenTelemetry and Elastic.
- Building automated solutions that enable resiliency and self-healing of applications.
- Managing Server Operating Systems (Windows and Linux).
- Managing Web Servers (IIS 10).
- Troubleshoot production incidents, perform root cause analysis, and drive reliability improvements.
- Evaluate and implement cloud-native technolog...