Senior DevOps Engineer (Cloud & ML Infrastructure)
Company: Kpler
Location: Location not specified (Remote)
Type: Full-time
Remote: Yes
Posted: 2026-05-05
About this role
At Kpler, we are dedicated to helping our clients navigate complex markets with ease. By simplifying global trade information and providing valuable insights, we empower organisations to make informed decisions in commodities, energy, and maritime sectors.
Since our founding in 2014, we have focused on delivering top-tier intelligence through user-friendly platforms. Our team of over 700 experts from 35+ countries works tirelessly to transform intricate data into actionable strategies, ensuring our clients stay ahead in a dynamic market landscape. Join us to leverage cutting-edge innovation for impactful results and experience unparalleled support on your journey to success.
Your future position
As a Senior Platform Engineer you will join the Cloud Platform team to design, operate, and evolve Kpler’s cloud-native infrastructure supporting backend, data, and ML workloads. You will operate within the existing platform engineering framework and contributes to overall reliability, scalability, and cost efficiency of the platform. In addition, you will bring hands-on experience running ML/AI and GPU-based workloads in production, helping the team standardize and strengthen this scope as it grows. This is a senior+ individual contributor role combining operational excellence, architectural input, and hands-on execution in a 24/7 production environment.
Key Responsibilities
- Design, operate, and improve Kpler’s cloud-native infrastructure (Kubernetes, networking, compute, storage).
- Contribute to Infrastructure as Code, CI/CD pipelines, and platform automation.
- Ensure high availability, reliability, and security of production systems.
- Improve observability, monitoring, alerting, and incident response processes.
- Reduce MTTR and failure rates through structured reliability improvements.
- Optimize infrastructure cost and performance, including compute-intensive workloads.
- Support and help standardize ML/GPU-based workloads within the exis...