Senior DevOps Engineer
Company: ISHIR
Location: Remote
Type: Full-time
Remote: Yes
Posted: 2026-04-01
About this role
Profile Summary:
We are looking for a DevOps Engineer with experience in Kubernetes, CI/CD (GitLab CI), and AWS. The ideal candidate is skilled in automation, scripting (Bash/Python), and observability tools such as Prometheus and OpenTelemetry, and is focused on reliable deployments, issue resolution, and efficient platform operations.
Shift Timings: Complete EST Overlap
Location: Remote
Role and Responsibilities:
- Operate and improve platform tools so product teams can ship reliably: triage tickets, fix build issues, and handle routine service requests (access, secrets, environment setup).
- Maintain and extend self-service workflows (templates, golden paths) by updating docs, examples, and guardrails under guidance from senior engineers.
- Perform day-to-day Kubernetes operations: deploy/update Helm charts, manage namespaces, diagnose rollout issues, and follow runbooks for incident response.
- Support CI/CD pipelines (e.g., GitLab CI): keep pipelines green, add/adjust jobs, implement basic quality gates, and help teams adopt safer deploy strategies (blue/green, canary).
- Monitor and operate the observability stack using Prometheus, Alertmanager, and Thanos; maintain alert rules, dashboards, and SLO/SLA indicators; help reduce alert noise and improve signal quality.
- Assist with service instrumentation across the core observability pillars—tracing, logging, and metrics—with hands-on OpenTelemetry usage (collectors/SDKs) and related telemetry tooling.
- Contribute to and improve documentation: runbooks, FAQs, onboarding guides, and standard operating procedures.
- Participate in an on-call rotation as needed with a well-defined escalation path; assist during incidents, post small fixes, and capture learnings in docs.
- Help with cost- and performance-minded housekeeping: right-size workloads, prune unused resources, and automate routine tasks where appropriate.
WE'RE LOOKING FOR SOMEONE WHO HAS:
- 8+ years in a platform/SRE/DevOps ...