Software Engineer, Reliability & Platform
Company: Recruits Lab
Location: New York, NY
Salary: $170,000 - $220,000 a year
Type: Full-time
Posted: 2026-03-17
About this role
Location: New York, NY | Salary: $170,000–$220,000 | Equity: Competitive | Full-Time
About the Role
We are partnering with a fast-growing AI-powered fintech startup to hire a Software Engineer, Reliability & Platform. This is a high-impact role for an engineer who thrives in lean teams, has experience with scale, and wants ownership of a platform’s technical foundation.
You’ll be responsible for infrastructure, scalability, reliability, observability, and code quality across the full stack. You’ll investigate complex production issues, optimize backend services, improve iOS app performance, and strengthen cloud infrastructure. This role reports to the Head of Engineering and partners closely with leadership on platform-level improvements.
Tech Stack
- Frontend: Swift + SwiftUI
- Backend: Node.js (Express) + TypeScript
- Database: MongoDB
- Cloud & Infrastructure: AWS (Elastic Beanstalk, EC2, CloudWatch, CodeDeploy)
Key Responsibilities
- Lead deep investigations into production issues across iOS, backend services, and cloud infrastructure
- Perform root-cause analysis across APIs, async workflows, data models, third-party integrations, and infrastructure
- Optimize backend services for reliability, performance, and correctness
- Improve MongoDB queries, indexes, and data access patterns
- Strengthen AWS infrastructure, deployments, and monitoring
- Enhance observability using structured logging, metrics, and CloudWatch
- Reduce recurring bugs via improved validation, error handling, and system boundaries
- Improve CI/CD and release processes to reduce production risk
- Build internal tools to boost debugging efficiency and engineering velocity
- Partner with product and engineering to drive platform-level improvements
Must-Have Qualifications
- 3–7+ years of engineering experience with exposure to scalable systems
- Strong debugging skills in live production environments
- Deep understanding of APIs, async flows, an...