AI Cloud Engineer (Full-Time, Remote US)
Company: TRSA | Association for Linen, Uniform and Facility Services Industry
Location: Manhattan, NY (Remote)
Type: Full-time
Remote: Yes
Posted: 2026-06-03
About this role
Company
: LockedIn AI
Location:
Remote (US-Based)
- Optional Hybrid (New York, NY) Reports To: Co-Founder / CEO Compensation: $140,000 – $195,000 USD per year
**About LockedIn AI
LockedIn AI is a fast-growing AI-native platform trusted by over one million users worldwide. We build real-time AI tools that help candidates succeed in job interviews, coding assessments, and professional meetings.
Our core product delivers live AI assistance during interviews and assessments—helping users communicate clearly, think faster, and perform at their best in high-pressure situations.
We are now scaling our infrastructure to support the next generation of AI-powered real-time systems.
Role Overview
We are looking for a cloud-native, AI-infrastructure-focused AI Cloud Engineer to design and operate the cloud systems that power our machine learning and real-time AI products.
This Role Sits At The Intersection Of Cloud Engineering, DevOps, And AI Systems Architecture. You Will Own The Infrastructure Layer That Supports:
Model training and fine-tuning pipelines
Real-time LLM inference systems
GPU-based distributed compute environments
High-scale production AI services for 1M+ users
You will be responsible for building highly scalable, cost-efficient, and low-latency cloud infrastructure optimized specifically for AI workloads.
Key Responsibilities**
- AI Cloud Architecture
Design cloud-native infrastructure for AI/ML workloads
Build GPU-based compute environments for training and inference
Architect multi-stage environments (training, staging, production)
Optimize AWS / GCP / Azure infrastructure for AI performance and scale
- Model Serving & Inference Systems
Build and maintain low-latency inference pipelines for LLMs and AI services
Deploy model serving frameworks (vLLM, Triton, TensorRT, TGI, etc.)
Optimize throughput, batching, caching, an...