AI Cloud Engineer (Full-Time, Remote US)

Company: TRSA | Association for Linen, Uniform and Facility Services Industry

Location: Manhattan, NY (Remote)

Type: Full-time

Remote: Yes

Posted: 2026-06-03

About this role

Company
: LockedIn AI


Location:
Remote (US-Based)

  • Optional Hybrid (New York, NY) Reports To: Co-Founder / CEO Compensation: $140,000 – $195,000 USD per year

**About LockedIn AI
LockedIn AI is a fast-growing AI-native platform trusted by over one million users worldwide. We build real-time AI tools that help candidates succeed in job interviews, coding assessments, and professional meetings.


Our core product delivers live AI assistance during interviews and assessments—helping users communicate clearly, think faster, and perform at their best in high-pressure situations.


We are now scaling our infrastructure to support the next generation of AI-powered real-time systems.


Role Overview
We are looking for a cloud-native, AI-infrastructure-focused AI Cloud Engineer to design and operate the cloud systems that power our machine learning and real-time AI products.


This Role Sits At The Intersection Of Cloud Engineering, DevOps, And AI Systems Architecture. You Will Own The Infrastructure Layer That Supports:


Model training and fine-tuning pipelines


Real-time LLM inference systems


GPU-based distributed compute environments


High-scale production AI services for 1M+ users


You will be responsible for building highly scalable, cost-efficient, and low-latency cloud infrastructure optimized specifically for AI workloads.


Key Responsibilities**

  • AI Cloud Architecture

Design cloud-native infrastructure for AI/ML workloads


Build GPU-based compute environments for training and inference


Architect multi-stage environments (training, staging, production)


Optimize AWS / GCP / Azure infrastructure for AI performance and scale

  • Model Serving & Inference Systems

Build and maintain low-latency inference pipelines for LLMs and AI services


Deploy model serving frameworks (vLLM, Triton, TensorRT, TGI, etc.)


Optimize throughput, batching, caching, an...

Create Your Job Alert

Other AI Jobs

Other Jobs in Manhattan