Software Engineer, Cloud Infrastructure (Multiple Seniority Levels)
Company: Beacon AI
Location: San Carlos, CA (Remote)
Salary: $130,000 - $225,000 a year
Type: Full-time
Remote: Yes
Posted: 2026-05-05
# About Beacon AI
We’re a fast-moving team of aviators, engineers, and operators building an AI platform to make flying safer, more efficient, and more capable. Backed by top investors, we’ve secured a dozen Department of Defense contracts and partnered with major airlines to deliver mission-critical systems. We operate without silos or heavy processes. Small, focused teams own what they build, ship quickly, and learn fast, pushing the boundaries of how humans and AI work together in aviation.
# Role Overview
We are seeking skilled Cloud and ML Infrastructure Engineers to lead the buildout of our AWS foundation and our LLM platform. You will design, implement, and operate services that are scalable, reliable, and secure.
Because the scope is broad, depth in LLM/ML infrastructure or IoT infrastructure is a strong plus. On the ML infrastructure side, you will build the stack that powers retrieval-augmented generation and application workflows built with frameworks like LangChain. Experience with AWS IoT services is also a plus.
You will work closely with other engineers and product management. The ideal candidate is hands-on, comfortable with ambiguity, and excited to build from first principles.
# Key Responsibilities
- **Cloud Infrastructure Setup and Maintenance**
  - Design, provision, and maintain AWS infrastructure using IaC tools such as AWS CDK or Terraform.
  - Build CI/CD and testing for apps, infra, and ML pipelines using GitHub Actions, CodeBuild, and CodePipeline.
  - Operate secure networking with VPCs, PrivateLink, and VPC endpoints. Manage IAM, KMS, Secrets Manager, and audit logging.
- **LLM Platform and Runtime**
  - Stand up and operate model endpoints using AWS Bedrock and/or SageMaker; evaluate when to use ECS/EKS, Lambda, or Batch for inference jobs.
  - Build and maintain application services that call LLMs through clean APIs, with streaming, batching, and backoff strategies.
  - Implement prompt and tool execution flows with LangChain or similar, including agent ...