Senior Software Engineer - Storage
Company: Nvidia
Location: US, CA, Santa Clara (Remote)
Salary: $152k - $241.5k per year
Type: Full-time
Remote: Yes
Posted: 2026-06-12
About this role
NVIDIA is a pioneer in accelerated computing, known for inventing the GPU and driving breakthroughs in gaming, computer graphics, high-performance computing, and artificial intelligence. Our technology powers everything from generative AI to autonomous systems, and we continue to shape the future of computing through innovation and collaboration. Within this mission, our team, Managed AI Research Superclusters (MARS), builds and scales the infrastructure, platforms, and tools that enable researchers and engineers to develop the next generation of AI/ML systems. By joining us, you’ll help design solutions that power some of the world’s most advanced computing workloads.
We are seeking a Software Engineer to join our MARS team at NVIDIA. In this role, you will help design, build, and operate exascale infrastructure that powers AI research and development at unprecedented scale. You will work on distributed systems, large-scale storage and compute orchestration, and end-to-end automation that enable AI researchers to focus on innovation rather than infrastructure. You will collaborate closely with engineers and researchers across NVIDIA to architect reliable, efficient, and secure systems that underpin our Managed AI Research Superclusters — infrastructure capable of training frontier models and executing global-scale workloads.
What You’ll Be Doing:
- Design, develop, and operate distributed systems that manage data, compute, and networking for large-scale AI workloads.
- Build software and automation to orchestrate workloads across thousands of GPUs and petabytes of storage in multi-region clusters.
- Collaborate with AI/ML research teams to understand their requirements and translate them into scalable, high-performance solutions.
- Drive improvements in system reliability, performance, and observability to meet exascale standards.
- Partner with security, networking, and platform teams to ensure that MARS infrastructure meets the highest standards of robustn...