Software Engineer, Data Infrastructure & Acquisition - Charleston, SC, USA
Company: RemoteHunter
Location: Location not specified (Remote)
Type: Full-time
Remote: Yes
Posted: 2026-05-03
About this role
About Our Client
The organization is a leader in the text-to-speech (TTS) technology sector, developing products used by over 50 million people to convert written content—including PDFs, books, and articles—into high-quality audio. With applications across iOS, Android, Mac, Chrome, and Web, they are widely recognized for their commitment to accessibility and inclusive design. The company operates as a fully distributed global team of nearly 200 professionals, including world-class engineers and AI researchers from top tech hubs and academic institutions.
About the Opportunity
The Software Engineer, Data Infrastructure & Acquisition is a pivotal role responsible for managing the data lifecycle that fuels AI model training. You will focus on building high-quality, large-scale datasets by bridging infrastructure, engineering, and research efforts. By optimizing data pipelines and collaborating on the long-term data roadmap, this position directly impacts the advancement of the organization’s AI capabilities for future consumer and enterprise products.
Responsibilities
- Data Acquisition: Identify and acquire new, high-value sources of audio data for ingestion into AI training pipelines.
- Infrastructure Management: Operate and expand cloud-based ingestion infrastructure, primarily utilizing Google Cloud Platform (GCP) and Terraform.
- Performance Optimization: Partner with research scientists to enhance the cost-efficiency, throughput, and quality of data collection.
- Strategic Roadmap: Work alongside AI leadership to shape the dataset strategy that supports the next generation of audio products.
Requirements
- Education: BS, MS, or PhD in Computer Science or a related technical field.
- Experience: Minimum of 5 years of professional software development experience.
- Technical Skills: Proficiency in bash and Python scripting within Linux environments.
- Cloud & DevOps: Hands-on experience with Docker, Infrastructure-as-Code (Terraform), and a major cloud prov...