Software Engineer, Data Infrastructure & Acquisition - Tacoma, WA, USA
Company: RemoteHunter
Location: Location not specified (Remote)
Type: Full-time
Remote: Yes
Posted: 2026-05-04
About this role
1. About Our Client:
The organization operates in the text-to-speech technology sector, addressing the challenge of making reading accessible and removing barriers to learning. Its products convert various written formats such as PDFs, books, documents, news articles, and websites into audio to help users read faster and retain more information. The platform includes multiple applications across iOS, Android, Mac, Chrome, and web environments. The organization serves over 50 million users globally and has received recognition for inclusivity and design from major technology companies. It functions as a fully distributed team of nearly 200 employees worldwide, composed of engineers, AI researchers, and professionals from prominent tech companies and academic institutions.
2. About the Opportunity:
The Software Engineer, Data Infrastructure & Acquisition role focuses on supporting the data collection processes essential for training AI models. This position is responsible for sourcing, managing, and optimizing large-scale audio datasets to enhance model quality and efficiency. The role plays a key part in advancing the organization’s AI capabilities by collaborating with research and engineering teams to improve data infrastructure and scale data acquisition efforts.
3. Responsibilities:
• Identify and integrate new audio data sources into the ingestion pipeline
• Operate and enhance cloud infrastructure for data ingestion using GCP and Terraform
• Work closely with scientists to improve cost, throughput, and data quality metrics
• Collaborate with the AI team and leadership to develop the dataset roadmap for future product development
4. Requirements:
• BS, MS, or PhD in Computer Science or related field
• 5+ years of software development experience
• Proficiency in bash and Python scripting within Linux environments
• Experience with Docker, Infrastructure-as-Code, and at least one major cloud provider (preferably GCP)
• Familiarity with web crawlers a...