Software Engineer
Company: Dandelion Health Inc
Location: Remote (Remote)
Salary: $135,000 - $150,000 a year
Type: Full-time
Remote: Yes
Posted: 2026-04-05
About this role
Our Team
Dandelion Health was founded in 2020 by experts in health tech, hospital systems, academia, and clinical AI. We are building the world’s largest AI training and clinical development platform. Today, we pride ourselves on our ability to make data access as easy as possible for AI developers, pharma, and medical devices, while raising the bar for patient safety and data quality. Tomorrow, we will be the place where any healthcare organization can go to build a responsible clinical AI product. Our culture is all about learning from data and improving, so we can help our clients improve health through AI. Meet the rest of our team here.
Our Data
We partner with health systems to safely and ethically make their de-identified patient data available to AI developers. Currently, the data is acquired from Sharp HealthCare, Sanford Health, and Texas Health Resources – with two additional U.S. health systems joining soon.
We have clinical data dating back to July 1, 2016. This data represents over 10 million patients and includes but is not limited to:
- **Structured data** (e.g., 100% of the EMR, including some claims)
- **Unstructured text** (e.g., clinical notes, radiology reports)
- **Images** (e.g., DICOM, pathology)
- **Video**
- **Waveforms**
- **Continuous streaming monitoring data**
Your Role
Dandelion is constantly expanding the breadth, depth, and completeness of health system datasets while improving the speed and quality of our de-identification pipeline. As an engineer working on our de-identification pipelines, you will:
- Design and implement software systems that perform these de-identification rules at high scale and throughput (we de-identify billions of rows of data and millions of images each month) while constraining costs.
- Generate and execute quality assurance plans to validate our de-identification processes.
- Run de-identification pipelines in health system cloud environments, and optimize these pipelines to minimize er...