Principal Data Scientist, Health Informatics
About the role
Waymark is a team of healthcare providers, technologists, and builders whose mission is to bring the best healthcare to people with Medicaid benefits. Guided by the communities we serve, we bring support and technology-enabled care to help primary care providers keep Medicaid patients healthy. We are building the tools and designing an approach to enable care to reach the patients who can benefit most.
Our core values embody the essence of what makes Waymark a unique team today, and what we look for, nurture, and sustain as a team. We are bold builders, believing that the greatest challenges in care delivery can be solved when we harness the power of community and technology. We are humble learners, seeking feedback and perspectives different from our own and welcoming challenges to our conclusions. We experiment to improve, actively seeking data to inform decisions and assess our own performance. We act with focused urgency: our commitment to our mission drives us to tirelessly pursue results.
About This Role
Waymark is seeking a Principal Data Scientist to own clinical data as a first-class input to modeling and to bring senior ML/AI and health economics judgment to our core data science products. As Waymark scales across health plan and health system partners, clinical data quality directly determines model accuracy. We need a senior owner accountable for data quality, normalization, and clinical validity across claims, EHR, and ADT.
This role sits at the intersection of clinical data expertise, applied ML/AI, and health economics methods. You will own the clinical data strategy that enables our modeling, defining how EHR and ADT data in formats including FHIR, HL7v2, and C-CDA should be structured, normalized, and validated as modeling inputs. This requires hands-on fluency in how these systems are structured and what the data actually represents clinically. You will build and ship production models that advance our existing machine learning and generative AI products. You will also operate as a senior technical leader: making architectural trade-offs, aligning data science, engineering, product, and clinical stakeholders, and raising the technical bar of the team.
This is a highly versatile role for someone who is equally fluent in clinical terminologies and production ML, and who can move work from prototype to deployment with rigor and speed.
Responsibilities
- Own clinical data quality across claims, EHR, and ADT: Define standards for how clinical data is structured, normalized, and validated as modeling inputs across payer claims (medical, pharmacy, eligibility), EHR data (Epic, Cerner, Athena), and real-time ADT feeds. Bring deep familiarity with EHR data formats (FHIR, HL7v2, C-CDA) and with how data from these systems maps to clinical reality. Hold the bar for clinical accuracy and completeness across all three sources.
- Build and ship production ML/AI models: Develop, validate, and deploy risk stratification, care gap prediction, treatment effect estimation, and LLM/foundation model applications — with rigor around leakage, calibration, fairness, and clinical face validity.
- Apply health economics and outcomes methods: Translate raw clinical and claims data into decision-grade evidence through risk adjustment, utilization measurement, cost attribution, quasi-experimental evaluation, and outcomes measurement aligned with CMS, NCQA, and MCO reporting standards.