CareerRiver

Principal Data Scientist, Health Informatics

Waymark · Remote

📍 US - Remote · 💰 $160,000 - $229,000 · via Greenhouse · Posted 2026-06-08
Waymark is a team of healthcare providers, technologists, and builders whose mission is to bring the best healthcare to people with Medicaid benefits. Guided by the communities we serve, we bring support and technology-enabled care to help primary care providers keep Medicaid patients healthy. We are building the tools and designing an approach to enable care to reach the patients who can benefit most.

Our core values embody the essence of what makes Waymark a unique team today, and what we look for, nurture, and sustain as a team. We are bold builders, believing that the greatest challenges in care delivery can be solved when we harness the power of community and technology. We are humble learners, seeking feedback and perspectives different from our own, and welcome challenges to our conclusions. We experiment to improve, actively seeking data to inform decisions and assess our own performance. We act with focused urgency; our commitment to our mission drives us to tirelessly pursue results.

About This Role

Waymark is seeking a Principal Data Scientist to own clinical data as a first-class input to modeling and to bring senior ML/AI and health economics judgment to our core data science products. As Waymark scales across health plan and health system partners, clinical data quality directly determines model accuracy. We need a senior owner accountable for data quality, normalization, and clinical validity across claims, EHR, and ADT.

This role sits at the intersection of clinical data expertise, applied ML/AI, and health economics methods. You will own the clinical data strategy that enables our modeling, defining how EHR and ADT data, across formats including FHIR, HL7v2, and C-CDA, should be structured, normalized, and validated as modeling inputs, with hands-on fluency in how these systems are structured and what the data actually represents clinically.

You will build and ship production models that advance our existing machine learning and generative AI products, and operate as a senior technical leader, making architectural trade-offs, aligning data science, engineering, product, and clinical stakeholders, and raising the technical bar of the team. This is a highly versatile role for someone who is equally fluent in clinical terminologies and production ML, and who can move work from prototype to deployment with rigor and speed.

Responsibilities

Own clinical data quality across claims, EHR, and ADT: Define standards for how clinical data is structured, normalized, and validated as modeling inputs across payer claims (medical, pharmacy, eligibility), EHR data (Epic, Cerner, Athena), and real-time ADT feeds. Bring deep familiarity with EHR data formats (FHIR, HL7, C-CDA) and how data from systems like Epic, Cerner, and Athena maps to clinical reality. Hold the bar for clinical accuracy and completeness across all three sources.

Build and ship production ML/AI models: Develop, validate, and deploy risk stratification, care gap prediction, treatment effect estimation, and LLM/foundation model applications, with rigor around leakage, calibration, fairness, and clinical face validity.

Apply health economics and outcomes methods: Translate raw clinical and claims data into decision-grade evidence through risk adjustment, utilization measurement, cost attribution, quasi-experimental evaluation, and outcomes measurement aligned with CMS, NCQA, and MCO reporting standards.

Advance machine learning and AI products: Bring senior modeling judgment to the product roadmap, owning the clinical and methodological soundness of what ships.

Set standards and mentor: Make architectural trade-offs, drive alignment across data science, engineering, product, and clinical stakeholders, and mentor junior data scientists to raise the technical bar of the team.

Minimum Qualifications

Healthcare Data Expertise: Deep, hands-on fluency with claims, EHR, and ADT data, and strong command of clinical terminologies (ICD-10, SNOMED CT, LOINC, RxNorm, CPT/HCPCS) and value set curation.

Standards Fluency: Working experience with healthcare data standards and exchange formats: FHIR, HL7v2, and C-CDA.

Education: Master's degree in Data Science, Biostatistics, Health Informatics, Computer Science, or a related field.

Python Proficiency: 7-8+ years of hands-on experience in Python, including data science and ML libraries.

Applied ML/AI Experience: Demonstrated ability to build, validate, and deploy production ML models on healthcare data, with end-to-end ownership from development through deployment and maintenance in a live environment. Experience with ML pipelines, model versioning, and reproducible workflows at scale.

Project Ownership: Proven ability to manage complex technical projects independently, align multiple stakeholders, and deliver on timelines.

Preferred Qualifications

PhD in health informatics, statistics, data science, or computer science.

Experience integrating EHR/HIE data via TEFCA, CommonWell, or comparable networks.

Health Economics & Outcomes Methods: Experience with risk adjustment, utilization and cost measurement, and quasi-experimental evaluation.

Familiarity with MLOps best practices including experiment tracking and model registry (e.g. MLflow), CI/CD for ML pipelines, feature stores, and workflow orchestration tools such as SageMaker Pipelines.

Prior experience building on Medicaid or dual-eligible populations.

Peer-reviewed publications in healthcare ML, AI, biostatistics, or health economics.

Why This Role Matters

Waymark is scaling across health plan and health system partners, and the depth of clinical insight we can extract from our data directly determines whether our models drive better care. This role sits at the center of what makes Waymark's models accurate and clinically actionable.

By taking ownership you will:

Define and own clinical data quality standards across claims, EHR, and ADT
