Data Engineer, Human Cohorts
Calico · San Francisco Bay Area
📍 South San Francisco, CA · 💰 $191,000 - $195,000 · via Greenhouse · Posted 2026-05-21
Who We Are:
Calico (Calico Life Sciences LLC) is an Alphabet-founded research and development company whose mission is to harness advanced technologies and model systems to increase our understanding of the biology that controls human aging. Calico will use that knowledge to devise interventions that enable people to lead longer and healthier lives. Calico’s highly innovative technology labs, its commitment to curiosity-driven discovery science, and, with academic and industry partners, its vibrant drug-development pipeline, together create an inspiring and exciting place to catalyze and enable medical breakthroughs.
Position Description:
Calico is seeking a Data Engineer to join our highly collaborative Engineering team and focus on developing high-performance research data infrastructure for large human cohorts. To succeed, you will need to be an enthusiastic team player, detail-oriented, extremely organized, and comfortable working on complex data, software, and scientific problems.
In this position, you will be the engineering lead for data infrastructure to support our human biology teams. You will drive projects from requirements-gathering to production deployment, engineering high-performance data systems that integrate with our internal data systems and our internally-developed AI platform.
Position Responsibilities:
End-to-End Project Ownership: Collaborate with data scientists and bench scientists to gather requirements, architect solutions, and deploy production-grade software that facilitates data movement, transformation, analysis, and visualization
Data Flow Architecture: Define and optimize data flows across the organization
Full-Stack Tool Development: Develop data systems and internal web applications (using React and Python) that allow stakeholders to review, visualize, and communicate complex scientific data
Mentorship & Leadership: Serve as a strong technical voice within a larger Engineering team; provide mentorship to junior engineers across Calico and help onboard future hires
Engineering Excellence: Champion best practices for infrastructure-as-code, CI/CD, and containerization while helping to set standards for data engineering at Calico
Position Requirements:
BS/MS/PhD in Computer Science, Data Science, or a related technical field, or equivalent practical experience
4+ years (for BS/MS) or 1-2 years (for PhD) of professional software or data engineering experience developing robust, production-grade, and high-performance R&D-focused information systems
Experience working with large-scale biological datasets
Fluency in Python and SQL with a strong grasp of software and data engineering principles (testing, modularity, design patterns, data modeling)
Demonstrated experience developing and deploying cloud-based applications on Google Cloud Platform (GCP) (preferred), AWS, or Azure
Strong experience with modern web frameworks and infrastructure, specifically FastAPI, React, Kubernetes, and Terraform
Proven ability to lead complex projects involving diverse stakeholders (e.g., ML engineers, computational biologists, bench scientists) from concept to production
Experience enforcing robust data governance policies and compliance with internal information security standards and best practices
Must be willing to work onsite at least four days per week
The estimated base salary range for this role is $191,000 - $195,000. Actual pay will be based on a number of factors including experience and qualifications. This position is also eligible for two annual cash bonuses.