Data Engineer, Human Cohorts

calicolabs · COMPUTING
Apply Now ↗
📍 South San Francisco, CA

About this role

Who We Are:

Calico (Calico Life Sciences LLC) is an Alphabet-founded research and development company whose mission is to harness advanced technologies and model systems to increase our understanding of the biology that controls human aging. Calico will use that knowledge to devise interventions that enable people to lead longer and healthier lives. Calico’s highly innovative technology labs, its commitment to curiosity-driven discovery science, and, with academic and industry partners, its vibrant drug-development pipeline, together create an inspiring and exciting place to catalyze and enable medical breakthroughs.

Position Description:

Calico is seeking a Data Engineer to join our highly collaborative Engineering team and focus on developing high-performance research data infrastructure for large human cohorts. To succeed, you will need to be an enthusiastic team player, detail-oriented, extremely organized, and comfortable working on complex data, software, and scientific problems.

In this position, you will be the engineering lead for data infrastructure to support our human biology teams. You will drive projects from requirements-gathering to production deployment, engineering high-performance data systems that integrate with our internal data systems and our internally-developed AI platform.

Position Responsibilities:

  • End-to-End Project Ownership: Collaborate with data scientists and bench scientists to gather requirements, architect solutions, and deploy production-grade software that facilitates data movement, transformation, analysis, and visualization
  • Data Flow Architecture: Define and optimize data flows across the organization
  • Full-Stack Tool Development: Develop data systems and internal web applications (using React and Python) that allow stakeholders to review, visualize, and communicate complex scientific data
  • Mentorship & Leadership: Serve as a strong technical voice within a larger Engineering team; provide mentorship to junior engineers across Calico and help onboard future hires
  • Engineering Excellence: Champion best practices for infrastructure-as-code, CI/CD, and containerization while helping to set standards for data engineering at Calico

Position Requirements:

  • BS/MS/PhD in Computer Science, Data Science, or a related technical field, or equivalent practical experience
  • 4+ years (for BS/MS) or 1-2 years (for PhD) of professional software or data engineering experience developing robust, production-grade, and high-performance R&D-focused information systems
  • Experience working with large-scale biological datasets
  • Fluency in Python and SQL with a strong grasp of software and data engineering principles (testing, modularity, design patterns, data modeling)
  • Demonstrated experience developing and deploying cloud-based applications on Google Cloud Platform (GCP, preferred), AWS, or Azure
  • Strong experience with modern web frameworks and infrastructure, specifically FastAPI, React, Kubernetes, and Terraform
  • Proven ability to lead complex projects involving diverse stakeholders (e.g., ML engineers, computational biologists, bench scientists) from concept to production
  • Experience enforcing robust data governance policies and compliance with internal information security standards and best practices
  • Must be willing to work onsite at least four days per week

The estimated base salary range for this role is $191,000 - $195,000. Actual pay will be based on a number of factors including experience and qualifications. This position is also eligible for two annual cash bonuses.

 

Frequently Asked Questions

Is the salary disclosed for the Data Engineer, Human Cohorts position at calicolabs?
Yes. The posting lists an estimated base salary range of $191,000 - $195,000 for this Data Engineer, Human Cohorts role at calicolabs. Actual pay depends on factors including experience and qualifications, and the position is also eligible for two annual cash bonuses.
Where is the Data Engineer, Human Cohorts position at calicolabs located?
This Data Engineer, Human Cohorts role at calicolabs is based in South San Francisco, CA. The posting requires working onsite at least four days per week.
Which team or department does the Data Engineer, Human Cohorts at calicolabs belong to?
This Data Engineer, Human Cohorts position is part of the COMPUTING department at calicolabs. See the full job description for more information about the team structure and responsibilities.
How do I apply for the Data Engineer, Human Cohorts position at calicolabs?
Click the "Apply Now" button on this page. You will be redirected to calicolabs's official application portal, hosted on Greenhouse, where you can submit your application directly.
When was the Data Engineer, Human Cohorts job at calicolabs posted?
This Data Engineer, Human Cohorts position at calicolabs was posted on May 21, 2026. Apply as soon as possible — early applications are often reviewed first.