Staff Data Engineer
Arine · Remote
📍 Remote (United States of America)💰 $170,000-185,000via greenhousePosted 2026-04-29
Apply on company site ↗
CareerRiver pulls this listing straight from the employer's hiring system — no recruiter middleman, no reposts. Applying takes you directly to Arine.
Based in San Francisco, Arine is a rapidly growing healthcare technology and clinical services company with a mission to ensure individuals receive the safest and most effective treatments for their unique and evolving healthcare needs.
Frequently, medications cause more harm than good. Incorrect drugs and doses costs the US healthcare system over $528 billion in waste, avoidable harm, and hospitalizations each year. Arine is redefining what excellent healthcare looks like by solving these issues through our software platform (SaaS). We combine cutting edge data science, machine learning, AI, and deep clinical expertise to introduce a patient-centric view to medication management, and develop and deliver personalized care plans on a massive scale for patients and their care teams.
Arine is committed to improving the lives and health of complex patients that have an outsized impact on healthcare costs and have traditionally been difficult to identify and address. These patients face numerous challenges including complicated prescribing issues across multiple medications and providers, medication challenges with many chronic diseases, and patient issues with access to care. Backed by leading healthcare investors and collaborating with top healthcare organizations and providers, we deliver recommendations and facilitate clinical interventions that lead to significant, measurable health improvements for patients and cost savings for customers.
Why is Arine a Great Place to Work?:
Outstanding Team and Culture - Our shared mission unites and motivates us to do our best work. We have a relentless passion and commitment to the innovation required to be the market leader in medication intelligence.
Making a Proven Difference in Healthcare - We are saving patient lives, and enabling individuals to experience improved health outcomes, including significant reductions in hospitalizations and cost of care.
Market Opportunity - Arine is backed by leading healthcare investors and was founded to tackle one of the largest healthcare problems today. Non-optimized medications therapies which cost the US 275,000 lives and $528 billion annually.
Dramatic Growth - Arine is managing more than 18 million lives across prominent health plans after only 4 years in the market, and was ranked 236 on the 2024 Inc. 5000 list and was named the 5th fastest-growing company in the AI category.
The Role :
As a key technical leader and team architect working in a fast-paced environment, you will drive the design, development, and optimization of scalable data ingestion pipelines within the Arine platform. Leveraging expert-level proficiency in Python and AWS, you will architect solutions that handle diverse file types and large-scale healthcare datasets. You will have a direct impact on building reusable, configurable tools for handling data needs for the entire company.
A key part of this role is leading the team’s transformation toward AI-driven software development - shifting engineers from being primary builders of code to skilled directors and reviewers of AI-generated work.
What You'll be Doing:
Act as the team architect by leading system design reviews, offering recommendations, conducting comprehensive peer reviews, and demonstrating expert-level proficiency in Python and AWS services
Architecting and implementing scalable data ingestion pipelines, including incremental ingestion strategies for large-scale healthcare datasets
Developing reusable, configuration-driven, containerized pipeline components and toolsets that diverse engineering profiles can use and maintain
Work collaboratively with cross-functional teams to ensure their data requirements are met through ETL components
Designing and maintaining data transformation pipelines using dbt, including utilizing core concepts like macros, incremental models and dbt tests
Building monitoring and alerting systems for data ingestion processes and pipeline health
Applying software engineering best practices including test-driven development and modular design to data infrastructure, including refactoring existing ingestion processes to improve scalability and operational efficiency
Provide technical guidance, mentorship to junior engineers, and promote best practices and coding standards
Champion AI-assisted development across the team - establishing norms, workflows, and expectations for using AI coding tools (e.g., Claude Code, Cursor, Copilot) to generate, iterate, and ship production-quality code
Model the “builder to reviewer” shift - demonstrating how senior engineers direct AI agents to produce full solutions, then apply rigorous review, testing, and judgment to own the output
Identify opportunities to automate repetitive engineering work using LLMs and AI tooling, including pipeline scaffolding, boilerplate generation, data transformation logic, and documentation
Author and support high-quality technical documentation, assisting junior engineers in doing the same
Who You Are and What You Bring:
10+ years working in data engineering, with a focus on large-scale data ingestion and infrastructure
A track record of building automated, production-grade ETL processes using Python and DBT SQL
Strong understanding of ETL/ELT frameworks and distributed data processing
Demonstrated hands-on experience building software with AI coding tools - not just autocomplete, but directing AI agents to generate complete solutions and applying disciplined review and ownership of the output
A genuine conviction that AI-augmented development is the future of software engineering, paired with the judgment to validate, test, and take accountability for AI-generated code
Experience or strong interest in integrating LLMs into engineering workflows beyond development assistance - such as automating data quality checks, generating pipeline logic, or surfacing anomalies
Proven ability to handle and process vari
More Remote jobs
Remote jobs · Browse all locations