About Verantos: Verantos is the premier, internationally recognized real-world evidence (RWE) software pioneer, clinical data technology innovator, and healthcare analytics leader on an absolute mission to power life-saving biopharma research through high-accuracy data orchestration. As an advanced data science company operating an elite, AWS-centric software platform, Verantos seamlessly integrates heterogeneous, messy, and high-variance real-world data sources—including electronic health records (EHR)—to generate evidence with the uncompromised accuracy necessary for complex regulatory approval and insurance reimbursement use cases. Trusted by some of the largest biopharmaceutical corporations on earth, the company unifies artificial intelligence with cross-functional domain expertise. Verantos provides distributed systems engineers with an autonomous, remote-first environment where data-driven craftsmanship translates into direct, life-changing global clinical impacts safely.

Position Overview

We are seeking a highly analytical, systems-minded Senior Data Engineer to join our core centralized Data Engineering division in a full-time remote capacity within the United States. In this technical lead-level infrastructure seat, you will step up to claim individual technical accountability for the long-term design, evolution, and quarterly release of our core data product layer. Shifting completely away from passive database upkeep or chasing erratic sync faults manually, you will engineer automated data pipelines engineered to handle the chaotic, constantly changing realities of real-world clinical systems gracefully. This high-ownership position requires a seasoned systems craftsman with 8+ years of data engineering background who designs for complete observability from the start, writes clean pipeline tests to protect downstream consumers, and bridges the gap between low-level python configurations and the actual research goals of the clinical specialists relying on our data trees.

Key Responsibilities

Data Platform Architecture Ownership: Lead the structural design, scaling, and evolution of the master data platform, establishing reusable code patterns, quality standards, and integration patterns for the engineering team.
High-Variance Clinical Data Ingestion: Build and operate production-grade, multi-tenant ETL/ELT pipelines to ingest, clean, and normalize highly irregular, high-variance electronic health record (EHR) data structures at scale.
Hands-On Python and SQL Modeling: Program and refactor highly optimized analytical data flows, validation metrics, and automated migration routines natively using Python scripts and complex SQL queries.
Automated Failure Recovery Engineering: Design pipelines for complete automation from the start, constructing self-healing mechanisms that proactively detect schema drift, handle anomalies, and surface data quality issues without requiring manual intervention.
Snowflake Warehouse and dbt Orchestration: Map and manage complex data relationships and analytics transformation models natively inside modern data platforms, explicitly utilizing Snowflake and **dbt** environments.
Data Quality and Observability Triage: Build and scale rigorous data validation and profiling tests, utilizing data observability tooling to reflect the evolving analytical standards of downstream biopharma researchers.
Technical Mentorship and Code Review: Elevate the development velocity of the data engineering team, providing technical guidance through peer code reviews, architectural feedback, and shared engineering standards.
Cross-Functional Product Alignment: Collaborate fluidly alongside product managers, graphic designers, customer success representatives, and clinical domain experts to secure timely delivery of quarterly data product releases.

Required Skills & Qualifications

8+ years of verified professional history running advanced data engineering, enterprise pipeline architecture, backend systems development, or technical lead-level database consulting.
Deep, authoritative technical command of distributed database modeling, showing a production-tested history utilizing Snowflake and **dbt** as primary data platform tools.
Expert-tier programming proficiency writing clean, maintainable, and optimized pipeline automation logic natively using Python languages.
Proven capability to independently synthesize and translate irregular, high-variance data sources into structured, queryable schemas without requiring continuous manual debugging or oversight.
Outstanding verbal and written communication traits in fluent English, with a proven ability to engage meaningfully with non-technical product owners and clinical domain stakeholders alike.
Location Context: Parameters open exclusively to qualified senior data engineers based permanently and resident within the United States to execute 100% remotely from home under distributed virtual arrangements.

Preferred Strategic Indicators (Nice to Have)

Prior commercial history operating with healthcare data architectures, showcasing familiarity with the **OMOP CDM (Common Data Model)** layer.
Direct production experience handling highly regulated medical datasets, including electronic health records (EHR) or healthcare data standards such as **HL7** or **FHIR**.
Practical exposure running data observability frameworks and automated anomaly detection matrices within high-volume AWS cloud ecosystems.
An outcome-driven personal philosophy that actively uses and advocates for digital AI utilities inside your daily engineering workflow to maximize development velocity.

What We Offer

High-Yield U.S. Salaried Canvas: An attractive full-time base salary scale structured transparently between $150,000 – $220,000 USD per year, calibrated precisely to evaluate and reward your data architecture authority and pipeline execution velocity.
The exceptional professional canvas to directly direct, code-shape, and deploy the high-accuracy data architectures power-routing regulatory-grade real-world evidence for the world’s leading biopharma institutions.
Profound work-from-home remote parameters offering total location flexibility across the United States, complete schedule trust, and zero physical office geographical commuting friction.
Immediate integration into an elite, cross-functional AWS-centric tech stack workspace that unifies engineering, design, and clinical medicine to deliver visible real-world impact.
Access to comprehensive physical health, medical insurance protections, and holistic lifestyle calibration benefits.
Dedicated technical ownership tracks, with a culture that encourages continuous learning, active exploration of modern AI-driven anomaly monitoring tools, and peer-to-peer mentorship.

Senior Data Engineer

Job Description

Key Skills Required

Position Overview

Key Responsibilities

Required Skills & Qualifications

Preferred Strategic Indicators (Nice to Have)

What We Offer

How would you rate this job post?

Safety First