About phData: phData is an internationally recognized modern data stack, cloud analytics, and data engineering services company on a mission to help enterprise organizations overcome their toughest data and artificial intelligence challenges. A 6x Snowflake Partner of the Year (2020–2025) and Partner of the Year for Fivetran, dbt, AWS, and Atlan, phData ranks #1 in the world in Snowflake Advanced Certifications, with over 600 cloud credentials overall. Operating a remote-first global workplace across the United States, Latin America, and India, the organization fosters a community rooted in technological curiosity, transparent ownership, and mutual trust. phData gives its engineers full delivery autonomy and removes legacy barriers so they can build robust, high-volume, data-intensive applications for enterprises worldwide.

Position Overview

We are seeking a highly analytical, systems-minded Senior Data Engineer to join our centralized Data Engineering division in a full-time remote capacity within India. Working in an outcomes-driven, client-facing professional services environment, you will take individual accountability for designing, building, and productionizing end-to-end cloud data architectures that are secure, scalable, performant, and well integrated across multi-tenant infrastructures. Rather than passive database upkeep or routine ticket tracking, you will translate complex business requirements into repeatable integration patterns and data models that align with delivery excellence guidelines. This position requires a seasoned data consultant with a 4-year computer science degree and 4+ years of experience who writes clean, optimized code, leads detailed architectural presentations, and collaborates smoothly across global time zones to generate measurable business value.

Key Responsibilities

Client Data Platform Delivery: Own and drive the end-to-end design, development, and deployment of production-grade data processing solutions and pipelines for global enterprise clients.
Architecture Mapping and Modeling: Translate complex business requirements into technical data architectures, logical system views, sequence diagrams, and schema patterns that follow SQL data warehouse modeling standards.
Hands-On Python and Script Optimization: Write and maintain highly scalable data processing, system automation, and workflow orchestration logic in Python or Scala.
Multi-Cloud Platform Optimization: Build, tune, and productionize large data streams in core enterprise data environments such as Snowflake, AWS, Azure, GCP, or Databricks.
Technical Leadership and Reviews: Provide rigorous technical guidance during peer code reviews, proof-of-concept (POC) validations, and implementation workshops to ensure high-quality deliverables.
Data Integration and Streaming Triage: Integrate and curate data workflows across multiple heterogeneous sources, managing queues and pipelines with **dbt**, Spark, Kafka, or Fivetran.
Internal Accelerator Development: Contribute to phData internal practice initiatives by designing reusable pipeline components, technical templates, development playbooks, and automated data curation scripts.
Cross-Functional Stakeholder Alignment: Collaborate closely with distributed cross-functional teams spanning data engineering, software development, analytics, and project leadership to deliver engagements on time and within scope.

Required Skills & Qualifications

4+ years of professional experience in advanced data engineering, enterprise pipeline architecture, full-stack database development, or professional services cloud consulting.
Deep technical command of relational schema design, ETL/ELT data patterns, and query tuning using advanced SQL.
Expert-level proficiency in writing clean, maintainable, and optimized data processing code in Python or Scala.
Hands-on production experience designing or deploying analytics infrastructure within centralized cloud-native platforms such as Snowflake, AWS, Azure, or Databricks.
Proven capability to independently compose detailed solution blueprints, logical diagrams, class hierarchies, and operational runbooks for complex technical architectures.
Outstanding verbal and written communication skills in fluent English, with a demonstrated ability to deliver technical presentations and maintain relationships with senior client stakeholders.
Academic Baseline Qualifications: A 4-year Bachelor’s degree from an accredited institution in Computer Science, Software Engineering, or a closely related quantitative discipline.
Location Context: Open exclusively to qualified data engineers permanently based in **India**, operating under our remote-first model and working primarily during India business hours (IST) with flexibility for global team syncs.

Preferred Strategic Indicators (Nice to Have)

Prior commercial experience implementing automated workflow management and data pipelines via tools like Apache Airflow, Luigi, or NiFi.
Familiarity with distributed data storage frameworks and NoSQL technologies, including S3, ADLS, HDFS, GCS, Elasticsearch/Solr, or Cassandra.
Contributions to open-source software, professional technical communities, whitepaper writing, or internal accelerator development.
An outcome-driven personal philosophy with an insatiable curiosity for exploring modern data stack integrations, vector platforms (e.g., Pinecone), or agentic AI retrieval systems.

What We Offer

The opportunity to directly design, build, and deploy the foundational cloud data pipelines that power AI and big data initiatives for the world's largest enterprises.
Highly competitive, capability-benchmarked full-time base compensation, supplemented by performance-driven bonuses for creating approved technical content.
Fully remote work with location flexibility anywhere in India, trust-based scheduling, and no office commute.
Immediate eligibility to enroll in comprehensive **Medical Insurance coverage for yourself, your family, and your parents**.
Long-term financial protection through fully company-paid Term Life and Personal Accident insurance.
Elite workspace and wellness perks, including a structured **Wellness Allowance**, monthly broadband network reimbursement, and home office technology support.
Unrivaled continuous learning paths, including full financial coverage for paid technical certifications and dedicated professional development allowances.

