Back to Jobs
AI & Machine Learning 1d ago

Senior Data/ML Engineer (AWS)

🌍Global
Full-time (1099)
Not Disclosed
Senior

Job Description

Key Skills Required

Master these to land this role

Machine LearningBestseller 🔥
Learn in 42 Hours
AI Engineer

Want to know if you're a match for this job?

Calculate My Match Score

About Capnexus: Capnexus is a premier, internationally recognized comprehensive services provider, retail software innovator, and global build-as-a-service pioneer on an absolute mission to help organizations automate and modernize enterprise workflows through repeatable business patterns. By specializing in full-lifecycle software development, complex system analysis, multi-platform data integration, and persistent implementation support, Capnexus delivers a complete suite of services that can be deployed as a single, highly integrated team. Operating on a customer-obsessed and delivery-focused culture where all work is strategic, the organization values lifetime learners who are hungry, curious, and self-motivated in their pursuit of knowledge. Capnexus eliminates rigid internal barriers to celebrate cross-functional skill sets, providing elite cloud technologists with an uncompromised remote canvas to build elegant, simple, and highly scalable enterprise solutions safely worldwide.

Position Overview

We are seeking a highly analytical, systems-minded Senior Data/ML Engineer (AWS) to join our core centralized data engineering and artificial intelligence organization in a full-time 1099 remote capacity. In this high-leverage technical leadership seat, you will claim individual strategic accountability for leading data architecture, multi-zone data lake development, and cross-functional machine learning integrations. Shifting completely away from boilerplate server management or passive report logging, you will participate in data discovery workshops to inventory source data networks, provision AWS target environments, and translate complex findings into enterprise data lake requirements. This senior position requires a seasoned cloud development veteran with 5+ years of data engineering history who configures automated ETL orchestration frameworks fluidly, deploys predictive foundation models smoothly under compressed timelines, and collaborates alongside AWS Professional Services to ensure data quality and integrity across all production stages.

Key Responsibilities

  • Multi-Zone Data Lake Architecture: Design and implement a multi-zone enterprise data lake architecture on Amazon S3 (raw, conformed, enriched, aggregated), configuring ingest, cleansing, and business layers in strict alignment with SOW metrics.
  • Automated Ingestion Pipeline Construction: Code, optimize, and maintain comprehensive batch and streaming data ingestion pipelines natively utilizing AI Engineer toolsets including AWS Glue, Amazon Kinesis, and AWS Data Pipeline.
  • Predictive Model Development and Tuning: Develop, train, and deploy advanced machine learning models natively leveraging Machine Learning paradigms inside Amazon SageMaker for automated lead scoring, predictive maintenance, and intelligent underwriting risk scoring.
  • Generative AI Foundation Integration: Integrate Amazon Bedrock foundation models to enable advanced generative AI capabilities, power-routing customer profile enrichment, hyper-personalization, and intelligent marketing automation.
  • Serverless Workflow Orchestration: Implement robust data transformation and orchestration frameworks using AWS Glue ETL, AWS Lambda, and AWS Step Functions, embedding AWS Glue Data Catalog for metadata management.
  • Identity Resolution and Deduplication: Design and deploy entity resolution pipelines using Amazon Entity Resolution to identify, deduplicate, and merge fragmented customer records into unified golden records supporting Customer 360 views.
  • Cloud-to-Cloud Dataset Migration: Support Azure data lake migration tracks, assessing schemas and transformation logic, provisioning target spaces, executing migration via AWS DataSync, and performing data reconciliation.
  • Data Governance and Security Enforcement: Implement strict data lake security policies using AWS Lake Formation, enforcing column-level encryption, row-level security parameters, and data governance documentation.

Required Skills & Qualifications

  • 5+ years of verified professional history running advanced data engineering, machine learning development, cloud architecture mapping, or database system consulting, with at least 2+ years of dedicated history in AWS cloud environments.
  • Deep, authoritative technical command of relational data structures, data modeling, feature engineering, and ML integration testing methodologies.
  • Expert-tier programming proficiency writing clean, fast, and practical data manipulation scripts natively utilizing Python and advanced SQL querying.
  • Production-tested experience configuring serverless analytics, model endpoint deployment, and visualization paths natively leveraging Amazon Athena, SageMaker Endpoints, and Amazon API Gateway.
  • Hands-on experience with Kiro CLI or comparable AI-assisted/agentic engineering development frameworks and automated code generation tools.
  • Outstanding verbal and written communication traits, with an absolute capacity to produce architecture documentation, pipeline runbooks, and participate in Agile/Scrum sprint ceremonies.
  • Location and Job Contract Type: Position open to qualified data specialists globally to operate under a 100% remote work-from-home layout under a Full-time, 1099 independent contractor engagement model.

Preferred Strategic Indicators (Nice to Have)

  • Prior commercial history leading data migration or machine learning infrastructure tracks within the real estate, property management, marketing technology, or insurance industries.
  • Hands-on operational familiarity with Azure data ecosystems, including Azure Data Lake, Azure Data Factory, or Azure Synapse.
  • Possession of accredited industry cloud credentials, highlighting active AWS Certification (Machine Learning Specialty, Data Analytics Specialty, or Solutions Architect) designation metrics.
  • Familiarity with MLOps best practices, including continuous model monitoring, drift detection, LLM prompt engineering, or Retrieval-Augmented Generation (RAG) architectures.

What We Offer

  • The exceptional professional canvas to directly direct, shape, and code-engineer the foundational data lake architectures and machine learning systems power-routing automated workflows for enterprise retail platforms.
  • Highly lucrative, capability-benchmarked full-time 1099 contractual compensation configurations calibrated precisely to reward your data architecture authority and pipeline execution velocity.
  • Profound work-from-home remote parameters providing an elite digital workplace, complete scheduling trust, and zero physical geographic office commuting friction.
  • An open-minded, high-performance environment built on outcomes and delivery, giving you the unique opportunity to collaborate directly alongside passionate technical leads.
  • Continuous skill acceleration support, providing an environment that values lifetime learners, supports cross-functional capability building, and encourages knowledge sharing across communities.

How would you rate this job post?

See what other professionals think about this role.

Is this company safe?

Ask Hyrizon AI to scan this company for potential red flags before you apply.

Safety First

  • Never pay for a job application.
  • Do not share sensitive bank info.
  • Verify the client before starting work.
Learn More