About Monzo: Monzo (monzo.com) is a premier, internationally recognized challenger bank, financial technology pioneer, and digital ecosystem developer operating on an absolute mission to protect, automate, and revolutionize traditional banking services for over 9 million users across the United Kingdom. From introducing its iconic hot coral smart cards and pioneering get-paid-early processing nodes to deploying personal, business, joint, and micro-savings portfolios, Monzo continuously engineers human-centric financial products that eliminate complexity from everyday money management. Recognized as a top-tier fintech disrupter with a 600-person engineering powerhouse, the company is built on an autonomous, transparent, and open-source-friendly corporate philosophy. Monzo equips high-performing backend engineers with an uncompromised platform canvas to leverage cutting-edge multi-cloud environments, manipulate distributed microservices, and deploy highly secure, low-latency AI solutions transforming financial lives.

Position Overview

We are seeking a highly analytical, connection-obsessed, and systems-minded ML/AI Platform Engineer to join our centralized Machine Learning Platform team under a flexible full-time engagement framework open to candidates based out of our London headquarters or operating 100% remotely across the United Kingdom. Operating as a primary platform authority and developer experience architect for our internal technical groups, you will step up to claim true individual operational and strategic accountability over model training infrastructure, low-latency serving pipelines, and high-concurrency evaluation workflows. Shifting completely away from routine standalone business logic text editing, isolated DevOps operations script running, passive dashboard monitoring, or localized data model adjustments, you will run an active microservices engineering, asynchronous distributed training, and automated telemetry development laboratory—partnering face-to-face with core machine learning engineers, data scientists, and product squads. This position requires a software engineering veteran (aiming at our L30 or L40 Engineering Progression framework) who maps out scalable platform APIs fluidly natively using Machine Learning infrastructure, builds robust utilities cleanly natively leveraging advanced Python Scripting and Go backend services, and commands highly fault-tolerant cluster architectures confidently under intense production scale.

Key Responsibilities

Core ML Infrastructure Architecture: Formulate, program, and maintain high-throughput backend services and APIs designed to train, evaluate, deploy, and serve machine learning models safely and reliably cleanly natively utilizing Machine Learning platform systems.
Developer Experience (DevEx) Optimization: Engineer robust, idiomatic Python utilities and backend tools to remove operational friction for distributed internal ML engineers and data scientists natively leveraging Python Scripting workflows.
Low-Latency Serving Governance: Architect and optimize microservices running real-time, low-latency scoring grids for high-stakes operational nodes, including live transaction fraud checks and automated credit underwriting.
Distributed Systems Management: Design, operate, and refactor complex distributed software blocks capable of managing scale, handling network concurrency smoothly, and failing gracefully under production stress.
Production Observability Auditing: Deeply investigate system runtime behavior, ensuring all production model endpoints remain completely reliable, highly observable, latency-optimized, and operationally safe.
Multi-Cloud Deployment Orchestration: Coordinate backend pipelines across multi-cloud systems using container environments and modern infrastructure-as-code deployments (utilizing Kubernetes, AWS, GCP, and Terraform configurations).
Open Source and Knowledge Synergy: Participate in technical knowledge-sharing blocks, contribute actively to open-source software networks, and align code standards with enterprise reliability rules.

Required Skills & Qualifications

Demonstrated professional track record of success writing production-grade backend software, managing scalable microservices, or building advanced developer-focused tooling.
Expert-tier capability designing distributed systems, tuning network parameters, and securing real-time application throughput natively utilizing Machine Learning infrastructure components.
Practical operational familiarity writing clean, performant, and well-tested code across both strongly typed ecosystems and scripting layers natively using Python Scripting alongside backend languages like Go.
Hands-on system deployment experience managing containers, configuring container networking, and utilizing cloud architectures (specifically requiring active production familiarity with Kubernetes and either AWS or GCP fabrics).
Familiarity with foundational machine learning platform components, such as automated model training pipelines, feature stores, model registries, experiment tracking systems, or LLM evaluation layers.
Outstanding written, verbal, and scannable technical-documentation communication attributes in business-fluent English to coordinate across distributed product nodes and engineering guilds.
Location Context: Position open to qualified software engineers located within London, Cardiff, or anywhere across the United Kingdom to execute a highly flexible remote or hybrid work-from-home track.

Preferred Strategic Indicators (Nice to Have)

Background operating explicitly within a regulated financial technology environment, high-security digital banking framework, or massive B2C digital scale infrastructure.
Prior experience with reverse proxy systems or service meshes, including active routing environments like Envoy Proxy.
An outcome-focused personal philosophy rooted in developer empathy, an interest in exploring production system edge-cases, and the ability to thrive inside highly autonomous, cross-functional engineering teams.

What We Offer

Vetted, Competitive UK Tech Salary Blueprint: A highly competitive baseline full-time salary package structured transparently between £85,000 and £110,000 GBP, calibrated precisely against your engineering craft depth under our public Progression Framework, plus performance-tied Incentive Awards.
Uncompromised Legal Mobility Support: Full corporate visa sponsorship validation and structured international relocation assistance to transition you smoothly into the United Kingdom tech hub.
Profound work-from-home remote parameters offering total location trust across the UK, complete personal schedule execution freedom, and full travel cost coverage from Monzo if office sessions are requested.
A dedicated annual learning budget of £1,000 GBP to acquire advanced engineering books, attend premium global tech conferences, or secure technical certifications.
Integration into a world-class, collaborative engineering culture of 600+ developers, offering regular catered team lunches, shared open-source initiatives, and massive career progression velocity.

ML/AI Platform Engineer

Job Description

Key Skills Required

Position Overview

Key Responsibilities

Required Skills & Qualifications

Preferred Strategic Indicators (Nice to Have)

What We Offer

How would you rate this job post?

Is this company safe?

Safety First