About Mastercard: Mastercard powers global economies and empowers individuals across more than 200 countries and territories worldwide. By connecting consumers, financial institutions, merchants, governments, and businesses, we build a safe, sustainable, and inclusive digital economy where everyone can prosper. Driven by a high Decency Quotient (DQ), Mastercard leverages secure data networks, state-of-the-art payment infrastructures, and revolutionary technologies to ensure that billions of global transactions remain simple, smart, secure, and universally accessible.

Position Overview

We are seeking a highly motivated, systems-fluent, and automation-passionate Senior Site Reliability Engineer (SRE) to join our centralized Business Operations division under a permanent, full-time remote configuration based out of Australia. In this business-critical infrastructure seat, you will act as a premier production readiness steward, ensuring the uncompromised stability, reliability, and scaling capacity of the distributed application frameworks that fuel Mastercard’s global commerce platforms. Shifting completely away from routine non-regulated data transcription entry, generic copywriting loops, or basic front-end template formatting, you will lead an active telemetry visualization, blameless post-mortem analysis, and automated infrastructure orchestration laboratory—partnering closely across embedded Product and Core Development squads. This position requires an engineering authority who structures reliability solutions fluidly natively using DevOps and site reliability primitives, writes highly performant automation tools and scripting pipelines, diagnoses complex cross-network exceptions, and implements proactive monitoring controls to mitigate systemic runtime risks across high-load global payment arrays.

Key Responsibilities

Production Readiness Stewardship: Formulate, execute, and enforce strict system availability and operational health standards across global applications natively utilizing DevOps principles.
Developer Run Ownership Support: Empower developer squads to build fault-tolerant systems by injecting software run principles—encompassing proactive operational design, configuration management, and scalability rules—early in the lifecycle.
Telemetry & Observability Instrumentation: Architect, implement, and maintain distributed monitoring frameworks using advanced scripting to aggregate and visualize system metrics, trace paths, and logging telemetry.
Automated Scripting & Tooling: Author and manage robust systems infrastructure code and diagnostic toollines leveraging languages such as Python, Go, or Bash to streamline incident response workflows.
Complex Incident Resolution & Triage: Lead real-time diagnostics, triage sequences, and systematic troubleshooting loops across multi-layered Linux/Unix environments, networks, and databases to minimize operational disruption.
Blameless Post-Mortem Governance: Conduct and document thorough, data-supported root cause analyses (RCA) and blameless post-mortem investigations to convert performance drops into preventative structural improvements.
Capacity Planning & Optimization: Monitor resource utilization profiles, forecast multi-region infrastructure demands, and tune computing allocations to support consistent performance scaling.
IT Service & Risk Management: Align application deployments with strict ITIL problem and change management frameworks, ensuring absolute compliance with security benchmarks and data protection policies.

Required Skills & Qualifications

A minimum of 5+ years of proven, successful professional history operating inside a Senior Site Reliability Engineer (SRE), DevOps Architect, Infrastructure Systems Engineer, or closely related high-availability software capacity.
Expert Systems Administration Command: Extensive production history configuring, maintaining, and troubleshooting complex Linux/Unix systems, cloud-networking components, and protocol security layers.
Robust Programming and Scripting Fluency: Deep, hands-on tool literacy writing performant infrastructure tools, deployment scripts, or automation modules using Python, Go, or Bash.
Demonstrated capability designing and operating scalable, secure, and fault-tolerant cloud infrastructures on major platforms (such as AWS, Azure, or GCP).
Strong operational familiarity orchestrating software delivery pipelines, container nodes, and continuous deployment environments (CI/CD, Docker, Kubernetes).
Outstanding written, verbal, and presentation communication strengths in English, with a proven ability to frame technical roadblocks cleanly for cross-functional vertical leads and business stakeholders.
Location Context: Position operates under remote guidelines open exclusively to qualified infrastructure engineers residing permanently within Australia.

Preferred Strategic Indicators (Nice to Have)

Prior experience or direct platform infrastructure history maintaining high-load systems inside the Fintech, digital payment processing, banking technology, or Merchant of Record (MoR) spaces.
Familiarity with financial compliance standards, enterprise security auditing patterns, or zero-trust data protection layouts.
Active engagement with global open-source site reliability communities or advanced enterprise architecture continuous learning networks.

What We Offer

Top-Tier Australian Tech Infrastructure Remuneration: A highly competitive annual base salary customized precisely to your SRE and systems history, supplemented by corporate bonus paths, stock equity participation, and a comprehensive rewards matrix.
100% remote workspace infrastructure autonomy anywhere within Australia, offering exceptional schedule flexibility to balance your personal and professional paths.
Uncompromising Global Strategy Impact: Elite professional credentials built by serving as the flagship reliability steward for an infrastructure network powering global digital commerce.
Comprehensive health care preservation benefits protecting employees, featuring premium localized medical, dental, and vision coverage frameworks.
Access to an inclusive global community that prioritizes personal growth, multi-continental career pathways, and regular mandatory security and technical skill expansion paths.

Senior Site Reliability Engineer

Job Description

Key Skills Required

Position Overview

Key Responsibilities

Required Skills & Qualifications

Preferred Strategic Indicators (Nice to Have)

What We Offer

How would you rate this job post?

Is this company safe?

Safety First