Back to Jobs
Development 48d ago

Director of Site Reliability Engineering (SRE)

United StatesUnited States
Full-time
Not Disclosed
Executive

Job Description

Key Skills Required

Master these to land this role

DevOpsBestseller 🔥
Learn in 63 Hours
BackendBestseller 🔥
Learn in 18 Hours
SRECloud OperationsInfrastructureLeadership

Want to know if you're a match for this job?

Calculate My Match Score

About the Role: We are seeking an ambitious and accomplished Director of SRE to join our Cloud Operations leadership team. In this role, you will lead the front-line teams responsible for delivering mission-critical SRE production services. You will be directly accountable for Backblaze's production infrastructure and performance against key SLOs. As a champion of engineering excellence, you'll focus on performance measurement, incident/change management, problem resolution, and process discipline.

What You'll Do

  • Lead a globally distributed team of 15+ highly technical teammates providing 24/7 services for SRE.
  • Own the single source of truth for the state of production and centrally manage all aspects of incident and change management.
  • Maintain a culture of continuous improvement, leveraging operational data to prioritize work across teams.
  • Lead and coordinate strategic initiatives to evolve and improve production support, incident/change/asset management.
  • Liaise with Vendor Management and Legal to manage critical contract renewal cycles.
  • Establish department-level objectives, policies, and procedures, creating OKRs or other measurements.
  • Recruit and coach the team to support Backblaze and individual career objectives.
  • Manage department budget and collaborate closely with Infrastructure Engineering, Customer Support, and Data Center Operations.

The Right Fit

  • 6+ years of management experience, with at least 3 years at the Director level.
  • 5+ years of hands-on technical experience in a field related to the team's focus.
  • Proven experience in a similar leadership role within the MSP or Infrastructure-as-a-Service industry.
  • Significant experience in cloud-scale data center systems and services.
  • Significant experience in managing mission-critical operations of complex global infrastructure.
  • Strong analytical and problem-solving abilities, with a data-driven approach to decision-making.
  • Excellent collaboration and communication skills, including building high-performing teams.
  • Ability to travel domestically and internationally as needed.

Backblaze Perks

  • RSU grants for full-time employees and Annual Company bonus plan.
  • Healthcare for family, including dental and vision.
  • 401K, ESPP program, and Flexible vacation policy.
  • Maternity & paternity leave, and Childcare bonus.
  • MacBook Pro for work plus a generous stipend to personalize your workstation.
  • Fertility treatment and support, learning & development program, and commuter benefits.

How would you rate this job post?

See what other professionals think about this role.

Is this company safe?

Ask Hyrizon AI to scan this company for potential red flags before you apply.

Safety First

  • Never pay for a job application.
  • Do not share sensitive bank info.
  • Verify the client before starting work.