Airbnb
AI & Machine Learning

Senior Staff Operations Engineer, AIOps

Remote (United States)
Full-time
$212,000–$265,000 USD

Job Description

The Community You Will Join:

BizTech fosters culture and connection at Airbnb by providing reliable corporate tools, innovative products, and technical support for all teams. We drive technical breakthroughs and strategies that redefine what it means to belong anywhere, delivering greater value for the business and our people. The Global Operations team at BizTech manages production services across Airbnb’s corporate environment, delivering reliable operations through Observability, Incident Management, Core Operations, and AI-enabled automation. We partner across BizTech to scale service quality, efficiency, and resilience.

The Difference You Will Make:

As a Senior Staff Engineer in Operations, you will lead and mentor a high-performing team to scale our AI-enabled operations model and deliver AIOps solutions that streamline operational workstreams and help BizTech teams focus on their core work with confidence. The Operations team owns triage and resolution, runs proactive monitoring across networks, systems, applications, and cloud services via a homegrown observability platform, and drives process excellence through automation and shift-left programs. You will set the technical bar, model operational excellence, and ensure high-quality, reliable service.

  • Your scope includes leading projects across multiple products and platforms, delivering world-class outcomes that create customer and community value while balancing near- and long-term needs.
  • You will own the AIOps vision, strategy, and roadmap, partnering with the in-house Observability platform owner to leverage infrastructure and data for accurate, correlated insights that streamline Operations. You’ll drive execution by setting priorities, building accountability, and collaborating effectively across teams. You will partner with BizTech engineering teams to improve service efficiency and security, and lead 1–3 year operations architecture planning to connect production systems and improve compatibility and stability.
  • You will identify and eliminate recurring issues through scalable automation, improving operational performance and productivity. You’ll also lead the development and maintenance of testing and monitoring tooling to ensure automation platforms run reliably.
  • You will be accountable for the quality and reliability of BizTech services, including validating postmortems, driving root-cause analysis, and ensuring corrective actions are implemented.

A Typical Day:

  • You’ll lead technical strategy and discussions, partnering with Operations peers and cross-functional BizTech teams to build AIOps and automation solutions.
  • You’ll stay on top of tasks, engagements, and team interactions; active collaboration is key to success.
  • You’ll work in sprints, delivering project work across coding, testing, design, documentation, and operational readiness reviews.
  • You’ll dedicate part of each day to core Operations work: triaging tickets, spotting patterns, and driving scalable fixes that improve efficiency.
  • You’ll participate in an on-call rotation, leading high-severity incident response as both incident commander and operations engineer.

Your Expertise:

  • 15+ years of experience across AIOps, data catalog architecture, product development, and/or Technical Operations infrastructure.
  • Strong SDLC experience, including infrastructure as code, configuration management, distributed version control, and CI/CD.
  • Deep expertise in complex enterprise infrastructure, especially cloud (AWS and/or Google Cloud), with a focus on AI/automation, data catalog architecture, workflows, and correlation.
  • Solid understanding of corporate infrastructure and applications, with the ability to translate that knowledge into AIOps requirements and integrations.
  • Proven ability to lead cross-team, cross-org delivery of large-scale, technically complex, ambiguous initiatives that anticipate business needs.
  • Proficient in Python or Go.
  • Experience building API integrations and event-driven architectures (e.g., AWS Lambda/SQS).
