Sr. Manager, Site Reliability Engineering

Xometry
Full-time
Boston, MA
Posted on 3 months ago

Job Description

Xometry is seeking a Sr. Manager of Site Reliability Engineering (SRE) to join their organization. The Sr. Manager will be responsible for crafting the strategic direction for SRE teams and initiatives, helping Xometry build cost-effective, secure, fast, and reliable systems for their global manufacturing marketplace.

Responsibilities

  • Define standards, metrics, and practices to improve operational rigor, efficiency, and engineering velocity
  • Establish automated and self-service strategies to improve operational efficiency and development team self-sufficiency
  • Champion and measure observability, monitoring, and metrics practices
  • Supervise development, configuration, and maintenance of underlying platforms for deployed software
  • Supervise development, configuration, and maintenance of observability and monitoring tools
  • Supervise development, configuration, and maintenance of software development (CI/CD) tools

Requirements

  • 7+ years of experience in software development and site reliability
  • An iterative approach to balance short-term priorities with a long-term target architecture
  • Proven track record of building and growing a high-performing SRE team
  • Strong understanding of infrastructure automation observability within distributed systems
  • Experience in defining & operationalizing SLOs, SLAs, and error budgets
  • Demonstrated ability to interact and communicate effectively with various levels of stakeholders
  • US person (citizen or green card holder)

Benefits

  • No benefits