Site Reliability Engineer (SRE)

Fireblocks
Full-time
United States
$150,000 - $201,000
Posted 5 months ago

Job Description

Fireblocks is seeking a Site Reliability Engineer to establish observability tools and manage production systems. The role involves improving monitoring, handling incidents, collaborating with R&D, and automating tasks to ensure system reliability and performance.

Responsibilities

  • Improve monitoring, alerting, and observability
  • Handle critical alerts and incidents
  • Identify root causes and prevent incidents
  • Collaborate with R&D and Support
  • Document actions and automate tasks using Python, Lambda, etc.
  • Focus on observability, availability, reliability, and performance
  • Perform on-call duties and emergency response

Requirements

  • 3+ years of SRE or infrastructure/backend experience in SaaS
  • Proficiency in Python/JavaScript/Bash
  • 3+ years of experience with Alerting & Monitoring systems
  • Experience with Linux systems
  • Cloud systems experience (AWS, Google Cloud, Azure)
  • Configuration management experience (Ansible, Chef, Puppet, ArgoCD)
  • Experience with Docker, Kubernetes, and Helm
  • SCM experience (Git, etc.)
  • Strong analytical and troubleshooting skills
  • Strong communication skills

Benefits

  • No benefits