Site Reliability Engineer

Offchain Labs
Full-time
Remote
Posted on 24 days ago

Job Description

Offchain Labs is seeking a Site Reliability Engineer to join their team and contribute to the scaling and security of blockchain technology. The role involves tackling real-world challenges, solving infrastructure problems, and ensuring system reliability within a remote-first environment. The company is at the forefront of Ethereum scaling solutions, powering Arbitrum and its growing ecosystem.

Responsibilities

  • Operating production Kubernetes clusters
  • Building scalable, declarative infrastructure
  • Deploying and maintaining Kubernetes environments
  • Designing CI/CD workflows
  • Designing and operating observability systems
  • Diagnosing networking and storage issues
  • Implementing secure-by-default infrastructure
  • Automating operational workflows
  • Responding to incidents and troubleshooting under pressure
  • Driving postmortems to improve system reliability

Requirements

  • Experience with blockchain technology
  • Ability to solve infrastructure problems unconventionally
  • Proficiency with tools like k9s or ArgoCD
  • Experience with GitOps-style systems
  • Comfort with Linux and shell scripting
  • Proficiency in Python or Go
  • Experience with cloud platforms (AWS, GCP, Azure)
  • Experience with on-call rotations
  • Design systems with security in mind
  • Strong problem-solving skills
  • Commitment to high-quality work
  • Ability to take ownership and collaborate openly

Benefits

  • No benefits