← Back to jobs
Strike
Strike

Site Reliability Engineer

engineeringfull-timeRemote
SALARY
Not listed
WORK TYPE
remote
JOB TYPE
full-time
INDUSTRY
crypto
Apply for this position
✦ AutoApply Let us apply to roles like this on your behalf.
Learn more

About the role

Role:

We are seeking a highly experienced Site Reliability Engineer located in Europe, with a strong track record of tackling complex reliability and scalability challenges, and a history of providing technical guidance to teams. If you're a seasoned problem-solver with a passion for automation and operational excellence, and enjoy elevating the skills of those around you, we want to hear from you.

What You'll Do:

  • Lead Technical Initiatives: Drive key technical initiatives focused on improving the reliability, performance, and scalability of our critical systems, often leading technical aspects within projects.
  • Architect and Implement Advanced Solutions: Design and implement sophisticated resilient and scalable solutions, leveraging your deep understanding of distributed systems.
  • Master Troubleshooting and Optimization: Lead complex troubleshooting efforts, identify deep-seated root causes, and implement advanced optimizations.
  • Build and Evangelize Automation: Develop and champion the adoption of robust automation frameworks and tools, potentially guiding more junior engineers in their development.
  • Elevate Observability Practices: Design and implement comprehensive and insightful monitoring and logging solutions, ensuring actionable insights are available across teams.
  • Provide Leadership in Incident Management: Take a leadership role in incident response, providing critical technical direction and mentorship during high-pressure situations.
  • Champion Post-Mortem Excellence: Lead and contribute to in-depth blameless post-mortem analyses, driving significant improvements based on learnings.
  • Mentor and Guide Team Members: Share your extensive knowledge and experience to mentor and guide other SREs and engineers, fostering their technical growth.

What We're Looking For:

  • Extensive experience with minimum 5 years in SRE, platform engineering, or software development with a strong operational focus.
  • Demonstrated experience in providing technical leadership, guidance, or mentorship to engineering teams.
  • Expert-level practical knowledge of cloud platforms, especially GCP.
  • Deep hands-on experience with container orchestration (Kubernetes) and infrastructure-as-code (Terraform, Helm, ArgoCD).
  • Strong command of multiple scripting and programming languages (Python, Go, Bash).
  • Proven expertise in building and leveraging advanced monitoring and observability tools (Prometheus, Grafana, ELK stack).
  • Exceptional analytical, problem-solving, and debugging skills at a senior level.
  • Excellent communication, collaboration, and influencing skills.
  • Organizational and leadership skills are a big plus

Compensation and Benefits:

  • Location dependent
✦ Let us apply for you
We find roles like this and apply on your behalf. Cover letter written for each one. Plans from $14.99/mo. Cancel anytime.
Join waitlist
Apply now