Strike

Site Reliability Engineer

engineeringfull-timeRemote

SALARY

Not listed

WORK TYPE

remote

JOB TYPE

full-time

INDUSTRY

crypto

Apply for this position

✦ AutoApply Let us apply to roles like this on your behalf.

Learn more

About the role

Role:

We are seeking a highly experienced Site Reliability Engineer located in Europe, with a strong track record of tackling complex reliability and scalability challenges, and a history of providing technical guidance to teams. If you're a seasoned problem-solver with a passion for automation and operational excellence, and enjoy elevating the skills of those around you, we want to hear from you.

What You'll Do:

Lead Technical Initiatives: Drive key technical initiatives focused on improving the reliability, performance, and scalability of our critical systems, often leading technical aspects within projects.
Architect and Implement Advanced Solutions: Design and implement sophisticated resilient and scalable solutions, leveraging your deep understanding of distributed systems.
Master Troubleshooting and Optimization: Lead complex troubleshooting efforts, identify deep-seated root causes, and implement advanced optimizations.
Build and Evangelize Automation: Develop and champion the adoption of robust automation frameworks and tools, potentially guiding more junior engineers in their development.
Elevate Observability Practices: Design and implement comprehensive and insightful monitoring and logging solutions, ensuring actionable insights are available across teams.
Provide Leadership in Incident Management: Take a leadership role in incident response, providing critical technical direction and mentorship during high-pressure situations.
Champion Post-Mortem Excellence: Lead and contribute to in-depth blameless post-mortem analyses, driving significant improvements based on learnings.
Mentor and Guide Team Members: Share your extensive knowledge and experience to mentor and guide other SREs and engineers, fostering their technical growth.

What We're Looking For:

Extensive experience with minimum 5 years in SRE, platform engineering, or software development with a strong operational focus.
Demonstrated experience in providing technical leadership, guidance, or mentorship to engineering teams.
Expert-level practical knowledge of cloud platforms, especially GCP.
Deep hands-on experience with container orchestration (Kubernetes) and infrastructure-as-code (Terraform, Helm, ArgoCD).
Strong command of multiple scripting and programming languages (Python, Go, Bash).
Proven expertise in building and leveraging advanced monitoring and observability tools (Prometheus, Grafana, ELK stack).
Exceptional analytical, problem-solving, and debugging skills at a senior level.
Excellent communication, collaboration, and influencing skills.
Organizational and leadership skills are a big plus

Compensation and Benefits:

Location dependent

✦ Let us apply for you

We find roles like this and apply on your behalf. Cover letter written for each one. Plans from $15/mo. Cancel anytime.

Get AutoApply

Apply now