Honeycomb

Senior Site Reliability Engineer

engineeringfull-timeRemote - Ireland

SALARY

Not listed

WORK TYPE

remote

JOB TYPE

full-time

INDUSTRY

general

Apply for this position

✦ AutoApply Let us apply to roles like this on your behalf.

Learn more

About the role

What We're Building

Honeycomb is a service for the near and present future, defining observability and raising expectations of what developer tools can do! We're working with well known companies like HelloFresh, Slack, LaunchDarkly, and Vanguard and more across a range of industries. This is an exciting time in our trajectory, we've closed Series D funding, scaled past the 200-person mark, and were named to Forbes' America's Best Startups of 2022 and 2023!

If you want to see what we've been up to, please check out these blog posts and Honeycomb.io press releases.

Who We Are

We come for the impact, and stay for the culture! We're a talented, opinionated, passionate, fiercely inclusive, and responsible group of bees. We have conviction and we strive to live our values every day. We want our people to do what they truly love amongst a team of highly talented (but humble) peers.

How We Work

We are a fully distributed company, which means we believe it is not where you sit, but how you deliver that matters most. We invest in our people and care about how you orient to our culture and processes. At the same time we imbue a lot of trust, autonomy, and accountability from Day 1.

Little more about the team:

Honeycomb's Site Reliability Engineering (SRE) team works at the intersection of infrastructure, developer experience, and organizational enablement. We lead technically complex, cross-team projects that improve reliability, scale systems, and make life easier for engineering teams. We're trusted across the company to set direction, solve ambiguous problems, and build processes that run smoothly. Our work spans AWS infrastructure, Kubernetes, Helm, Terraform, Kafka, and other tools, aligned with the sociotechnical needs of scaling a fast-growing company. We're a collaborative, diverse team that values experimentation, data-driven decisions, and maintaining a safe environment for healthy debate and innovation.

What you'll do in the role:

Help Honeycomb scale our backend systems to support our highest-volume customers.
Build organizational trust through transparent communication, giving and receiving direct and kind feedback.
Work with other backend teams to dive deep into our stack to make sure we're getting the most out of our infrastructure.
Be trained, become, and then train others as an Incident Commander.
Help SRE and Honeycomb develop a healthy cross-Atlantic engineering culture.
Participate in the team's on-call rotation as the EU side of a new follow-the-sun rotation.
Help the organization navigate tradeoffs between reliability and its other goals and priorities.
Optional: act as an external ambassador through blog posts, conference talks, and presentations with support from our DevRel team.

What You'll Bring:

Strong experience in AWS and Kubernetes.
Experience performing cost analysis and reduction.
Solid Helm, Terraform, and CI/CD experience.
Project management skills.
Software engineering experience (Golang is a plus, and so is performance engineering).
Experience with Kafka or another high-volume distributed system.
Excellent written and spoken communication skills, with the ability to tailor your communication for your audience and give direct feedback when you notice something wrong.
A curiosity to learn how people and systems work, and the willingness to make them partners in your initiatives.
Familiarity with observability

✦ Let us apply for you

We find roles like this and apply on your behalf. Cover letter written for each one. Plans from $15/mo. Cancel anytime.

Get AutoApply

Apply now