Staff Backend Engineer - Adaptive Telemetry | Canada | Remote
About the role
This is a remote position. We are looking for candidates in Canadian time zones.
What is Grafana Cloud?
Grafana Cloud is our composable observability platform that integrates metrics, logs, traces, and profiles with Grafana. It allows our customers to leverage the best open source observability software – including Prometheus, Mimir, Loki, Tempo, and Pyroscope – without the overhead of installing, maintaining and scaling their own observability stack.
The Databases department owns and operates the telemetry databases: Mimir for metrics, Loki for logs, Tempo for traces, and Pyroscope for profiles. We offer these databases as a cloud service that supports Grafana Cloud.
Adaptive Telemetry Group
The Adaptive Telemetry group, part of the Databases department, has the mission of ensuring that all telemetry stored in our databases is worthy of attention. Under that mission, the group is responsible for the development of Adaptive Metrics, Adaptive Logs, Adaptive Traces and Adaptive Profiles.
Our Adaptive Telemetry solutions give users the ability to control and optimize their telemetry data. These solutions ensure that data storage is optimized based on individual usage patterns, so only the most valuable data is retained.
As a company, we are remote-first and global. We embrace people of different experiences and backgrounds to build diverse teams where every person brings a new perspective to the software.
What you will be doing:
- Drive technical strategy and roadmap. Proactively define the architectural vision, prioritize work that unlocks major product or platform improvements, and influence product and engineering decisions.
- Lead end-to-end delivery of large, cross-functional projects. Own planning, design, execution, rollout and long-term operation of large initiatives.
- Own architecture, reliability, performance and cost for critical systems. Make pragmatic architecture choices that balance scalability, availability, latency and cost while ensuring systems remain maintainable and evolvable.