StackBlitz

Staff Site Reliability Engineer

Engineering · Full-time · Remote
Salary: Not listed
Work type: Remote
Job type: Full-time
Industry: AI

About the role

About Us

We’re Bolt.new by StackBlitz! We’re the team that brought you WebContainers, the first-of-its-kind technology that made it possible to run Node.js right inside your browser. That breakthrough kicked off our journey in 2019, and it’s what powers the blazing-fast online IDE used by over 1 million developers every month. But we didn’t stop there. We doubled down on everything we learned and built Bolt.new — the fastest way to go from idea to production without writing traditional code. It’s a next-gen, AI-powered app builder that helps you create, edit, and deploy full-stack web and mobile apps instantly, right in your browser. No installs. No setup. Just smart automation and instant dev environments that let you move at the speed of thought. We’re a fully remote team, globally distributed, deeply collaborative, and seriously passionate about building the future of software development.

About This Opportunity

As a Staff Site Reliability Engineer, you'll be the reliability conscience of our engineering organization, embedding with product and platform teams from the earliest stages of a project, shaping designs, and making sure what we build is observable, scalable, and operable long before it reaches production. The heart of this role is making the pager ring less over time, but the pager is real. Every SRE here shares our on-call rotation, and sometimes the work genuinely is rolling up your sleeves and digging into a live incident. You'll set technical direction, define the standards other engineers build against, and drive initiatives that span multiple teams. This is a high-influence individual-contributor role: you won't manage people, but you will change how the whole organization thinks about reliability. Your lasting impact is the incidents that never happen because reliability was designed in from the start, at the scale of millions of developers building real products on Bolt.new every day.

How You'll Contribute

  • Embed With Teams Early: Partner with development teams throughout the project lifecycle, from design and architecture reviews through launch readiness. Bring an SRE perspective before code is written, not after it breaks. Shepherd projects to completion with reliability designed in.
  • Define Production-Readiness Standards: Establish and evolve the design reviews, launch checklists, and operational acceptance criteria that projects pass through, and own how teams adopt them across the org.
  • Make Reliability Measurable: Define meaningful SLIs, SLOs, and error budgets in collaboration with product and engineering, and help teams use them to make real prioritization decisions.
  • Build the Paved Roads: Create the frameworks, tooling, and golden paths across AWS, GCP, and Azure, with Terraform as the common backbone, that make the reliable way the easy way for every engineer.
  • Cross-Team Leadership: Partner across engineering, product, and design to align reliability work with business objectives. Influence roadmaps, resolve technical disagreements, identify process and technical debt across the organization, and propose solutions that accelerate velocity.