Senior Infrastructure Engineer
About the role
Senior Infrastructure Engineer (OpenStack)
Location: UK (Remote)
Department: Infrastructure
Reporting to: Head of Infrastructure
About Nexgen Cloud
NexGen Cloud is the company behind Hyperstack, a full-stack AI cloud serving tens of thousands of customers from AI researchers to enterprises running the world's most compute-intensive workloads. We deliver on-demand and private GPU infrastructure to teams who treat performance as a requirement, not a feature.
We're a tight-knit, fast-moving team working at the cutting edge of AI cloud infrastructure. We practice what we preach, equipping our people with AI at every level so we can solve harder problems, ship faster, and keep raising the bar for what enterprise GPU infrastructure looks like.
The Role: Senior Infrastructure Engineer (OpenStack)
This role exists because our platform is scaling quickly — and complexity comes with it. As we expand our OpenStack and Kubernetes environments globally, we need engineers who can take real ownership of how the platform is designed, operated, and improved. You'll have direct ownership over business-critical infrastructure that impacts performance, reliability, and customer experience.
This is not a maintenance role. If you like solving hard problems, owning systems end-to-end, and seeing the impact of your work immediately — you'll enjoy this.
What You'll Be Doing
Rather than a long checklist, here's what success in this role looks like:
- Own the design, deployment, and operation of OpenStack and Kubernetes environments — ensuring platform performance, scalability, and resilience for GPU workloads
- Build and improve infrastructure using infrastructure-as-code and GitOps practices, driving automation across provisioning, deployment, and operational workflows
- Optimise GPU workload scheduling using Kubernetes and NVIDIA tooling, and implement monitoring, logging, and alerting to ensure platform stability
- Lead incident response and drive continuous improvement of reliability across the platform
- Maintain strong security controls across infrastructure and container layers — RBAC, network policies, and tenant isolation
- Work closely with Platform, DevOps, AI, Product, and Support teams to align infrastructure capabilities with customer and platform requirements
About You
We're more interested in how you think and work than in a perfect CV. You'll likely bring