Infrastructure Software Engineer
About the role
About Nebius
Nebius is leading a new era in cloud infrastructure for the global AI economy. We are building a full-stack AI cloud platform that supports developers and enterprises from data and model training through to production deployment, without the cost and complexity of building large in-house AI/ML infrastructure. Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own the hard problems across compute, storage, networking and applied AI.
The Role
Nebius operates large-scale, mission-critical bare-metal infrastructure. As a Software Engineer (Python), you will design and build systems that provision, configure, test, and manage physical hardware at scale. Your work will sit close to the metal—interfacing directly with servers, networks, and management controllers—while supporting highly automated, reliable infrastructure operations. You will collaborate closely with hardware, networking, and data center operations teams to ensure our platforms are robust, scalable, and production ready.
Your responsibilities will include:
- Design and develop backend services and automation in Python
- Build and maintain systems for hardware provisioning, testing, and lifecycle management
- Develop software that runs directly on bare-metal environments
- Integrate with Linux systems, using Bash and low-level tooling where needed
- Implement and maintain CI/CD pipelines for infrastructure-focused software
- Work with networking services including IPv4/IPv6, DHCP, DNS, network boot, and server boot workflows
- Interface with BMC controllers and management protocols (IPMI-style protocols, HTTP-based standards)
- Enable reliable hardware interaction and automation across large fleets
- Support ARM64 / ARM64EC architectures
- Design and integrate NoSQL data stores for system state and orchestration data
- Write clear documentation and contribute to operational excellence
What we expect you to have:
- Strong professional experience as a software engineer, with a focus on Python
- Solid experience with Linux systems and shell scripting
- Hands-on experience working with bare-metal servers or low-level infrastructure
- Strong understanding of networking fundamentals (IPv4/IPv6, DHCP, DNS, PXE / network boot)
- Experience interacting with hardware management interfaces (BMC, IPMI-like protocols, HTTP APIs)
- Familiarity with CI/CD systems and production deployment workflows
- Experience designing or working with NoSQL databases
- Ability to debug complex issues spanning software, hardware, and networks