Middle DevOps Engineer
About the role
About Our Environment
Our product supports over 67k B2B customers, managing business-critical document workflows that demand top availability and performance even under high load. Our highly resilient infrastructure runs on AWS and Kubernetes (EKS). We operate a service-oriented architecture comprising hundreds of microservices written primarily in Python and Java. Services interact using synchronous protocols (NATS, gRPC) and asynchronous, event-driven operations (RabbitMQ, Kafka, Debezium, and Flink). We manage hundreds of databases, predominantly PostgreSQL, with exposure to MySQL, MongoDB, OpenSearch, and Redis, handling many TBs of data. Due to the scale and complexity of this environment, all infrastructure management and service delivery are strictly enforced through Infrastructure as Code (IaC) and modern CI/CD practices to maintain a high availability target of 99.99%.
Key Responsibilities
- Platform & IaC Ownership: Analyze and implement infrastructure designs for services and shared components, managing them as Infrastructure as Code (IaC) using tools like Terraform and Helm within our cloud environment (AWS).
- Delivery Lifecycle Management: Design and implement robust CI/CD pipelines and own the full delivery lifecycle of infrastructure tools, services, and components from development testing through to production rollout.
- Developer Enablement: Actively participate in regular support cadences to provide hands-on technical assistance and expertise to development teams regarding platform adoption and usage.
- Reliability Integration: Integrate and maintain monitoring, logging, and alerting components for platform services, and participate in the team's on-call rotation for immediate incident mitigation within the platform ownership scope.
- Security & Compliance: Collaborate closely with the Security team to embed DevSecOps best practices and guardrails, ensuring the security and compliance of the platform and delivery process.
- Process Improvement: Drive continuous improvements in platform tooling usability, deployment efficiency, and environment stability.
Required Skills and Experience
- Education: While a degree in Computer Science or IT is a nice-to-have, we’re most interested in your technical skills and real-world experience.