Director, Managed Services Global Cloud Operations
About the role
Overview
The Director, Managed Services Global Cloud Operations will own the strategy, architecture, and global execution of InterSystems’ cloud-native Service Delivery Platform. This platform—built on Kubernetes and Spectro Cloud—underpins all SaaS and Managed Services delivery for InterSystems IRIS and the broader portfolio across private datacenters and public cloud providers.
This role is accountable for leading globally distributed engineering and operations teams that design, build, and run a highly available, secure, and scalable platform spanning InterSystems datacenters, AWS, Azure, and GCP. The Director will be responsible not only for operational excellence, but also for the cloud architecture, automation frameworks and software-driven operational model required to deliver IRIS as a truly global, cloud-native managed service.
Key Responsibilities
Platform Ownership & Cloud Architecture
- Own the end-to-end architecture and global operation of the InterSystems Service Delivery Platform, spanning Spectro Cloud, Kubernetes, Cilium, Helm, and multi-cloud infrastructure services.
- Define and evolve a cloud-first reference architecture supporting consistent, secure, and resilient service delivery across InterSystems datacenters and public cloud regions.
- Act as the senior technical authority for Kubernetes platform strategy, multi-cloud design, and runtime operational patterns.
Engineering-Led Operations & Automation
- Drive a platform engineering and SRE operating model, where infrastructure and operations are delivered through software, automation, and repeatable pipelines.
- Establish Infrastructure as Code (IaC) standards and practices using Terraform, Helm, and GitOps-based workflows to ensure deterministic, auditable platform deployments.
- Partner with Product Engineering and Cloud Engineering to define CI/CD pipelines, deployment automation, and lifecycle management for IRIS and supporting services.
- Ensure platform changes are engineered, tested, versioned, and rolled out with the same rigor as application software.
Reliability, Scale & Observability
- Define and enforce service-level objectives (SLOs), error budgets, and operational KPIs across regions and clouds.
- Lead the evolution of observability and telemetry using Coralogix and complementary tooling to enable full-stack visibility, proactive detection, and automated remediation.
- Establish mature incident response, problem management, and continuous improvement processes aligned with SRE best practices.
Security, Compliance & Risk
- Partner with Global Security, Compliance, and Risk teams to ensure platform architecture and operations meet regulatory, data protection, and access control requirements.
- Ensure security is embedded by design across Kubernetes, cloud infrastructure, CI/CD pipelines, and operational tooling.
Financial Stewardship & Growth Enablement
- Own capacity planning, scaling strategies, and cross-cloud cost management, balancing reliability, performance, and financial efficiency.
- Enable rapid global growth of Managed Services and SaaS offerings through standardized, scalable platform capabilities.
Leadership & Organizational Development
- Lead and grow a globally distributed organization of platform engineers, SREs, and operations leaders.
- Build a culture of accountability, technical excellence, and continuous learning.