About the Client
Our client is a fast-scaling, technology-driven company operating at the intersection of digital infrastructure and modern supply chain solutions. With a strong emphasis on innovation, automation, and cross-functional collaboration, the organization is deeply invested in cloud-native technologies and platform scalability. They're on a mission to enable seamless product delivery through highly reliable, performant, and secure infrastructure systems.
About the Role
Our client is seeking a hands-on Head of DevOps Engineering to lead the charge in redefining their infrastructure, platform operations, and incident response strategy. This is a deeply technical leadership role suited for a Principal-level DevOps or SRE architect who thrives on building scalable systems, driving automation, and fostering a high-ownership engineering culture. You'll be responsible for setting vision and execution across infrastructure, CI / CD, Kubernetes environments, and observability-while mentoring a growing team.
Responsibilities
- Architect & Automate : Lead the design and implementation of repeatable, Infrastructure-as-Code (IaC) environments-governing over 90% of infra via Terraform and GitOps principles.
- CI / CD Ownership : Redesign and own artifact-based deployment pipelines enabling safe, self-service deployments using tools like ArgoCD, Helm, and Docker.
- Platform Engineering : Build multi-account, multi-region Kubernetes infrastructure leveraging EKS / ECS, with intelligent autoscaling (Karpenter, HPA) and containerized workloads.
- Cost Optimization : Lead cloud cost control initiatives, including SPOT instance utilization, rightsizing, tagging strategies, and architectural remediation-driving measurable efficiency gains (25%+ reduction).
- Observability & Reliability : Roll out comprehensive observability tooling-centralized logging, alerting, tracing-and establish actionable IR runbooks and downtime mitigation strategies.
- Team Leadership & Culture : Mentor DevOps / SRE engineers, establishing a culture of speed, operational excellence, and continuous feedback.
- Cross-functional Enablement : Collaborate with engineering and product teams to foster GitOps workflows and enable safe, low-trust deployments that reduce operational friction.
- Process Standardization : Develop strategic roadmaps, author runbooks, and create reusable deployment patterns and documentation to scale DevOps practices across the organization.
Requirements
Experience : 8+ years in DevOps, Site Reliability Engineering, or Infrastructure roles-preferably in high-scale, cloud-native environments.Technical Leadership : Proven experience architecting robust CI / CD systems and infrastructure platforms, with an ops-focused mindset.IaC Mastery : Deep knowledge of Terraform (modular design), GitOps workflows, and infrastructure-as-code best practices.Kubernetes Expertise : Extensive hands-on experience with Kubernetes (EKS / ECS), Helm, and autoscaling solutions such as Karpenter and HPA.Cloud Cost Management : Demonstrated success in optimizing cloud usage and spend at scale (e.g., AWS tagging, SPOT strategies, rightsizing).Observability Focused : Strong understanding of observability stacks, with experience building SLOs, SLIs, monitoring dashboards, and incident workflows.Mentorship & Influence : Strong ability to lead and mentor senior engineers while driving organizational best practices across multiple teams.Communication Skills : Effective communicator able to distill complex technical systems to cross-functional stakeholders.Benefits & Why Join
Competitive compensation ($215,000-$230,000 total annual package, including bonus and / or equity)High-impact leadership role with strategic influence across engineering and operationsComprehensive health, dental, and vision insuranceGenerous PTO and company-observed holidays401(k) retirement plan with potential employer matchingFSAs and pre-tax commuter benefitsAccess to wellness and mental health support programsOpportunity to shape and lead a modern DevOps organization from the ground up