Description
The opportunity We are seeking a highly skilled and experienced DevOps Tech Lead to provide technical leadership across multiple DevOps teams within our R&D organization. As a key member of our DevOps and Infrastructure group at Aura, you will be instrumental in driving best practices, fostering technological innovation, and ensuring operational excellence across our infrastructure. This role offers a unique opportunity to shape the future of our cloud-native solutions, developer experience, and data infrastructure — including the rapidly growing AI and machine learning platform layer that underpins Aura's next generation of intelligent products. Design Cloud-Native Solutions: Architect and implement scalable cloud infrastructure across multiple environments using Kubernetes and IaC — including purpose-built platforms for AI/ML training, GPU-accelerated workloads, and real-time model serving. Lead Technical Strategy: Define the technical vision and roadmap for our DevOps capabilities, aligning multiple teams around common patterns, tools, and practices that drive efficiency and reliability. Boost Engineering Productivity: Enhance Developer Portal, CI/CD pipelines, and internal tooling to shorten release cycles and reduce friction. Build self-service capabilities that let data scientists and ML engineers deploy and monitor models without DevOps intervention. Optimize Data & AI Infrastructure: Engineer robust platforms that support our data-intensive workloads, ensuring performance, reliability, and cost-effectiveness for critical business intelligence systems. Drive DevOps Transformation: Champion modern DevOps practices across engineering, establishing shared standards for how Aura builds, deploys, and operates software at scale. Implement AI Observability: Build comprehensive tracking for AI/ML usage, cost attribution, and governance across users and services. Mentor & Build Culture: Grow the next generation of technical leaders while fostering a culture of innovation, knowledge sharing, and continuous improvement within the DevOps teams. What you'll be doing - Technical Leadership: Shape the future of Aura's infrastructure and influence architectural decisions across the organization. - Wide Range of Technologies: Work across the full DevOps spectrum – from IaC and Kubernetes to data platform and developer productivity tools. - Personal Development: Clear path for career advancement as you demonstrate success and impact — at the intersection of DevOps and AI infrastructure, one of the fastest-growing engineering domains. - Innovation Playground: Experiment with emerging technologies and implement solutions at scale — including GPU scheduling, vector databases, and LLM serving frameworks. - Strategic Impact: Direct line of sight between your work and Aura's business outcomes What we're looking for - 7+ years in DevOps, SRE, or platform engineering, with 2+ years leading teams or tech-leading across multiple squads. - Deep expertise with Kubernetes — architecture, networking, security, and cluster operations at scale. - Hands-on experience with Pulumi (or equivalent IaC) and at least one major cloud provider (AWS, GCP, or Azure). - Strong CI/CD expertise — designing and maintaining pipelines at scale (e.g., GitHub Actions, Jenkins, ArgoCD). - Solid understanding of networking, security, and observability in cloud-native environments. - Experience with monitoring and observability stacks (e.g. Prometheus, Gr