This is a Engineering Manager (Infra) - AI Reliability (Sydney based) role with Canva based in Sydney, NSW, AU Canva Role Seniority - mid level, senior More about the Engineering Manager (Infra) - AI Reliability (Sydney based) role at Canva Company Description Join the team redefining how the world experiences design. Hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte! Thanks for stopping by. We know job hunting can be a little time consuming and you're probably keen to find out what's on offer, so we'll get straight to the point. Where and how you can work Our flagship campus is in Sydney. We also have a campus in Melbourne and co-working spaces in Brisbane, Perth, Adelaide and Auckland. But you have choice in where and how you work, we trust our Canvanauts to choose the balance that empowers them and their team to achieve their goals. Job Description What you’d be doing in this role As Canva scales change continues to be part of our DNA. But we like to think that's all part of the fun. So this will give you the flavour of the type of things you'll be working on when you start, but this will likely evolve. This role will see you: Building world-class AI infrastructure to support a 100 person research team at the forefront of creative AI Designing and scaling multi-cloud systems that support high-performance model training and inference Partnering across AWS, GCP, Cloudflare and GCore to optimise GPU compute environments Enhancing CI/CD pipelines and developer velocity within our AI platform teams Improving monitoring, alerting and system observability for AI workloads Driving alignment in DevOps best practices across the AI platform and CORE engineering teams Leading a high-impact engineering team in a fast-paced, cutting-edge environment You're probably a match if You’ve led DevOps or infrastructure teams, ideally in AI or high-performance computing environments You’ve worked closely with AI researchers or research teams , enabling experimentation, model training, or evaluation at scale You’ve built or led infrastructure that prioritises research velocity and reliability , not just production uptime You’re experienced with AWS (ECS, EC2, S3, IAM) and multi-cloud environments like GCP, Cloudflare or GCore You’ve worked with Kubernetes, SLURM, or similar distributed training infrastructure You’re fluent in infrastructure as code tools like Terraform You understand the lifecycle of AI models and how to support R&D at scale You have a strong grasp of containerisation, Linux fundamentals, and cloud networking You’re collaborative, curious, and passionate about enabling others to move fast and safely About the team You’ll be joining CORE (Canva Original Research & Exploration) — our in-house AI research lab. CORE brings together researchers, engineers, and infrastructure specialists to build world-class models that unlock creativity at scale. With over 100 researchers already in the team, this role is foundational to scaling our AI training and inference capabilities as we push the boundaries of what's possible in product-integrated AI. What's in it for you? Achieving our crazy big goals motivates us to work hard - and we do - but you'll experience lots of moments of magic, connectivity and fun woven throughout life at Canva, too. We also offer a range of benefits to set you up for every success in and outside of work. Here's a taste of what's on offer: Equity packages - we want our success to be yours too Inclusive parental leave policy that supports all parents & carers An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup & more Flexible leave options that empower you to be a force for good, take time to recharge and supports you personally Check out lifeatcanva.com for more info. Other stuff to know We make hiring decisions based on your experience, skills and passion, as well as how you can enhance Canva and our culture. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process. We celebrate all types of skills and backgrounds at Canva so even if you don’t feel like your skills quite match what’s listed above - we still want to hear from you! Please note that interviews are conducted virtually. Before we jump into the responsibilities of the role. No matter what you come in knowing, you’ll be learning new things all the time and the Canva team will be there to support your growth. Please consider applying even if you don't meet 100% of what’s outlined Key Responsibilities ️ Building AI infrastructure Designing multi-cloud systems Leading engineering teams Key Strengths DevOps leadership ☁️ Multi-cloud expertise ️ Infrastructure as code Containerization knowledge Collaboration skills AI model lifecycle understanding A Final Note: This is a role with Canva not with Hatch.