
Senior Machine Learning Engineer - Training Platform (AU remote)
Canva
Sydney
2 days ago
Senior Machine Learning Engineer - Training Platform (AU remote)
Canva is looking for a Senior Machine Learning Engineer to join the Training Platform team within the AI Platform group. This remote-friendly role based in Australia focuses on designing, scaling, and maintaining training infrastructure for AI workloads using Kubernetes. The ideal candidate has strong experience in training pipelines, distributed systems, and cloud infrastructure.
Hybrid
Full-time
Senior
Kubernetes
Ray
Salary
Not specified
Core Qualifications
Technical (Must-have)
KubernetesRayPyTorch distributed trainingFSxEFAHPC environmentscontainerized workloadsdistributed training frameworks
Soft Skills
collaborationownershipproblem solvingmentoring
Key Responsibilities
- Contribute to the evolution of Canva's unified training platform for AI training workloads
- Improve reliability, observability, debugging, and operational support for training systems
- Design and build platform capabilities for better scheduling, resource allocation, priority management, and quota management
- Collaborate with research scientists, ML engineers, product teams, and cloud/infrastructure teams
- Contribute to system design and architecture decisions across Canva's AI Platform
- Help shape platform roadmap and priorities based on user pain points and adoption needs
- Mentor engineers and share best practices in AI systems and infrastructure
Senior Machine Learning EngineerTraining PlatformKubernetesDistributed systemsAICloud infrastructurePyTorchRayAustralia remoteCanva