
Staff Machine Learning Engineer, ML Efficiency
Reddit, Inc.
Reddit seeks a Staff Machine Learning Engineer for the ML Efficiency team to build infrastructure and tooling for efficient ML training and inference. The role involves improving hardware utilization, optimizing distributed systems, and driving performance improvements. It requires 5+ years of software engineering experience and proficiency in Python.
Key Responsibilities
- Design and build systems that improve the efficiency of ML training and inference workloads.
- Develop tooling that helps ML engineers debug, profile, optimize, and monitor model performance.
- Improve GPU and general resource utilization through scheduling, resource management, caching, and workload optimization.
- Partner with ML researchers and product teams to identify bottlenecks and drive performance improvements.
- Build benchmarking frameworks and performance dashboards for training and serving systems.
- Optimize distributed training infrastructure, data pipelines, and model serving architectures.
- Lead cross-functional initiatives that improve the productivity of Reddit ML engineers.
- Drive technical strategy for ML platform scalability, reliability, and cost efficiency.