SOFTWARE ENGINEER, LARGE SCALE PRE-TRAINING PERFORMANCE
Google DeepMind
Full-time
Mountain View, California
$235,000 - $350,000
Posted 5 months ago
Job Description
Software Engineer to redefine efficient training of frontier LLMs at massive scale. This role offers an opportunity to influence the design of frontier LLMs and to drive efforts to ensure efficient training and inference.
Responsibilities
Optimize the performance of the latest LLMs on hardware accelerators
Guide model design to ensure inference efficiency
Collaborate with compiler, framework, and platform teams
Profile models to identify performance bottlenecks
Develop low-level custom kernels
Collaborate with research teams
Requirements
Proven track record of contributions to distributed training of LLMs
Experience programming hardware accelerators via ML frameworks and low-level programming models
Experience leveraging custom kernels and compiler infrastructure
Experience with Python and neural network training