SOFTWARE ENGINEER, LARGE SCALE PRE-TRAINING PERFORMANCE

Google DeepMind
Full-time
Mountain View, California
$235,000 - $350,000
Posted 5 months ago

Job Description

We are seeking a Software Engineer to redefine efficient training of frontier LLMs at massive scale. This role offers an opportunity to influence the design of frontier LLMs and to drive the effort to ensure efficient training and inference.

Responsibilities

  • Optimize performance of the latest models on hardware accelerators
  • Guide model design to ensure inference efficiency
  • Improve performance of LLMs on hardware accelerators
  • Collaborate with compiler, framework, and platform teams
  • Profile models to identify performance bottlenecks
  • Develop low-level custom kernels
  • Collaborate with research teams

Requirements

  • Proven track record of contributions to distributed training of LLMs
  • Experience in programming hardware accelerators via ML frameworks and low-level programming models
  • Experience in leveraging custom kernels and compiler infrastructure
  • Experience with Python and neural network training

Benefits

  • No benefits