SENIOR PLATFORM ENGINEER I, AI EVALUATION (24-MONTH FIXED-TERM)

Khan Academy
Full-time
Mountain View, CA
$137,871 - $172,339 USD / $186,306 - $232,883 CAD
Posted 5 months ago

Job Description

We’re looking for an AI Platform Engineer to evolve and extend our internal evaluation framework for assessing the quality of our AI-driven experiences. This engineer will have worked with enough eval systems to quickly make sense of Khan Academy’s internal eval framework and recognize opportunities for improvement. You’ll work closely with ML data engineers and platform developers to help internal teams adopt an eval-driven development process.

Responsibilities

  • Be fluent in the range of offline and online evaluation strategies
  • Have intuitions about how to specify eval pipelines succinctly using declarative syntax
  • Understand the role of stratified datasets and ground truth labeling
  • Appreciate the range of eval scoring schemes, from human raters to automated LLM-as-judge scoring

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field
  • 5 years of software engineering experience, including 2+ years working on the evaluation of generative AI systems
  • Strong programming skills in Go, Python, SQL, and at least one data pipeline framework
  • Familiarity with the architecture of large language models and their industry-standard APIs

Benefits

  • No benefits