As an Applied Research intern at Labelbox, you will design, build, and productionize evaluation and post-training systems for frontier LLMs and multimodal models. You’ll own continuous, high-quality evals and benchmarks, create and curate post-training datasets, and prototype training loops to measure and improve real-world task and agent performance.