Gen AI for Recommendations
At Spotify, I help bridge research and product on a foundational LLM trained on user listening — mostly by owning evaluation, from offline metrics that reflect a model's behavior across tasks to the A/B tests that put it in front of listeners. I spend my time on semantic IDs, synthetic data, LLM-as-judge, online–offline correlation, and safety.