Lexington Whalen

Research Lead, SB Intuitions (SoftBank)  ·  NSF Graduate Research Fellow, Georgia Tech

I am a machine learning researcher specializing in efficient training and inference of large language models. At SoftBank's SB Intuitions, I co-lead pretraining and inference optimization for the Sarashina LLM family — Japan's largest language models. Previously I was at NVIDIA Research, where I co-developed Nemotron-Labs-Diffusion and co-led Efficient-DLM (ICML 2026). I was a NSF Graduate Research Fellow and pursued an M.S. in CS at Georgia Tech.

I am committed to mentoring students, and make time for this every week. Please feel free to message me on LinkedIn and I will try to respond. I don't care about background, and the request can be anything from resume reviews to just general advice. Please note that I cannot help with internship / university / job applications unless I directly was involved in your project.

Lexington Whalen
News
Preprints & Tech Reports
Selected Publications
Patents
Press / Advisement

I am open to calls about ML/AI advising, for companies or governments, and have in the past done chats with journalists and embassy members. These can be formal or informal; if interested, please message me via email or LinkedIn, with the word "Advising" in the title. Thanks!

_ Last updated: 2026-06-25