I am a member of technical staff at Microsoft Research. Before that I was a Google research fellow at the Simons Institute (UC Berkeley).

I work on language models (especially the Phi series of language models) and diffusion models theory. Before, my research was mostly focused on sampling, optimization and proximal methods. Here is a paper at the intersection of these topics.

CV

News

Some papers

LLMs

Diffusion models and sampling

Proximal methods in optimization and sampling