I am a member of technical staff at Microsoft Research. I work on language models (especially the Phi series of language models) and diffusion models theory.

Before, my research was mostly focused on sampling, optimization and proximal methods. Here is a paper at the intersection of these topics.

CV

News

Some papers

LLMs

Diffusion models and sampling

Proximal methods in optimization and sampling