Sheng Zhang

Microsoft Research

shezhan@microsoft.com

Redmond, Washington

I am a Principal Research Lead at Microsoft Research, where I build foundation models and frontier AI systems for multimodal reasoning and real-world applications.

My work includes foundation models used by millions of people (BiomedCLIP, Curiosity, GigaPath); test-time scaling methods for OpenAI frontier models (MedPrompt); and agent harnesses that extend LLMs to new modalities (Be My Eyes).

I also develop new post-training paradigms and data recipes (LLaVA-Med, OctoMed), and post-train models that address real-world problems frontier LLMs cannot yet solve (UniRG). I am fortunate to work with talented students and collaborators on a range of exciting research directions.

Publications [ Selected (9) | Full (69) | Google Scholar ]

Tutorials

Service