Chuhan Zhang

Chuhan Zhang (张楚晗)

Senior Research Scientist @ Google DeepMind

Email · Google Scholar · GitHub · X / Twitter

I am a researcher working on multimodal AI, with interests spanning video understanding, dynamic 3D scene reconstruction, and automatic evaluation pipelines for generative models. I am particularly keen on building models that are simple in design yet achieve strong performance. My recent work focuses on video spatial understanding and situated awareness in Gemini.

I completed my PhD at the Visual Geometry Group (VGG), University of Oxford, advised by Andrew Zisserman. Prior to my PhD, I obtained my MEng in Engineering Science at Exeter College, University of Oxford. Please feel free to reach out by email if you'd like to discuss research or collaborate.

Selected Publications