I am a Ph.D. candidate at UC San Diego, advised by Zhiting Hu. My research is funded by the Bloomberg Fellowship. I was a research scientist intern at Meta FAIR lab, mentored by Yuandong Tian and Jason Weston. I received my B.S. in Computer Science from Peking University. I'm currently looking for a full-time position in industry.

My research focuses on pushing the boundaries of machine reasoning in large language models. My work includes developing latent-space reasoning methods (Coconut, Coconut-Theory, Coconut-Dynamics), training LLMs to reason via reinforcement learning (Guru, OREO), building system-2 inference-time reasoning frameworks (RAP, LLM Reasoners), and finding better ways for LLM agents to interact with the world (ToolkenGPT, CocoaBench).

Bold indicates first or co-first authorship.

News

Selected Publications

COLM 2025 Quanta Magazine
Shibo Hao, Sainbayar Sukhbaatar, DiJia Su, Xian Li, Zhiting Hu, Jason Weston, Yuandong Tian
Conference on Language Models (COLM), 2025
NeurIPS 2025
Hanlin Zhu*, Shibo Hao*, Zhiting Hu, Jiantao Jiao, Stuart Russell, Yuandong Tian
Advances in Neural Information Processing Systems (NeurIPS), 2025
NeurIPS 2025
Zhoujun Cheng*, Shibo Hao*, Tianyang Liu*, Fan Zhou, Yutao Xie, Feng Yao, et al.
Advances in Neural Information Processing Systems (NeurIPS), 2025
Shibo Hao*, Yi Gu*, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu
Empirical Methods in Natural Language Processing (EMNLP), 2023
NeurIPS 2023 Oral Best Paper · SoCalNLP
Shibo Hao, Tianyang Liu, Zhen Wang, Zhiting Hu
Advances in Neural Information Processing Systems (NeurIPS), 2023 · Oral Presentation

* equal contribution  ·  View all publications →