17 min read
3
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning (Idea behind OpenAI o1)
Chaojie Wang, Yanchen Deng, Zhiyi Lyu, Liang Zeng, Jujie He, Shuicheng Yan, Bo An The paper introduces Q*, a groundbreaking…