
The Forty-Second International Conference on Machine Learning (ICML) kicks off today in Vancouver, Canada. We’re excited to share the work that our group and our collaborating authors will present. You can find links to our ICML 2025 papers below!
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

This work presents ReFocus, a framework that equips multimodal LLMs with the ability to generate “visual thoughts” by editing the input image through code, shifting and refining their visual focus. Through experiments on a wide range of structured image understanding tasks involving tables and charts, we present an in-depth analysis of the effects of different visual edits and of how ReFocus can keep editing the input image until an answer is reached.
Xingyu Fu, Minqian Liu, Zhengyuan Yang, John Corring, Yijuan Lu, Jianwei Yang, Dan Roth, Dinei Florencio, and Cha Zhang, ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding, ICML (2025).
Tuesday, July 15, 4:30 pm PDT — Poster Session 2 West
GIVE: Structured Reasoning of Large Language Models with Knowledge Graph Inspired Veracity Extrapolation

This paper introduces GIVE (Graph Inspired Veracity Extrapolation), a framework that enhances Large Language Models’ reasoning on knowledge-intensive tasks by combining their internal knowledge with minimal, structured cues from an external knowledge graph. The method enables smaller LLMs to match or exceed the performance of larger models on complex scientific reasoning tasks, while also reducing hallucination.
Jiashu He, Mingyu Ma, Jinxuan Fan, Dan Roth, Wei Wang, and Alejandro Ribeiro, GIVE: Structured Reasoning of Large Language Models with Knowledge Graph Inspired Veracity Extrapolation, ICML (2025).
Thursday, July 17, 4:30 pm PDT — Poster Session 6 East
Includes some text generated by AI.