
arXiv AI Publications - 2025 Week 51

Published at 11:00 AM

Publications of Week #51 - 2025

Here are the five most relevant AI papers from arXiv for week 51 of 2025, complete with analysis and insights.

Publications at a Glance

Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling
Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning
Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers
Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning
Kascade: A Practical Sparse Attention Method for Long-Context LLM Inference

Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling

Published
12/16/2025
arXiv ID
Authors
Annu Rana, Gaurav Kumar

Key Insights

This research introduces the Model-First Reasoning (MFR) paradigm, which has an LLM construct an explicit model of the problem before it begins planning, improving performance on complex multi-step tasks. The authors argue that representational deficiencies are a primary cause of planning failures and show that making the problem model explicit significantly reduces constraint violations and improves solution quality.
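
To make the model-first idea concrete, here is a minimal two-stage sketch in which the LLM is first asked to write an explicit problem model (variables, constraints, objective) and is then asked to plan against that model. The prompts, function names, and staging are illustrative assumptions, not the paper's implementation.

```python
# A minimal sketch of a two-stage, model-first pipeline (illustrative only;
# the prompts and the two-stage split are assumptions, not the paper's code).

def call_llm(prompt: str) -> str:
    """Placeholder for any chat-completion client call."""
    raise NotImplementedError("wire up your LLM client here")

MODEL_PROMPT = (
    "Before solving, write an explicit model of the problem:\n"
    "- variables and their domains\n"
    "- hard constraints\n"
    "- the objective\n\n"
    "Problem:\n{task}"
)

SOLVE_PROMPT = (
    "Using only the problem model below, produce a step-by-step plan and "
    "check every step against the listed constraints before emitting it.\n\n"
    "Problem model:\n{model}\n\nProblem:\n{task}"
)

def model_first_answer(task: str) -> str:
    problem_model = call_llm(MODEL_PROMPT.format(task=task))              # stage 1: model the problem
    return call_llm(SOLVE_PROMPT.format(model=problem_model, task=task))  # stage 2: plan against the model
```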

Potential Impact

The adoption of MFR could transform how LLMs are utilized in various domains requiring intricate planning, such as healthcare and logistics, leading to more reliable and interpretable outcomes. This innovation may pave the way for advanced AI agents that can tackle complex real-world problems with greater accuracy and consistency.


Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

Published
12/18/2025
arXiv ID
Authors
Qihao Liu, Luoxin Ye, Wufei Ma, Yu-Cheng Chou, Alan Yuille

Key Insights

This research introduces the Generative Adversarial Reasoner, a novel framework that enhances the reasoning capabilities of large language models through adversarial reinforcement learning, addressing common errors in logical reasoning and calculations. The method's unique approach of co-evolving a reasoner and a discriminator allows for improved credit assignment and sample efficiency in training, leading to significant performance gains on mathematical benchmarks.
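
As a rough illustration of the reasoner/discriminator interplay, the skeleton below has a reasoner propose step-by-step solutions and a discriminator score each step, with those scores used as dense per-step rewards. All names, the reward wiring, and the update functions are assumptions; the paper's actual objective and training details are not reproduced here.

```python
# Skeleton of an adversarial reasoner/discriminator training round
# (illustrative only; not the paper's objective, reward shaping, or updates).
from typing import Callable, List

def adversarial_round(
    problems: List[str],
    reasoner_generate: Callable[[str], List[str]],     # returns a list of reasoning steps
    discriminator_score: Callable[[str, str], float],  # (problem, step) -> estimated soundness in [0, 1]
    update_reasoner: Callable[[List[tuple]], None],
    update_discriminator: Callable[[List[tuple]], None],
) -> None:
    reasoner_batch, discriminator_batch = [], []
    for problem in problems:
        steps = reasoner_generate(problem)
        # Dense, per-step credit: each step receives its own score from the critic.
        step_rewards = [discriminator_score(problem, step) for step in steps]
        reasoner_batch.append((problem, steps, step_rewards))
        # The discriminator also trains on the generated steps, so both sides co-evolve.
        discriminator_batch.append((problem, steps))
    update_reasoner(reasoner_batch)            # e.g. a policy-gradient update
    update_discriminator(discriminator_batch)  # e.g. a classification-style update
```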

Potential Impact

By improving the reasoning quality of LLMs, this framework could revolutionize applications in fields requiring rigorous logical reasoning, such as automated theorem proving, education, and complex decision-making systems. The modularity of the discriminator also opens avenues for more flexible reward shaping, potentially enhancing LLMs' alignment with human preferences and increasing their usability in diverse contexts.


Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers

Published
12/17/2025
arXiv ID
Authors
Adam Karvonen, James Chua, Clément Dumas, Kit Fraser-Taliente, Subhash Kantamneni, Julian Minder, Euan Ong, Arnab Sen Sharma, Daniel Wen, Owain Evans, Samuel Marks

Key Insights

This research introduces Activation Oracles (AOs), a novel approach that leverages LatentQA to train large language models to interpret their own activations and provide natural language explanations, significantly advancing the understanding of LLM behavior. The study demonstrates that these models can generalize well across diverse training datasets and outperform existing white- and black-box interpretation techniques in several tasks.
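
The general pattern behind activation-based explanation can be sketched as: capture an internal activation with a forward hook and hand it, together with a natural-language question, to an explainer model. The toy below uses a tiny PyTorch module; the prompt format and interface are assumptions, not the paper's training setup.

```python
# Toy illustration of the "explain an activation" pattern (assumed interface;
# not the paper's LatentQA training setup or prompt format).
import torch
import torch.nn as nn

torch.manual_seed(0)

# Stand-in for one network block whose hidden activation we want explained.
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 16))

captured = {}

def hook(module, inputs, output):
    # Save the activation of the layer we care about.
    captured["activation"] = output.detach()

model[1].register_forward_hook(hook)   # hook the hidden layer
_ = model(torch.randn(1, 16))          # a forward pass populates the hook

# An activation-oracle-style explainer would receive the captured vector
# (or a projection of it) alongside a natural-language question.
activation = captured["activation"]
question = "What concept is this activation encoding?"
prompt = f"Activation: {activation.squeeze().tolist()}\nQuestion: {question}\nAnswer:"
print(prompt)  # in the real setup, this would go to an LLM trained as an explainer
```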

Potential Impact

By enabling a more intuitive understanding of LLM activations, Activation Oracles could transform how researchers and practitioners interpret and trust AI systems, potentially leading to improved model design and deployment in critical applications. This innovative method may also drive further research into enhancing LLM transparency and accountability, which is increasingly vital in ethical AI development.


Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning

Published
12/17/2025
arXiv ID
Authors
Jiaqi Xu, Cuiling Lan, Xuejin Chen, Yan LU

Key Insights

The research introduces the Stepwise Think-Critique (STC) framework, which innovatively combines reasoning and self-evaluation in large language models (LLMs) to enhance their critical thinking capabilities. By interleaving these processes within a single model and employing a hybrid reinforcement learning objective, STC improves both the quality of reasoning and the interpretability of the model's outputs.
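
A minimal sketch of the interleaving idea: the model alternates a "thought" step with a "critique" step inside one loop, appending both to a running transcript. The prompts, stopping rule, and function names below are assumptions and do not reflect STC's actual prompting or its hybrid RL objective.

```python
# Minimal sketch of interleaved think/critique steps (illustrative only).
from typing import Callable

def stepwise_think_critique(
    problem: str,
    generate: Callable[[str], str],  # placeholder for a single LLM call
    max_steps: int = 8,
) -> str:
    transcript = f"Problem: {problem}\n"
    for step in range(1, max_steps + 1):
        # One reasoning step, then an immediate self-critique of that step.
        thought = generate(transcript + f"Thought {step}:")
        critique = generate(transcript + f"Thought {step}: {thought}\nCritique:")
        transcript += f"Thought {step}: {thought}\nCritique: {critique}\n"
        if "FINAL ANSWER" in thought:  # assumed stopping convention
            break
    return transcript
```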

Potential Impact

This unified approach could significantly transform how LLMs are utilized in complex problem-solving tasks by enabling them to provide not only solutions but also transparent reasoning processes. The enhanced critical thinking features may lead to more reliable applications in fields requiring high-stakes decision-making, such as healthcare, law, and education, by ensuring that LLM outputs can be better understood and trusted.


Kascade: A Practical Sparse Attention Method for Long-Context LLM Inference

Published
12/18/2025
arXiv ID
Authors
Dhruv Deshmukh, Saurabh Goyal, Nipun Kwatra, Ramachandran Ramjee

Key Insights

Kascade introduces a training-free sparse attention mechanism that capitalizes on the intrinsic sparsity of post-softmax attention and the stability of high-weight keys across layers, enabling efficient computation of Top-k indices. This innovative approach allows for significant speed improvements in long-context LLM inference while maintaining high accuracy, marking a notable advancement in attention mechanisms.
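
The core mechanism, as described above, can be sketched as: compute exact top-k key indices at an anchor layer, then let later layers attend only over those reused indices. The PyTorch sketch below is a simplified illustration, not the Kascade implementation; the index-reuse schedule and any correction steps in the real method are omitted.

```python
# Illustrative top-k sparse attention where the key indices selected at an
# "anchor" layer are reused by later layers (simplified; not Kascade's code).
import torch

def topk_indices(q, k, top_k):
    # Dense scores once, then keep only the strongest keys per query.
    scores = q @ k.transpose(-1, -2) / k.shape[-1] ** 0.5
    return scores.topk(top_k, dim=-1).indices              # [heads, q_len, top_k]

def sparse_attention(q, k, v, idx):
    heads = torch.arange(k.shape[0])[:, None, None]        # broadcastable head index
    k_sel, v_sel = k[heads, idx], v[heads, idx]            # gather only the selected keys/values
    scores = (q.unsqueeze(2) @ k_sel.transpose(-1, -2)).squeeze(2) / k.shape[-1] ** 0.5
    weights = scores.softmax(dim=-1)
    return (weights.unsqueeze(2) @ v_sel).squeeze(2)       # [heads, q_len, d]

# Toy usage: select indices once at an anchor layer, reuse them downstream.
H, L, D, TOP_K = 4, 1024, 64, 64
q0, k0, v0 = torch.randn(H, L, D), torch.randn(H, L, D), torch.randn(H, L, D)
idx = topk_indices(q0, k0, TOP_K)                 # exact selection at the anchor layer
out_anchor = sparse_attention(q0, k0, v0, idx)
q1, k1, v1 = torch.randn(H, L, D), torch.randn(H, L, D), torch.randn(H, L, D)
out_reused = sparse_attention(q1, k1, v1, idx)    # a later layer reuses the same indices
```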

Potential Impact

By dramatically reducing inference latency in long-context models, Kascade could facilitate the deployment of more efficient and responsive AI applications, particularly in areas requiring extensive reasoning and retrieval-augmented generation. This advancement may lead to broader adoption of long-context LLMs in real-time applications, reshaping the landscape of natural language processing and machine learning.



