Benchmarking Uncertainty Metrics for LLM Target-Aware Search

Guo, Pei-Fu; Tsai, Yun-Da; Lin, Shou-De

doi:10.18653/v1/2025.findings-emnlp.226

Benchmarking Uncertainty Metrics for LLM Target-Aware Search

Journal

EMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2025

Start Page

4230

End Page

4238

ISBN (of the container)

979-889176335-7

Date Issued

2025-11-04

Author(s)

Guo, Pei-Fu

Tsai, Yun-Da

Lin, Shou-De

DOI

10.18653/v1/2025.findings-emnlp.226

URI

https://www.scopus.com/record/display.uri?eid=2-s2.0-105028964127&origin=resultslist

https://scholars.lib.ntu.edu.tw/handle/123456789/736828

Abstract

LLM search methods, such as Chain of Thought (CoT) and Tree of Thought (ToT), enhance LLM reasoning by exploring multiple reasoning paths. When combined with search algorithms like MCTS and Bandit methods, their effectiveness relies heavily on uncertainty estimation to prioritize paths that align with specific search objectives. However, it remains unclear whether existing LLM uncertainty metrics adequately capture the diverse types of uncertainty required to guide different search objectives. In this work, we introduce a framework for uncertainty benchmarking, identifying four distinct uncertainty types: Answer, Correctness, Aleatoric, and Epistemic Uncertainty. Each type serves different optimization goals in search. Our experiments demonstrate that current metrics often align with only a subset of these uncertainty types, limiting their effectiveness for objective-aligned search in some cases. These findings highlight the need for additional target-aware uncertainty estimators that can adapt to various optimization goals in LLM search. Code is available at link.

Event(s)

30th Conference on Empirical Methods in Natural Language Processing, EMNLP 2025

Publisher

Association for Computational Linguistics

Type

conference paper

Benchmarking Uncertainty Metrics for LLM Target-Aware Search

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)