The common approach to communicate a large language model's (LLM) uncertainty is to add a percentage number or a hedging word to its response. But is this all we can do? Instead of generating a single answer and then hedging it, an LLM that is fully transparent to the user needs to be able to reflect on its internal belief distribution and output a summary of all options it deems possible, and how likely they are. To test whether LLMs possess this capability, we develop the SelfReflect metric, an information-theoretic distance between a given summary and a distribution over answers. In interventional and human studies, we find that SelfReflect indicates even slight deviations, yielding a fine measure of faithfulness between a summary string and an LLM's actual internal distribution over answers. With SelfReflect, we make a convincing negative observation: modern LLMs are, across the board, incapable of revealing what they are uncertain about, neither via reasoning, nor chains-of-thought, nor explicit finetuning. However, we do find that LLMs are able to generate faithful summaries of their uncertainties if we help them by sampling multiple outputs and feeding them back into the context. This simple approach shines a light on the general way of communicating LLM uncertainties, whose future development the SelfReflect score enables.
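As a rough illustration of the sample-and-summarize idea described above (not the paper's implementation), the sketch below samples several answers from an LLM and feeds them back into the context to request a summary of all options and their likelihoods. The `generate` function is a placeholder for whatever LLM completion call is available; the prompt wording is an assumption for illustration only.

```python
# Illustrative sketch, assuming a generic LLM completion function.
# Step 1: sample multiple answers to expose the model's answer distribution.
# Step 2: feed the sampled answers back and ask for a summary of all options
#         the model deems possible and how likely they are.

def generate(prompt: str, temperature: float = 1.0) -> str:
    # Placeholder: plug in any LLM API call here.
    raise NotImplementedError

def summarize_uncertainty(question: str, n_samples: int = 10) -> str:
    # Sample several answers at non-zero temperature.
    answers = [generate(question, temperature=1.0) for _ in range(n_samples)]

    # Ask the model to summarize its own answer distribution.
    summary_prompt = (
        f"Question: {question}\n"
        "Here are several answers you previously gave:\n"
        + "\n".join(f"- {a}" for a in answers)
        + "\nSummarize all answer options above and how likely each one is."
    )
    return generate(summary_prompt, temperature=0.0)
```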
- † Independent Researcher
- ‡ Tübingen AI Center
