# Introduction
Large language models (LLMs) have a taste for using "flowery", often overly verbose language in their responses. Ask a simple question, and chances are you'll get flooded with paragraphs of overly detailed, enthusiastic, and complex prose. This common behavior is rooted in their training, as they are optimized to be as helpful and conversational as possible.
Unfortunately, verbosity is a serious trait to keep under the radar, and it can be argued to often correlate with increased odds of a major issue: hallucinations. The more words generated in a response, the higher the chances of drifting away from grounded information and venturing into "the art of fabrication".
In short, robust guardrails are needed to prevent this double-sided problem, starting with verbosity checks. This article shows how to use the Textstat Python library to measure readability and detect overly complex responses before they reach the end user, forcing the model to refine its response.
# Setting a Complexity Finances with Textstat
The Textstat Python library can be used to compute scores such as the automated readability index (ARI), which estimates the grade level (level of study) needed to understand a piece of text, such as a model response. If this complexity metric exceeds a budget or threshold (such as 10.0, equivalent to a 10th-grade reading level), a re-prompting loop can be automatically triggered to request a more concise, simpler response. This strategy not only dispels flowery language but may also help reduce hallucination risks, because the model sticks more strictly to core facts as a result.
# Implementing the LangChain Pipeline
Let's see how to implement the strategy described above and integrate it into a LangChain pipeline that can easily be run in a Google Colab notebook. You will need a Hugging Face API token, available for free at https://huggingface.co/settings/tokens. Create a new "secret" named HF_TOKEN in the left-hand side menu of Colab by clicking the "Secrets" icon (it looks like a key). Paste the generated API token into the "Value" field, and you're all set!
To start, install the required libraries:
!pip install textstat langchain_huggingface langchain_community
The following code is Google Colab-specific, and you may need to adjust it accordingly if you are working in a different environment. It focuses on retrieving the saved API token:
from google.colab import userdata

# Obtain the Hugging Face API token saved in your Colab session's Secrets
HF_TOKEN = userdata.get('HF_TOKEN')

# Verify that the token was retrieved
if not HF_TOKEN:
    print("WARNING: The token 'HF_TOKEN' wasn't found. This may cause errors.")
else:
    print("Hugging Face Token loaded successfully.")
In the following piece of code, we perform several actions. First, we set up components for local text generation via a pre-trained Hugging Face model, specifically distilgpt2. After that, the model is integrated into a LangChain pipeline.
import textstat
import torch
from langchain_core.prompts import PromptTemplate
# Import the necessary classes for local Hugging Face pipelines
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
from langchain_community.llms import HuggingFacePipeline

# Initialize a free-tier, local-friendly LLM suitable for text generation
model_id = "distilgpt2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Create a text-generation pipeline
pipe = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
    max_new_tokens=100,
    device=0 if torch.cuda.is_available() else -1  # Use the GPU if available, otherwise fall back to the CPU
)

# Wrap the pipeline in HuggingFacePipeline
llm = HuggingFacePipeline(pipeline=pipe)
Our core mechanism for measuring and managing verbosity is implemented next. The following function generates a summary of the text passed to it (assumed to be an LLM's response) and tries to ensure the summary doesn't exceed a threshold level of complexity. Note that, with a suitable prompt template, generation models like distilgpt2 can be used to obtain text summaries, although the quality of such summaries may not match that of heavier, summarization-focused models. We chose this model due to its reliability for local execution in a constrained environment.
def safe_summarize(text_input, complexity_budget=10.0):
    print("\n--- Starting Summary Process ---")
    print(f"Input text length: {len(text_input)} characters")
    print(f"Target complexity budget (ARI score): {complexity_budget}")

    # Step 1: Initial summary generation
    print("Generating initial comprehensive summary...")
    base_prompt = PromptTemplate.from_template(
        "Provide a comprehensive summary of the following: {text}"
    )
    chain = base_prompt | llm
    summary = chain.invoke({"text": text_input})
    print("Initial Summary generated:")
    print("-------------------------")
    print(summary)
    print("-------------------------")

    # Step 2: Measure readability
    ari_score = textstat.automated_readability_index(summary)
    print(f"Initial ARI Score: {ari_score:.2f}")

    # Step 3: Enforce the complexity budget
    if ari_score > complexity_budget:
        print("Budget exceeded! Initial summary is too complex.")
        print("Triggering simplification guardrail...")
        simplification_prompt = PromptTemplate.from_template(
            "The following text is too verbose. Rewrite it concisely "
            "using simple vocabulary, stripping away flowery language:\n\n{text}"
        )
        simplify_chain = simplification_prompt | llm
        simplified_summary = simplify_chain.invoke({"text": summary})
        new_ari = textstat.automated_readability_index(simplified_summary)
        print("Simplified Summary generated:")
        print("-------------------------")
        print(simplified_summary)
        print("-------------------------")
        print(f"Revised ARI Score: {new_ari:.2f}")
        summary = simplified_summary
    else:
        print("Initial summary is within the complexity budget. No simplification needed.")

    print("--- Summary Process Finished ---")
    return summary
Notice also in the code above that ARI scores are calculated to estimate text complexity.
The final part of the code example tests the function defined previously, passing sample text and a complexity budget of 10.0, and printing the final results.
# 1. Provide some highly verbose, complex sample text
sample_text = """
The inextricably intertwined permutations of cognitive computational arrays within the
realm of Large Language Models often precipitate a cascade of unnecessarily labyrinthine
lexical constructions. This propensity for circumlocution, whilst seemingly indicative of
profound erudition, frequently obfuscates the foundational semantic payload, thereby
rendering the generated discourse significantly less accessible to the quintessential layperson.
"""

# 2. Call the function
print("Running summarizer pipeline...\n")
final_output = safe_summarize(sample_text, complexity_budget=10.0)

# 3. Print the final result
print("\n--- Final Guardrailed Summary ---")
print(final_output)
The resulting printed messages may be quite lengthy, but you will see a slight decrease in the ARI score after calling the pre-trained model for summarization. Don't expect miraculous results, though: the model chosen, while lightweight, is not great at summarizing text, so the ARI score reduction is rather modest. You can try other models like google/flan-t5-small to see how they perform on text summarization, but be warned: these models will be heavier and harder to run.
# Wrapping Up
This article shows how to implement an infrastructure for measuring and controlling overly verbose LLM responses by calling an auxiliary model to summarize them before approving their level of complexity. Hallucinations are a byproduct of high verbosity in many scenarios. While the implementation shown here focuses on assessing verbosity, there are specific checks that can also be used to measure hallucinations, such as semantic consistency checks, natural language inference (NLI) cross-encoders, and LLM-as-a-judge solutions.
Iván Palomares Carrascosa is a leader, writer, speaker, and adviser in AI, machine learning, deep learning & LLMs. He trains and guides others in harnessing AI in the real world.
