Hallucination

The Hallucination metric allows you to check whether the LLM response contains hallucinated information. To check for hallucination, you need to provide the LLM input, the LLM output, and the context.

How to use the Hallucination metric

You can use the Hallucination metric as follows:

from opik.evaluation.metrics import Hallucination

metric = Hallucination()

metric.score(
    input="What is the capital of France?",
    output="The capital of France is Paris. It is famous for its iconic Eiffel Tower and rich cultural heritage.",
    context=["France is a country in Western Europe. Its capital is Paris, which is known for landmarks like the Eiffel Tower."],
)

Asynchronous scoring is also supported with the ascore method.
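For example, a minimal sketch of asynchronous usage, assuming ascore accepts the same arguments as score and returns the same result object:

import asyncio

from opik.evaluation.metrics import Hallucination

metric = Hallucination()

async def main():
    # ascore mirrors score but can be awaited inside an async application.
    result = await metric.ascore(
        input="What is the capital of France?",
        output="The capital of France is Paris.",
        context=["France is a country in Western Europe. Its capital is Paris."],
    )
    print(result.value)

asyncio.run(main())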

tip

The hallucination score is either 0 or 1. A score of 0 indicates that no hallucinations were detected, while a score of 1 indicates that hallucinations were detected.
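As a sketch of how to interpret the result, assuming the score call returns a result object that exposes value and reason attributes:

from opik.evaluation.metrics import Hallucination

metric = Hallucination()

result = metric.score(
    input="What is the capital of France?",
    output="The capital of France is Lyon.",  # contradicts the context below
    context=["France is a country in Western Europe. Its capital is Paris."],
)

# value is 1.0 when a hallucination is detected and 0.0 when the output is faithful.
print(result.value)
# reason contains the judge model's explanation for the verdict.
print(result.reason)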

Hallucination Prompt

Comet uses an LLM as a Judge to detect hallucinations. A prompt template is used to generate the prompt that is sent to the judge LLM. Today, only the gpt-4-turbo model is used to detect hallucinations.

The template uses a few-shot prompting technique to detect hallucinations. The template is as follows:


Guidelines:
1. The OUTPUT must not introduce new information beyond what's provided in the CONTEXT.
2. The OUTPUT must not contradict any information given in the CONTEXT.
3. Ignore the INPUT when evaluating faithfulness; it's provided for context only.
4. Consider partial hallucinations where some information is correct but other parts are not.
5. Pay close attention to the subject of statements. Ensure that attributes, actions, or dates are correctly associated with the right entities (e.g., a person vs. a TV show they star in).
6. Be vigilant for subtle misattributions or conflations of information, even if the date or other details are correct.
7. Check that the OUTPUT doesn't oversimplify or generalize information in a way that changes its meaning or accuracy.

Verdict options:
- "{FACTUAL_VERDICT}": The OUTPUT is entirely faithful to the CONTEXT.
- "{HALLUCINATION_VERDICT}": The OUTPUT contains hallucinations or unfaithful information.

{examples_str}

INPUT (for context only, not to be used for faithfulness evaluation):
{input}

CONTEXT:
{context}

OUTPUT:
{output}

Provide your verdict in JSON format:
{{
"{VERDICT_KEY}": <your verdict>,
"{REASON_KEY}": [
<list your reasoning as bullet points>
]
}}

where HALLUCINATION_VERDICT is "hallucinated", FACTUAL_VERDICT is "factual", VERDICT_KEY is "verdict", and REASON_KEY is "reason".
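As an illustration only (not the actual Opik implementation), the doubled braces {{ and }} escape the literal JSON braces when the template is treated as a Python format string, and the placeholders are filled in with the values above:

# Illustrative sketch: rendering the JSON portion of the template with str.format.
template = """Provide your verdict in JSON format:
{{
    "{VERDICT_KEY}": <your verdict>,
    "{REASON_KEY}": [
        <list your reasoning as bullet points>
    ]
}}"""

# Substituting the keys produces the literal JSON skeleton the judge model must return.
print(template.format(VERDICT_KEY="verdict", REASON_KEY="reason"))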