Conversation

m7md7sien (Contributor) commented on Oct 19, 2025

Description

Update the Groundedness and Relevance evaluator prompts:

  • Groundedness: revise the without_query guidelines
  • Relevance: add multi-turn conversation handling to the prompt (a hypothetical usage sketch follows this list)
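
For orientation, here is a minimal, hypothetical usage sketch of both evaluators under the updated prompts. The model_config values and message contents are placeholders, and the conversation schema is an assumption based on the SDK's documented {"messages": [...]} shape, not something introduced by this PR.

```python
# Hypothetical sketch; model_config values and message contents are placeholders.
from azure.ai.evaluation import GroundednessEvaluator, RelevanceEvaluator

model_config = {
    "azure_endpoint": "https://<your-endpoint>.openai.azure.com",
    "azure_deployment": "<your-deployment>",
    "api_key": "<your-api-key>",
}

# Groundedness without a query: only response and context are supplied, so the
# updated without_query guidelines apply.
groundedness = GroundednessEvaluator(model_config)
groundedness_result = groundedness(
    response="Paris is the capital of France.",
    context="France's capital city is Paris.",
)

# Relevance over a multi-turn conversation: with the updated prompt, earlier
# turns are treated as conversation history when judging the final response.
relevance = RelevanceEvaluator(model_config)
relevance_result = relevance(
    conversation={
        "messages": [
            {"role": "user", "content": "What is the capital of France?"},
            {"role": "assistant", "content": "Paris."},
            {"role": "user", "content": "And roughly how many people live there?"},
            {"role": "assistant", "content": "About two million in the city proper."},
        ]
    }
)
```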

If an SDK is being regenerated based on a new API spec, a link to the pull request containing these API spec changes should be included above.

All SDK Contribution checklist:

  • The pull request does not introduce breaking changes.
  • CHANGELOG is updated for new features, bug fixes or other significant changes.
  • I have read the contribution guidelines.

General Guidelines and Best Practices

  • Title of the pull request is clear and informative.
  • There are a small number of commits, each of which has an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

  • Pull request includes test coverage for the included changes.

m7md7sien requested a review from a team as a code owner (October 19, 2025 17:07)
Copilot AI review requested due to automatic review settings (October 19, 2025 17:07)
github-actions bot added the Evaluation label: Issues related to the client library for Azure AI Evaluation (Oct 19, 2025)

Copilot AI left a comment

Pull Request Overview

Updates the evaluator prompt templates to improve groundedness guidance and add multi-turn conversation handling to relevance scoring. The key changes adjust the relevance prompt to use conversation-history semantics, revise the groundedness rating definitions, update the asset tag, and modify one test.

  • Relevance prompt now references CONVERSATION_HISTORY with multi-turn example
  • Groundedness (without query) prompt definitions and rubric wording revised
  • Test parameter list altered (RelevanceEvaluator removed) and asset tag updated

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated no comments.

Summary per file:

  • sdk/evaluation/azure-ai-evaluation/tests/e2etests/test_builtin_evaluators.py: removes RelevanceEvaluator from a parametrized test case.
  • sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_relevance/relevance.prompty: introduces multi-turn conversation handling and renames the semantic concept from QUERY to CONVERSATION_HISTORY while retaining the input variable name.
  • sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_groundedness/groundedness_without_query.prompty: refactors the groundedness rubric definitions and examples.
  • sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_groundedness/_groundedness.py: adjusts the query presence check to treat None values like missing keys.
  • sdk/evaluation/azure-ai-evaluation/assets.json: updates the asset tag identifier.
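
The _groundedness.py adjustment is easiest to see as a sketch; the function and file names below are illustrative only, not the SDK's actual internals.

```python
# Illustrative sketch of the described behavior: a "query" key whose value is
# None is treated the same as a missing key when deciding which groundedness
# prompt variant to use. Names here are hypothetical, not the SDK's internals.
def _query_present(eval_input: dict) -> bool:
    return eval_input.get("query") is not None


def _select_prompty(eval_input: dict) -> str:
    # The with-query filename is assumed for illustration.
    return (
        "groundedness_with_query.prompty"
        if _query_present(eval_input)
        else "groundedness_without_query.prompty"
    )
```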
Comments suppressed due to low confidence (1)

sdk/evaluation/azure-ai-evaluation/tests/e2etests/test_builtin_evaluators.py:1324

  • Removing RelevanceEvaluator from this parametrized test reduces coverage for protected material JSON evaluation relative to other evaluators. If still supported, keep it in the list or add a dedicated test validating its updated multi-turn prompt behavior.
            RelevanceEvaluator,
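
If RelevanceEvaluator stays out of that parametrized case, a dedicated test could exercise the multi-turn path instead. The sketch below is hypothetical; the model_config fixture, conversation schema, and result key are assumptions rather than the repository's actual test helpers.

```python
# Hypothetical dedicated test for the updated multi-turn relevance prompt.
# The model_config fixture and the "relevance" result key are assumptions.
from azure.ai.evaluation import RelevanceEvaluator


def test_relevance_evaluator_multi_turn_conversation(model_config):
    evaluator = RelevanceEvaluator(model_config)
    result = evaluator(
        conversation={
            "messages": [
                {"role": "user", "content": "Recommend a lightweight laptop."},
                {"role": "assistant", "content": "The Contoso Air 13 weighs about 1 kg."},
                {"role": "user", "content": "How is its battery life?"},
                {"role": "assistant", "content": "Roughly 18 hours of typical use."},
            ]
        }
    )
    # Built-in quality evaluators report a 1-5 score; the exact key is assumed.
    assert 1 <= result["relevance"] <= 5
```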
