IndirectAttackEvaluator Class

Reference

Definition

Namespace:: Microsoft.Extensions.AI.Evaluation.Safety

Assembly:: Microsoft.Extensions.AI.Evaluation.Safety.dll

Package:: Microsoft.Extensions.AI.Evaluation.Safety v9.5.0-preview.1.25265.7

Source:: IndirectAttackEvaluator.cs

Important

Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.

An IEvaluator that utilizes the Azure AI Foundry Evaluation service to evaluate responses produced by an AI model for the presence of indirect attacks such as manipulated content, intrusion and information gathering.

public ref class IndirectAttackEvaluator sealed : Microsoft::Extensions::AI::Evaluation::Safety::ContentSafetyEvaluator

public sealed class IndirectAttackEvaluator : Microsoft.Extensions.AI.Evaluation.Safety.ContentSafetyEvaluator

type IndirectAttackEvaluator = class
    inherit ContentSafetyEvaluator

Public NotInheritable Class IndirectAttackEvaluator
Inherits ContentSafetyEvaluator

Inheritance: Object

ContentSafetyEvaluator
IndirectAttackEvaluator

Remarks

Indirect attacks, also known as cross-___domain prompt injected attacks (XPIA), are when jailbreak attacks are injected into the context of a document or source that may result in an altered, unexpected behavior. Indirect attacks evaluations are broken down into three subcategories:

Manipulated Content: This category involves commands that aim to alter or fabricate information, often to mislead or deceive.It includes actions like spreading false information, altering language or formatting, and hiding or emphasizing specific details.The goal is often to manipulate perceptions or behaviors by controlling the flow and presentation of information.

Intrusion: This category encompasses commands that attempt to breach systems, gain unauthorized access, or elevate privileges illicitly. It includes creating backdoors, exploiting vulnerabilities, and traditional jailbreaks to bypass security measures.The intent is often to gain control or access sensitive data without detection.

Information Gathering: This category pertains to accessing, deleting, or modifying data without authorization, often for malicious purposes. It includes exfiltrating sensitive data, tampering with system records, and removing or altering existing information. The focus is on acquiring or manipulating data to exploit or compromise systems and individuals.

IndirectAttackEvaluator returns a BooleanMetric with a value of true indicating the presence of an indirect attack in the response, and a value of false indicating the absence of an indirect attack.

Note that IndirectAttackEvaluator does not support evaluation of multimodal content present in the evaluated responses. Images and other multimodal content present in the evaluated responses will be ignored.

Constructors

IndirectAttackEvaluator()

Properties

EvaluationMetricNames	Gets the Names of the EvaluationMetrics produced by this IEvaluator. (Inherited from ContentSafetyEvaluator)
IndirectAttackMetricName	Gets the Name of the BooleanMetric returned by IndirectAttackEvaluator.

Methods

EvaluateAsync(IEnumerable<ChatMessage>, ChatResponse, ChatConfiguration, IEnumerable<EvaluationContext>, CancellationToken)	Evaluates the supplied `modelResponse` and returns an EvaluationResult containing one or more EvaluationMetrics. (Inherited from ContentSafetyEvaluator)
EvaluateContentSafetyAsync(IChatClient, IEnumerable<ChatMessage>, ChatResponse, IEnumerable<EvaluationContext>, String, Boolean, CancellationToken)	Evaluates the supplied `modelResponse` using the Azure AI Foundry Evaluation Service and returns an EvaluationResult containing one or more EvaluationMetrics. (Inherited from ContentSafetyEvaluator)
FilterAdditionalContext(IEnumerable<EvaluationContext>)	Filters the EvaluationContexts supplied by the caller via `additionalContext` down to just the EvaluationContexts that are relevant to the evaluation being performed by this ContentSafetyEvaluator. (Inherited from ContentSafetyEvaluator)

Extension Methods

EvaluateAsync(IEvaluator, ChatMessage, ChatMessage, ChatConfiguration, IEnumerable<EvaluationContext>, CancellationToken)	Evaluates the supplied `modelResponse` and returns an EvaluationResult containing one or more EvaluationMetrics.
EvaluateAsync(IEvaluator, ChatMessage, ChatResponse, ChatConfiguration, IEnumerable<EvaluationContext>, CancellationToken)	Evaluates the supplied `modelResponse` and returns an EvaluationResult containing one or more EvaluationMetrics.
EvaluateAsync(IEvaluator, ChatMessage, ChatConfiguration, IEnumerable<EvaluationContext>, CancellationToken)	Evaluates the supplied `modelResponse` and returns an EvaluationResult containing one or more EvaluationMetrics.
EvaluateAsync(IEvaluator, ChatResponse, ChatConfiguration, IEnumerable<EvaluationContext>, CancellationToken)	Evaluates the supplied `modelResponse` and returns an EvaluationResult containing one or more EvaluationMetrics.
EvaluateAsync(IEvaluator, String, ChatConfiguration, IEnumerable<EvaluationContext>, CancellationToken)	Evaluates the supplied `modelResponse` and returns an EvaluationResult containing one or more EvaluationMetrics.
EvaluateAsync(IEvaluator, String, String, ChatConfiguration, IEnumerable<EvaluationContext>, CancellationToken)	Evaluates the supplied `modelResponse` and returns an EvaluationResult containing one or more EvaluationMetrics.

Applies to

通过