FluencyEvaluator Class
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
An IEvaluator that evaluates the 'Fluency' of a response produced by an AI model.
public ref class FluencyEvaluator sealed : Microsoft::Extensions::AI::Evaluation::IEvaluator
public sealed class FluencyEvaluator : Microsoft.Extensions.AI.Evaluation.IEvaluator
type FluencyEvaluator = class
interface IEvaluator
Public NotInheritable Class FluencyEvaluator
Implements IEvaluator
- Inheritance
-
FluencyEvaluator
- Implements
Remarks
FluencyEvaluator measures the extent to which the response being evaluated is linguistically correct (i.e., conforms to grammatical rules, syntactic structures, and appropriate vocabulary usage). It returns a NumericMetric that contains a score for 'Fluency'. The score is a number between 1 and 5, with 1 indicating a poor score, and 5 indicating an excellent score.
Note: FluencyEvaluator is an AI-based evaluator that uses an AI model to perform its evaluation. While the prompt that this evaluator uses to perform its evaluation is designed to be model-agnostic, the performance of this prompt (and the resulting evaluation) can vary depending on the model used, and can be especially poor when a smaller / local model is used.
The prompt that FluencyEvaluator uses has been tested against (and tuned to work well with) the following models. So, using this evaluator with a model from the following list is likely to produce the best results. (The model to be used can be configured via ChatClient.)
GPT-4o
Constructors
FluencyEvaluator() |
Properties
EvaluationMetricNames |
Gets the Names of the EvaluationMetrics produced by this IEvaluator. |
FluencyMetricName |
Gets the Name of the NumericMetric returned by FluencyEvaluator. |
Methods
EvaluateAsync(IEnumerable<ChatMessage>, ChatResponse, ChatConfiguration, IEnumerable<EvaluationContext>, CancellationToken) |
Evaluates the supplied |