Skip to content

EvalTask

One declarative eval task.

Example Usage

typescript
import { EvalTask } from "@meetkai/mka1/models/components";

let value: EvalTask = {
  id: "<id>",
  type: "math",
  dataset: {},
  promptTemplate: "<value>",
  grader: {
    type: "python",
  },
};

Fields

FieldTypeRequiredDescription
idstring✔️N/A
namestringN/A
typecomponents.EvalTaskType✔️N/A
datasetcomponents.EvalDataset✔️Dataset backing an eval task.
promptTemplatestring✔️N/A
targetTemplatestringN/A
choicesstring[]N/A
outputExtractioncomponents.EvalOutputExtractionN/A
metricsany[]N/A
gradercomponents.EvalPythonGrader✔️N/A
preprocesscomponents.EvalPythonPreprocessorN/A
fewshotcomponents.EvalFewshotConfigN/A
numFewshotnumberlm-eval style alias for fewshot.count.
metadataRecord<string, any>N/A