Skip to content

ReinforcementHyperparameters

Example Usage

typescript
import { ReinforcementHyperparameters } from "@meetkai/mka1/models/components";

let value: ReinforcementHyperparameters = {};

Fields

FieldTypeRequiredDescription
batchSizecomponents.ReinforcementHyperparametersBatchSizeNumber of examples in each batch
computeMultipliercomponents.ComputeMultiplierScales compute for exploration during training
evalIntervalcomponents.EvalIntervalNumber of steps between evaluations
evalSamplescomponents.EvalSamplesNumber of samples generated per eval step
learningRateMultipliercomponents.ReinforcementHyperparametersLearningRateMultiplierScaling factor for learning rate
nEpochscomponents.ReinforcementHyperparametersNEpochsNumber of epochs to train for
reasoningEffortcomponents.ReinforcementHyperparametersReasoningEffortReasoning effort level