EvalRowResult

One model interaction: the prompt, the raw response, its parsed structure (thinking / answer / tool calls), a score, and free-form metadata.

from modal_training_gym.common.eval import EvalRowResult

One model interaction: the prompt, the raw response, its parsed structure (thinking / answer / tool calls), a score, and free-form metadata.

Constructor

EvalRowResult(**data)

Parameter	Type	Default	Description

Attribute	Type	Default	Description
`score`	`float`
`prompt`	`str`
`response`	`str`
`parsed_response`	`ParsedResponse \| None`
`metadata`	`dict[str, Any]`

Source: modal_training_gym/common/eval.py