API Reference

Complete reference for the public classes of the training-gym Python library.

| Class | Description |
| --- | --- |
| ModelConfiguration | Base class for model identity and weight-download logic. |
| HFModelConfiguration | ModelConfiguration for models hosted on HuggingFace. |
| ModelArchitecture | Transformer architecture parameters for a specific model. |
| DatasetConfig | Dataset configuration shared across training frameworks. |
| WandbConfig | Weights & Biases logging configuration shared across all frameworks. |
| ModalRayCluster | Base class for bootstrapping a Ray cluster inside Modal clustered functions. |
| LlmJudge | LLM-as-judge client for an OpenAI-compatible chat-completions endpoint. |
| TrainResult | Checkpoint handle for one completed training run. |

| Class | Description |
| --- | --- |
| Qwen3-4B | Qwen3-4B (4 billion parameters) from Alibaba. |
| Qwen3-32B | Qwen3-32B (32 billion parameters) from Alibaba. |
| GLM-4.7 | GLM-4.7 large MoE model from Zhipu AI. |
| Llama2-7B | Llama 2 7B from Meta. |
| Kimi-K2.5 | Kimi K2.5 from Moonshot AI. |

| Class | Description |
| --- | --- |
| SlimeConfig | slime GRPO training configuration. |
| ModalConfig (slime) | Modal infrastructure configuration for slime: image setup and dev overlays. |
| MsSwiftFrameworkConfig | ms-swift Megatron SFT configuration, including Modal infrastructure. |
| MsSwiftConfig | Top-level wrapper that composes an ms-swift Megatron SFT run. |
| MilesFrameworkConfig | Miles RLVR configuration, including Modal infrastructure. |
| MilesConfig | Top-level wrapper that composes a Miles RLVR training run. |
| HarborFrameworkConfig | Harbor + Miles configuration for sandbox-based RL training. |
| HarborConfig | Top-level wrapper that composes a Harbor + Miles RLVR training run. |