
K2-V2: A 360-Open, Reasoning-Enhanced Open Foundation Model


This repository contains the codebase for K2-V2, a fully open-source, reasoning-enhanced foundation model. K2-V2 is designed with a "360-Open" philosophy, providing full transparency across the training pipeline—including data processing, pre-training, mid-training (annealing), and supervised fine-tuning (SFT).

Setup

The core training scripts follow Megatron-LM. We recommend using a Docker container for reproducibility, but a local environment can also be set up by installing recent versions of PyTorch, CUDA, NCCL, and NVIDIA Apex.

We follow the Megatron-LM setup and use NGC's PyTorch container on DGX nodes.
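
For reference, a minimal sketch of pulling and entering an NGC PyTorch container is shown below. The container tag and mount paths are placeholders and should be adapted to your driver version and cluster layout.

```bash
# Pull a recent NGC PyTorch container (pick a tag that matches your driver/CUDA stack).
docker pull nvcr.io/nvidia/pytorch:24.04-py3

# Launch an interactive container with GPU access and this repository mounted.
# The mount paths below are placeholders; adjust them to your environment.
docker run --gpus all -it --rm \
  --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 \
  -v "$PWD":/workspace/k2v2_train \
  -v /path/to/your/data:/data \
  nvcr.io/nvidia/pytorch:24.04-py3
```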

Training Scripts

  • Pre-training

The master pre-training script is located at pre-train/k2v2_70b_400nodes_120bsz.sh. Several data paths are marked inside the script and should be adjusted to your cluster environment, as sketched below.
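
The snippet below only illustrates the general pattern; the environment-variable names are placeholders rather than the script's actual interface, and the paths must point at your own storage.

```bash
# Illustrative only: the exact variable names inside the script may differ.
# Edit the marked data/checkpoint/tokenizer paths before launching.
export DATA_PATH=/data/k2v2/pretrain          # placeholder
export CHECKPOINT_PATH=/checkpoints/k2v2-70b  # placeholder

# On a SLURM cluster the script would typically be submitted as a batch job;
# on a single node it can be invoked directly.
sbatch pre-train/k2v2_70b_400nodes_120bsz.sh
# or: bash pre-train/k2v2_70b_400nodes_120bsz.sh
```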

  • Mid-training

The mid-training script is located in the mid-train directory. It is similar to pre-training, the key difference being the context length, which changes across the four stages. The data can be obtained from TxT360-Midas; the dataset is organized into subsets corresponding to the stages and can be fetched as sketched below.
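
A minimal download sketch using the Hugging Face CLI is shown below. The repository id and the per-stage subset layout are assumptions; check the dataset card for the exact names.

```bash
# Assumed Hugging Face repo id; verify against the dataset card.
huggingface-cli download LLM360/TxT360-Midas \
  --repo-type dataset \
  --local-dir /data/txt360-midas            # placeholder target directory

# To fetch only one stage's subset, an include pattern can restrict the download
# (the subset folder name below is hypothetical):
# huggingface-cli download LLM360/TxT360-Midas --repo-type dataset \
#   --include "stage1/*" --local-dir /data/txt360-midas
```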

  • Training Monitor

We developed a lightweight training monitor for our large-scale training jobs. The monitor is not required for your own runs; it is shared in case it is useful.

  • Supervised Fine-Tuning (SFT)

Instructions are located in the sft directory. The script performs a simple SFT run on the TxT360-3efforts dataset. Since the dataset follows a standard chat-template format, feel free to use any other SFT library you find handy; a download-and-launch sketch is shown below.
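
The commands below are only a sketch: the dataset repository id and the SFT script name are placeholders, so substitute the actual names from this repository and the dataset card.

```bash
# Assumed Hugging Face repo id; see the dataset card for the exact name and splits.
huggingface-cli download LLM360/TxT360-3efforts \
  --repo-type dataset \
  --local-dir /data/txt360-3efforts         # placeholder path

# The SFT entry point lives in the sft/ directory; the script name below is a placeholder.
bash sft/run_sft.sh
```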

Check out the Eval360 repository for our evaluation framework. It is a language-model evaluation workspace built around the LM Evaluation Harness, providing opinionated scripts and automation for benchmarking large checkpoints on reasoning, math, and code suites while coordinating large-cluster workflows (SLURM, Ray, and vLLM). The repository glues together local checkpoints, Hugging Face models, and multi-node serving endpoints to streamline end-to-end evaluation runs.
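
Eval360 ultimately drives the LM Evaluation Harness, so a bare-bones harness invocation looks roughly like the following; the checkpoint path and task list are placeholders, and Eval360's own scripts add the SLURM/Ray/vLLM orchestration on top.

```bash
# Direct LM Evaluation Harness call with the vLLM backend.
# Checkpoint path, parallelism, and tasks are placeholders to adapt.
lm_eval \
  --model vllm \
  --model_args pretrained=/checkpoints/k2v2-70b-sft,tensor_parallel_size=8 \
  --tasks gsm8k,mmlu \
  --batch_size auto \
  --output_path results/k2v2-70b-sft
```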

Citation

@misc{k2team2025k2v2360openreasoningenhancedllm,
      title={K2-V2: A 360-Open, Reasoning-Enhanced LLM}, 
      author={K2 Team and Zhengzhong Liu and Liping Tang and Linghao Jin and Haonan Li and Nikhil Ranjan and Desai Fan and Shaurya Rohatgi and Richard Fan and Omkar Pangarkar and Huijuan Wang and Zhoujun Cheng and Suqi Sun and Seungwook Han and Bowen Tan and Gurpreet Gosal and Xudong Han and Varad Pimpalkhute and Shibo Hao and Ming Shan Hee and Joel Hestness and Haolong Jia and Liqun Ma and Aaryamonvikram Singh and Daria Soboleva and Natalia Vassilieva and Renxi Wang and Yingquan Wu and Yuekai Sun and Taylor Killian and Alexander Moreno and John Maggs and Hector Ren and Guowei He and Hongyi Wang and Xuezhe Ma and Yuqi Wang and Mikhail Yurochkin and Eric P. Xing},
      year={2025},
      eprint={2512.06201},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2512.06201}, 
}
