PURE
Collection
PRM and fine-tuned LLM used in our PURE github repo: https://github.com/CJReinforce/PURE • 5 items • Updated • 2
🚨 This repo does not include the Process Reward Model (PRM). For access to the PRM, please refer to here.
This repository hosts a fine-tuned LLM optimized for better mathematical reasoning capabilities via only process rewards.