In a Training Loop 🔄
sirynoma
uavleeva
·
AI & ML interests
None yet
Organizations
models 13
uavleeva/grpo_merged_math_sql_code_ties_001
Text Generation • Updated • 1
uavleeva/grpo_mixed_run_002
Updated
uavleeva/grpo_sql_run_005
Updated
uavleeva/grpo_merged_math_sql_code_linear_001
Text Generation • Updated
uavleeva/grpo_code_run_002
Updated
uavleeva/grpo_mixed_run_004
Updated
uavleeva/grpo_math_run_level3_all_rewards_001
Updated
uavleeva/grpo_sql_run_002
Updated
uavleeva/grpo_sql_run_004
Updated
uavleeva/grpo_mixed_run_001
Updated