AI & ML interests
None defined yet.
Recent Activity
models 35
AIPlans/Qwen3-0.6B-PPO
Text Generation • 0.6B • Updated • 156 • 1
AIPlans/Qwen3-0.6B-KTO1
Text Generation • 0.8B • Updated • 16
AIPlans/Qwen3-0.6B-ORPO-Crosscoder-MixedDataset
Updated
AIPlans/Qwen3-0.6B-GRPO-Crosscoder-MixedDataset
Updated
AIPlans/Qwen3-0.6B-KTO-Crosscoder-MixedDataset
Updated
AIPlans/Qwen3-0.6B-IPO-Crosscoder-MixedDataset
Updated
AIPlans/Crosscoder_GRPO
Updated
AIPlans/Qwen3-0.6B-ReMax
Reinforcement Learning • 0.6B • Updated • 5 • 2
AIPlans/Qwen3-0.6B-GRPO-RM_NVIDIA
Text Generation • 0.6B • Updated • 1
AIPlans/Qwen3-0.6B-GRPO_Epoch2
Text Generation • 0.6B • Updated • 1
datasets 17
AIPlans/Helpsteer2-helpfulness-prompts
Viewer • Updated • 7.22k • 8
AIPlans/helpsteer2-helpfulness-preference-cleaned
Viewer • Updated • 6.99k • 15
AIPlans/trackio-experiments
Updated • 7
AIPlans/ultrafeedback_binarized_chinese
Viewer • Updated • 14k • 18
AIPlans/ultrafeedback_binarized
Viewer • Updated • 14k • 16
AIPlans/FilteredPKU-SafeRLHF_chinese
Viewer • Updated • 12k • 10
AIPlans/FilteredPKU-SafeRLHF
Viewer • Updated • 12k • 10
AIPlans/SafetyBench_WithLabels_Better_chinese
Viewer • Updated • 546 • 42
AIPlans/SafetyBench_WithLabels
Viewer • Updated • 546 • 29
AIPlans/ToxiGen_chinese
Viewer • Updated • 1k • 15