grpo-rl-training

Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training

$ 설치

git clone https://github.com/zechenzhangAGI/AI-research-SKILLs /tmp/AI-research-SKILLs && cp -r /tmp/AI-research-SKILLs/06-post-training/grpo-rl-training ~/.claude/skills/AI-research-SKILLs

// tip: Run this command in your terminal to install the skill

Repository

zechenzhangAGI

Author

zechenzhangAGI/AI-research-SKILLs/06-post-training/grpo-rl-training

Stars

Forks

Updated1w ago

Added1w ago

Actions

View on GitHub Download ZIP Report Issue

Related Skills