deepspeed

Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention

$ インストール

git clone https://github.com/zechenzhangAGI/AI-research-SKILLs /tmp/AI-research-SKILLs && cp -r /tmp/AI-research-SKILLs/08-distributed-training/deepspeed ~/.claude/skills/AI-research-SKILLs

// tip: Run this command in your terminal to install the skill

Repository

zechenzhangAGI

Author

zechenzhangAGI/AI-research-SKILLs/08-distributed-training/deepspeed

Stars

Forks

Updated6d ago

Added6d ago

Actions

View on GitHub Download ZIP Report Issue

Related Skills