Training library for Megatron-based models with bidirectional Hugging Face conversion

697 stars
348 forks
Python