megatron-deepspeed
Описание
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Языки
Python
- Makefile
- C++
- C
- Cuda
- HTML
- Shell
- Dockerfile
год назад
README.md
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python