Olatunji Ruwase 485a365ffe ZeRO-Offload passing model tests (#374) 4 years ago
BingBertSquad 838f53b761 Switches BBS example to use mbsize=3 and gas=2 to fit in 16GB of memory. (#341) 4 years ago
Megatron_GPT2 485a365ffe ZeRO-Offload passing model tests (#374) 4 years ago
run_sanity_check.py 485a365ffe ZeRO-Offload passing model tests (#374) 4 years ago