Olatunji Ruwase 485a365ffe ZeRO-Offload passing model tests (#374) 4 年之前
..
BingBertSquad 838f53b761 Switches BBS example to use mbsize=3 and gas=2 to fit in 16GB of memory. (#341) 4 年之前
Megatron_GPT2 485a365ffe ZeRO-Offload passing model tests (#374) 4 年之前
run_sanity_check.py 485a365ffe ZeRO-Offload passing model tests (#374) 4 年之前