Author | SHA1 and commit message | Commit date |
---|---|---|
leiwen83 | 1e0c39c6bf enable pipeline checkpoint loading mode (#3629) | 1 year ago |
Joe Mayer | 8afcda2ac9 ZeRO Gradient Accumulation Dtype. (#2847) | 1 year ago |
Guo Yejun | b4626194e4 zero/mics.py: use on_accelerator instead of cuda only (#3806) | 1 year ago |
Heyang Qin | d18aa2c79c ZeRO++ (#3784) | 1 year ago |
Zhen Zhang | c88af21432 [MiCS] [Fix] saving and loading model checkpoint logic for MiCS sharding (#3440) | 1 year ago |
Zhen Zhang | 2e99f6edf6 [DRAFT] Tentative implementation of MiCS (#2964) | 1 year ago |