Author | SHA1 Message | Date |
---|---|---|
leiwen83 | 1e0c39c6bf enable pipeline checkpoint loading mode (#3629) | 1 year ago |
Joe Mayer | 8afcda2ac9 ZeRO Gradient Accumulation Dtype. (#2847) | 1 year ago |
Guo Yejun | b4626194e4 zero/mics.py: use on_accelerator instead of cuda only (#3806) | 1 year ago |
Heyang Qin | d18aa2c79c ZeRO++ (#3784) | 1 year ago |
Zhen Zhang | c88af21432 [MiCS] [Fix] saving and loading model checkpoint logic for MiCS sharding (#3440) | 1 year ago |
Zhen Zhang | 2e99f6edf6 [DRAFT] Tentative implementation of MiCS (#2964) | 1 year ago |