Commit History

Author | SHA1 | Message | Date
leiwen83 | 1e0c39c6bf | enable pipeline checkpoint loading mode (#3629) | 1 year ago
Joe Mayer | 8afcda2ac9 | ZeRO Gradient Accumulation Dtype. (#2847) | 1 year ago
Guo Yejun | b4626194e4 | zero/mics.py: use on_accelerator instead of cuda only (#3806) | 1 year ago
Heyang Qin | d18aa2c79c | ZeRO++ (#3784) | 1 year ago
Zhen Zhang | c88af21432 | [MiCS] [Fix] saving and loading model checkpoint logic for MiCS sharding (#3440) | 1 year ago
Zhen Zhang | 2e99f6edf6 | [DRAFT] Tentative implementation of MiCS (#2964) | 1 year ago