Masahiro Tanaka
|
2c51aba0b7
Add custom reshaping for universal checkpoint (#5390)
|
6 月之前 |
Masahiro Tanaka
|
c56a4b9e0d
Improve universal checkpoint (#5289)
|
6 月之前 |
Masahiro Tanaka
|
b112c99ea8
Fix loading a universal checkpoint (#5263)
|
7 月之前 |
Sam Ade Jacobs
|
8998707a2f
Universal Checkpoint for Sequence Parallelism (#4752)
|
10 月之前 |
Moshe Island
|
ce5e56a82e
universal-ckp: support megatron-deepspeed llama model (#4666)
|
11 月之前 |
Moshe Island
|
8ad187d84f
Universal ckp fixes (#4588)
|
11 月之前 |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 年之前 |
Olatunji Ruwase
|
799120e7e4
Universal checkpoint for zero stage 1 (#2284)
|
2 年之前 |
Olatunji Ruwase
|
53182531ed
Refactor universal checkpointing and tensor fragments (#2253)
|
2 年之前 |