Abhishek Jindal
|
28b9d5c231
Add condition when dimension is greater than 2 (#4390)
|
1 year ago |
Masahiro Tanaka
|
42858a9891
save tensors in context of memory_efficient_linear (#3413)
|
1 year ago |
Olatunji Ruwase
|
47f9f13bd3
DeepSpeed Chat (#3186)
|
1 year ago |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 year ago |
Ma, Guokai
|
090d49e79f
pre-commit check for torch.cuda in code (#2981)
|
1 year ago |
Jeff Rasley
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
Ma, Guokai
|
98cc35b6a8
Abstract accelerator (step 3) (#2677)
|
1 year ago |
Karim Foda
|
735406e536
fix import errors (#2026)
|
2 years ago |
Ammar Ahmad Awan
|
36ad3119d5
DeepSpeed comm backend v1 (#1985)
|
2 years ago |
Jeff Rasley
|
50893458d6
Fairseq support (#1915)
|
2 years ago |
Alex Hedges
|
be789b1665
Fix many typos (#1423)
|
3 years ago |
Olatunji Ruwase
|
e88ebbcfc9
Use amp autocast in ZeRO3 linear (#990)
|
3 years ago |
Jeff Rasley
|
0d4a54a04d
ZeRO-Infinity (#976)
|
3 years ago |
Samyam Rajbhandari
|
b5f56b2c6b
Fix for fragmented linear inputs in ZeRO 3 Linear layers where reshap… (#881)
|
3 years ago |
Samyam Rajbhandari
|
599258f979
ZeRO 3 Offload (#834)
|
3 years ago |