Logan Adams
|
6b2365e4fa
Re-enable elastic training for torch 2+ (#4010)
|
1 year ago |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 year ago |
Ma, Guokai
|
98cc35b6a8
Abstract accelerator (step 3) (#2677)
|
1 year ago |
loadams
|
34a11688c4
Change zero_grad() argument to match pytorch (#2741)
|
1 year ago |
JackieWu
|
323c266cfe
[Bug Fixed] use torch.cuda.is_available() (#2661)
|
1 year ago |
Karim Foda
|
735406e536
fix import errors (#2026)
|
2 years ago |
Ammar Ahmad Awan
|
36ad3119d5
DeepSpeed comm backend v1 (#1985)
|
2 years ago |
Jeff Rasley
|
50893458d6
Fairseq support (#1915)
|
2 years ago |
Olatunji Ruwase
|
135a625619
Move param_shapes to model files (#1732)
|
2 years ago |
Alex Hedges
|
4cf970e6bb
Add codespell to pre-commit checks (#1717)
|
2 years ago |
Jeff Rasley
|
3293cf72a0
[ZeRO] Default disable elastic ckpt in stage 1+2 and reduce CPU memory overhead during ckpt load (#1525)
|
2 years ago |
Jeff Rasley
|
e2fdd254ed
Big science related changes (#1407)
|
3 years ago |
Ammar Ahmad Awan
|
f28432441b
DeepSpeed MoE (#1310)
|
3 years ago |
Reza Yazdani
|
ed3de0c21b
Quantization + inference release (#1091)
|
3 years ago |
Conglong Li
|
67a48aaa89
1-bit LAMB optimizer (#970)
|
3 years ago |
Stas Bekman
|
29853c3eed
less scary overflow notice (#833)
|
3 years ago |
Shaden Smith
|
65c2f974d8
Pipeline parallel training engine. (#392)
|
4 years ago |
Jeff Rasley
|
e5bbc2e559
Sparse attn + ops/runtime refactor + v0.3.0 (#343)
|
4 years ago |