Ma, Guokai
|
98cc35b6a8
Abstract accelerator (step 3) (#2677)
|
1 年之前 |
Alexander Jipa
|
0f0e38c520
fixes #2498 (#2603)
|
1 年之前 |
Conglong Li
|
ef869377e9
DeepSpeed Data Efficiency Library (#2585)
|
1 年之前 |
Adam Moody
|
b8fb9c3f1a
parallelize writing of layer checkpoint files across data parallel instances (#1419)
|
2 年之前 |
Arpan Jain
|
1ed5aa96a8
Elastic Training support in DeepSpeed (#2153) (#2156)
|
2 年之前 |
trajep
|
e669aaf55b
Trajepl/nebula ckpt engine (#2085)
|
2 年之前 |
Alex Hedges
|
316c4a43e0
Add flake8 to pre-commit checks (#2051)
|
2 年之前 |
Quentin Anthony
|
5349347bb6
DeepSpeed Communication Profiling and Logging (#2012)
|
2 年之前 |
Karim Foda
|
735406e536
fix import errors (#2026)
|
2 年之前 |
Quentin Anthony
|
c87f6ee209
DeepSpeed Monitor Module (Master) (#2013)
|
2 年之前 |
Ammar Ahmad Awan
|
36ad3119d5
DeepSpeed comm backend v1 (#1985)
|
2 年之前 |
Jeff Rasley
|
50893458d6
Fairseq support (#1915)
|
2 年之前 |
Stas Bekman
|
dbeadf16b5
[pipe] prevent deadlock with multiple evals sequence (#1944)
|
2 年之前 |
Zhengqiang Yin
|
a3b90030fd
Fix time error (#1934)
|
2 年之前 |
Jeff Rasley
|
b4fcd98ff0
Inference PP changes for neox (#1899)
|
2 年之前 |
Olatunji Ruwase
|
56c5223868
bf16+pipeline parallelism (#1801)
|
2 年之前 |
Du Li
|
97f8a9eb66
fixing a bf16 support issue (#1760)
|
2 年之前 |
Alex Hedges
|
4cf970e6bb
Add codespell to pre-commit checks (#1717)
|
2 年之前 |
Conglong Li
|
29bee73f03
fix pp (#1474)
|
3 年之前 |
Conglong Li
|
17a479dd8c
fix pipeline engine (#1473)
|
3 年之前 |
Conglong Li
|
cd7967d6b5
fix cl for pp support (#1443)
|
3 年之前 |
Conglong Li
|
fbea7b493f
CL for big science (#1440)
|
3 年之前 |
Thomas Wang
|
9c672783e9
Big science fix passing multiple tensors (#1400)
|
3 年之前 |
Hyunwoong Ko
|
30965ea734
Add flexibility of pipeline parallel module and engine (#1399)
|
3 年之前 |
Jeff Rasley
|
e2fdd254ed
Big science related changes (#1407)
|
3 年之前 |
Reza Yazdani
|
ed3de0c21b
Quantization + inference release (#1091)
|
3 年之前 |
Jeff Rasley
|
cfa63f5dad
ZeRO stage 1 refresh (#1042)
|
3 年之前 |
Conglong Li
|
67a48aaa89
1-bit LAMB optimizer (#970)
|
3 年之前 |
Shaden Smith
|
fbece50b21
assert no Z2/Z3 with pipeline and fix some docs links (#980)
|
3 年之前 |
Shaden Smith
|
5e522efc27
set_batch_fn and remove old sanity check (#712)
|
3 年之前 |