Wenhao Hu
|
8abdaee243
Add cpu adagrad (#1358)
|
3 years ago |
Alex Hedges
|
be789b1665
Fix many typos (#1423)
|
3 years ago |
Ammar Ahmad Awan
|
f28432441b
DeepSpeed MoE (#1310)
|
3 years ago |
Reza Yazdani
|
ed3de0c21b
Quantization + inference release (#1091)
|
3 years ago |
Reza Yazdani
|
c78c29f938
supporting different hidden dimensions (#559)
|
3 years ago |
RezaYazdaniAminabadi
|
f0f2a70268
support dynamic sequence length in transformer kernels (#424)
|
4 years ago |
Jeff Rasley
|
41db1c2f03
ZeRO-Offload release (#391)
|
4 years ago |
Jeff Rasley
|
734d8991c8
Transformer kernel release (#242)
|
4 years ago |