john li
|
46bb08c2df
Include cublas error details when getting cublas handle fails (#3695)
|
1 year ago |
Olatunji Ruwase
|
47f9f13bd3
DeepSpeed Chat (#3186)
|
1 year ago |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
Jeff Rasley
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
Reza Yazdani
|
ee1ffe2e88
CPU-Adam fix for scalar mode (#735)
|
3 years ago |
Reza Yazdani
|
981bc7d493
Move workspace memory-allocation to PyTorch (#661)
|
3 years ago |
Bruno
|
95575579b3
Use parentesis around min and max to enable Windows build (#449)
|
4 years ago |
RezaYazdaniAminabadi
|
f0f2a70268
support dynamic sequence length in transformer kernels (#424)
|
4 years ago |
RezaYazdaniAminabadi
|
a148bd33d6
Add configurable intermediate size to transformer kernels (#423)
|
4 years ago |
Jeff Rasley
|
734d8991c8
Transformer kernel release (#242)
|
4 years ago |