Alejandro Dubrovsky
|
9a8f6a1d3c
Account for expert parameters when calculating the total number of parameters in the model (#3720)
|
1 年之前 |
Conglong Li
|
b692d236d5
add Chinese Zhihu social account (#3755)
|
1 年之前 |
mzl
|
5a5340d03b
remove UtilsBuilder load, use torch (un)flatten ops (#3728)
|
1 年之前 |
Logan Adams
|
cd911f9ab2
Fix output transpose dimension bugs (#3747)
|
1 年之前 |
tensor-tang
|
45466afa34
fix hybrid engine mlp module (#3736)
|
1 年之前 |
john li
|
46bb08c2df
Include cublas error details when getting cublas handle fails (#3695)
|
1 年之前 |
StrayWarrior
|
09332dbf9f
Fix autotuner get_gas_from_user_config (#3664)
|
1 年之前 |
Logan Adams
|
1b40182312
Fix apex install bugs (#3741)
|
1 年之前 |
Joe Mayer
|
6f4fc30b58
FP8 unittest for H100 (#3731)
|
1 年之前 |
Ma, Guokai
|
5289d691d3
Documentation for DeepSpeed Accelerator Abstraction Interface (#3184)
|
1 年之前 |
Jeff Rasley
|
54bd9e290a
bump to 0.9.5
|
1 年之前 |
Logan Adams
|
a65f6b9e9b
Update Dockerfile with newer cuda and torch. (#3716)
|
1 年之前 |
Abhilash Majumder
|
26b3e73298
single node pdsh sigkill (#3730)
|
1 年之前 |
Ma, Guokai
|
8bfbb0e3ca
[Bugfix][CPU] Remove C++ version in CPU OpBuilder (#3643)
|
1 年之前 |
Olatunji Ruwase
|
046afcedb4
Increase tensor creator coverage (#3684)
|
1 年之前 |
Logan Adams
|
fc8e5c8858
Fix typo in name of hybrid engine function (#3704)
|
1 年之前 |
hablb
|
0977106ac9
zero3 performance optimizations (#3622)
|
1 年之前 |
Conglong Li
|
df42509786
DeepSpeed overview in Japanese (#3709)
|
1 年之前 |
john li
|
d414678df7
Small tweak on cuda version mismatch documentation (#3706)
|
1 年之前 |
Michael Wyatt
|
fb2b4ab11a
Fix unit test typo in tests/unit/ops/transformer/inference (#3697)
|
1 年之前 |
digger yu
|
c5edc91ecb
change partititon_name to partition_name (#3700)
|
1 年之前 |
Reza Yazdani
|
34a9fbf1a3
Fix gpt-j inference issue (#3639)
|
1 年之前 |
Logan Adams
|
7e59ef1230
Revert "fix typo name (#3689)" (#3702)
|
1 年之前 |
tensor-tang
|
f2f5f21b52
fix typo name (#3689)
|
1 年之前 |
Logan Adams
|
d8aaa58122
Fix incorrectly formatted f string (#3698)
|
1 年之前 |
Abhilash Majumder
|
c17313fb24
Correct world_size/backend for mpi (#3694)
|
1 年之前 |
Byungsoo Oh
|
b7f463ddeb
Fix local rank mismatch for heterogeneous nodes (#3409)
|
1 年之前 |
Ramya Ramineni
|
4cd0a003f5
non-JIT build fix on ROCm (#3638)
|
1 年之前 |
Siddharth Singh
|
2d737eddcc
Update README to add ICS'23 paper (#3687)
|
1 年之前 |
Olatunji Ruwase
|
e5fe5f65e8
Use logger in accelerator (#3682)
|
1 年之前 |