Xiaoxia (Shirley) Wu
|
41744d59f8
DeepSpeed-VisualChat Blog (#4446)
|
1 year ago |
Yuxiang Wei
|
986b5958e2
fix: wrong documentation of `ignore_unused_parameters` (#4418)
|
1 year ago |
Logan Adams
|
cd0d2ba2df
Enable ad-hoc running of cpu_inference (#4444)
|
1 year ago |
Logan Adams
|
fd98af256e
Fixup check release version script (#4413)
|
1 year ago |
Jackmin801
|
2c67b58b5f
fix: check-license (#4432)
|
1 year ago |
Jackmin801
|
58a206059f
Small docstring fix (#4431)
|
1 year ago |
Ma, Guokai
|
9a55291452
[CCLBackend] Using parallel memcpy for inference_all_reduce (#4404)
|
1 year ago |
Liangliang-Ma
|
1760627eb9
Zero infinity xpu support (#4130)
|
1 year ago |
Jackmin801
|
2f73b834b5
change default set_to_none in zero_grad methods (#4438)
|
1 year ago |
Logan Adams
|
0636c74c5e
Update cp_inf wokrflow (#4424)
|
1 year ago |
Yejing-Lai
|
7220e7f8f7
fix cpu loading model partition OOM (#4353)
|
1 year ago |
Yejing-Lai
|
388c84834f
add CPU autotp UT (#4263)
|
1 year ago |
Abhishek Jindal
|
28b9d5c231
Add condition when dimension is greater than 2 (#4390)
|
1 year ago |
Logan Adams
|
58619402b5
Update nv-transformers workflow to use cu11.6 (#4412)
|
1 year ago |
Abhishek Jindal
|
e339364127
Add torch no grad condition (#4391)
|
1 year ago |
Conglong Li
|
aea10eec5a
update deepspeed4science blog (#4408)
|
1 year ago |
Ziyang
|
60bf78454c
Fix incorrect assignment of self.quantized_nontrainable_weights (#4399)
|
1 year ago |
Masahiro Tanaka
|
f8d3ec7fa1
save/restore step in param groups with zero 1 (#4396)
|
1 year ago |
Michael Wyatt
|
4c35880b16
Allow multiple inference engines in single script (#4384)
|
1 year ago |
stephen youn
|
0e0748c579
adds triton flash attention2 kernel (#4337)
|
1 year ago |
Elsa Granger
|
4fc2c8e7d5
Fix llama meta tensor loading in AutoTP and kernel injected inference (#3608)
|
1 year ago |
Olatunji Ruwase
|
463dea2722
Fix min torch version (#4375)
|
1 year ago |
Logan Adams
|
17957728c0
Fix multinode runner to properly append to PDSH_SSH_ARGS_APPEND (#4373)
|
1 year ago |
Cheng Li
|
727609df4a
add the missing method (#4363)
|
1 year ago |
cctry
|
c58146471e
Openfold fix (#4368)
|
1 year ago |
Conglong Li
|
a99e5d3fb7
deepspeed4science japanese blog (#4369)
|
1 year ago |
Conglong Li
|
3592a22cfe
deepspeed4science chinese blog (#4366)
|
1 year ago |
Logan Adams
|
dcd3ae1954
Enable workflow dispatch on Torch 1.10 CI tests (#4361)
|
1 year ago |
Logan Adams
|
dcf649c3e0
Update conda env to have max pydantic version (#4362)
|
1 year ago |
Conglong Li
|
da7a1851c4
add deepspeed4science blog link (#4364)
|
1 year ago |