Alex Hedges
|
4abf637f96
Remove mutable default parameter in init_inference() (#2540)
|
1 年之前 |
Michael Wyatt
|
8b4318b950
Make DS-Inference config readable from JSON (#2537)
|
1 年之前 |
Michael Wyatt
|
43bf035cfc
Update docs to autogenerate pydantic config model docs (#2509)
|
1 年之前 |
Ammar Ahmad Awan
|
b5d18a6ab3
DeepSpeed inference config. (#2459) (#2472)
|
1 年之前 |
Connor Holmes
|
10e9d04c23
Cache Allocation and Softmax Fixes (#2433)
|
2 年之前 |
Reza Yazdani
|
afdc72879f
Ds-inference Int8 support through ZeroQuant technology (#2217)
|
2 年之前 |
Jeff Rasley
|
46401b3884
[zero-3] shutdown zero.Init from within ds.init (#2150)
|
2 年之前 |
Reza Yazdani
|
556f005152
Fix random token-generation issue + MP-checkpoint loading/saving (#2132)
|
2 年之前 |
Reza Yazdani
|
aa88137b8d
Add Inference support for running the BigScience-BLOOM Architecture (#2083)
|
2 年之前 |
Ammar Ahmad Awan
|
36ad3119d5
DeepSpeed comm backend v1 (#1985)
|
2 年之前 |
Reza Yazdani
|
a5adb90d72
Enabling CUDA-graph for the bert-type models (#1952)
|
2 年之前 |
Jeff Rasley
|
50893458d6
Fairseq support (#1915)
|
2 年之前 |
Jeff Rasley
|
b4fcd98ff0
Inference PP changes for neox (#1899)
|
2 年之前 |
Jeff Rasley
|
e46d808a1b
MoE inference + PR-MoE model support (#1705)
|
2 年之前 |
Jeff Rasley
|
a8a17f234a
Several fixes for our read-the-docs build (#1579)
|
2 年之前 |
Reza Yazdani
|
9ce00a2171
Tensor-Parallelism general support (#1512)
|
2 年之前 |
Rana Ali Amjad
|
648f7bfa50
Bfloat16 zero2 (#1398)
|
3 年之前 |
Olatunji Ruwase
|
274c375c87
Support Callable type for client optimizer and lr_scheduler (#1316)
|
3 年之前 |
Reza Yazdani
|
ed3de0c21b
Quantization + inference release (#1091)
|
3 年之前 |
Jeff Rasley
|
cfa63f5dad
ZeRO stage 1 refresh (#1042)
|
3 年之前 |
Sean Naren
|
41ab660b5d
Refactor param_dict to config (#1008)
|
3 年之前 |
Jeff Rasley
|
871f3048ad
Allow args to be optional in deepspeed.initialize (#825)
|
3 年之前 |
Samyam Rajbhandari
|
599258f979
ZeRO 3 Offload (#834)
|
3 年之前 |
Jeff Rasley
|
f032e56f8a
Validate consistent ckpt tags across ranks (#667)
|
3 年之前 |
Jeff Rasley
|
7435b2f10a
Ability to initialize distributed backend outside deepspeed runtime (#608)
|
3 年之前 |
Jeff Rasley
|
dce054dbba
backwards compatability w. v020 ckpts, fix issue with zero-1 ckpts (#543)
|
3 年之前 |
Jeff Rasley
|
31f46feee2
DeepSpeed JIT op + PyPI support (#496)
|
4 年之前 |
Reza Yazdani
|
f5aa2547d8
Add CPUAdam optimizer for zero-offload in deepspeed engine (#484)
|
4 年之前 |
Shaden Smith
|
65c2f974d8
Pipeline parallel training engine. (#392)
|
4 年之前 |
Jeff Rasley
|
41db1c2f03
ZeRO-Offload release (#391)
|
4 年之前 |