Earlee
|
57a27b0803
add type checker ignore to resolve that pylance can't resolved noqa annotation (#4102)
|
1 year ago |
digger yu
|
ce535945e6
fix: change ==NONE to is (#3923)
|
1 year ago |
Masahiro Tanaka
|
203ac9d7ac
support model declaration in zero.Init context (#3592)
|
1 year ago |
stephen youn
|
69d1b9f978
DeepSpeed-Triton for Inference (#3748)
|
1 year ago |
Michael Wyatt
|
ad168a6954
Fix for dist not being initialized when constructing main config (#3324)
|
1 year ago |
Olatunji Ruwase
|
47f9f13bd3
DeepSpeed Chat (#3186)
|
1 year ago |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 year ago |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 year ago |
Alex Hedges
|
4abf637f96
Remove mutable default parameter in init_inference() (#2540)
|
1 year ago |
Michael Wyatt
|
8b4318b950
Make DS-Inference config readable from JSON (#2537)
|
1 year ago |
Michael Wyatt
|
43bf035cfc
Update docs to autogenerate pydantic config model docs (#2509)
|
1 year ago |
Ammar Ahmad Awan
|
b5d18a6ab3
DeepSpeed inference config. (#2459) (#2472)
|
1 year ago |
Connor Holmes
|
10e9d04c23
Cache Allocation and Softmax Fixes (#2433)
|
2 years ago |
Reza Yazdani
|
afdc72879f
Ds-inference Int8 support through ZeroQuant technology (#2217)
|
2 years ago |
Jeff Rasley
|
46401b3884
[zero-3] shutdown zero.Init from within ds.init (#2150)
|
2 years ago |
Reza Yazdani
|
556f005152
Fix random token-generation issue + MP-checkpoint loading/saving (#2132)
|
2 years ago |
Reza Yazdani
|
aa88137b8d
Add Inference support for running the BigScience-BLOOM Architecture (#2083)
|
2 years ago |
Ammar Ahmad Awan
|
36ad3119d5
DeepSpeed comm backend v1 (#1985)
|
2 years ago |
Reza Yazdani
|
a5adb90d72
Enabling CUDA-graph for the bert-type models (#1952)
|
2 years ago |
Jeff Rasley
|
50893458d6
Fairseq support (#1915)
|
2 years ago |
Jeff Rasley
|
b4fcd98ff0
Inference PP changes for neox (#1899)
|
2 years ago |
Jeff Rasley
|
e46d808a1b
MoE inference + PR-MoE model support (#1705)
|
2 years ago |
Jeff Rasley
|
a8a17f234a
Several fixes for our read-the-docs build (#1579)
|
2 years ago |
Reza Yazdani
|
9ce00a2171
Tensor-Parallelism general support (#1512)
|
2 years ago |
Rana Ali Amjad
|
648f7bfa50
Bfloat16 zero2 (#1398)
|
3 years ago |
Olatunji Ruwase
|
274c375c87
Support Callable type for client optimizer and lr_scheduler (#1316)
|
3 years ago |
Reza Yazdani
|
ed3de0c21b
Quantization + inference release (#1091)
|
3 years ago |
Jeff Rasley
|
cfa63f5dad
ZeRO stage 1 refresh (#1042)
|
3 years ago |
Sean Naren
|
41ab660b5d
Refactor param_dict to config (#1008)
|
3 years ago |
Jeff Rasley
|
871f3048ad
Allow args to be optional in deepspeed.initialize (#825)
|
3 years ago |