Jinxing Pan
|
4f803852ac
Op_builder->is_compatible quite warning (#6093)
|
1 月之前 |
Olatunji Ruwase
|
0f2d485c27
Log operator warnings only in verbose mode (#5917)
|
2 月之前 |
Connor Holmes
|
0a61d5d664
Hybrid Engine Refactor and Llama Inference Support (#3425)
|
1 年之前 |
Michael Wyatt
|
b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
|
1 年之前 |
Jeff Rasley
|
91d63e0228
update formatter version and style settings (#3098)
|
1 年之前 |
Jeff Rasley
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 年之前 |
Connor Holmes
|
b841628207
Drop Maxwell Support (#2574)
|
1 年之前 |
Connor Holmes
|
e7e7595502
Stable Diffusion Enhancements (#2491)
|
1 年之前 |
Connor Holmes
|
c84bca37b1
Memory Access Utility (#2276)
|
2 年之前 |
Arash Bakhtiari
|
8b2a63717a
Add support of OPT models (#2205)
|
2 年之前 |
Jeff Rasley
|
776e36988d
delay torch import for inference compatability check (#2167)
|
2 年之前 |
Jeff Rasley
|
46401b3884
[zero-3] shutdown zero.Init from within ds.init (#2150)
|
2 年之前 |
Jeff Rasley
|
63f470eeb6
prevent cuda 10 builds of inference kernels on ampere (#2157)
|
2 年之前 |
Reza Yazdani
|
8164ea9e6d
Fixing several bugs in the inference-api and the kernels (#1951)
|
2 年之前 |
Jeff Rasley
|
b4fcd98ff0
Inference PP changes for neox (#1899)
|
2 年之前 |
Reza Yazdani
|
d3cad05105
fixing the inference build path when pre-building the inference op (#1755)
|
2 年之前 |
Reza Yazdani
|
289c3f9ba4
GPT-J inference support (#1670)
|
2 年之前 |
Jeff Rasley
|
da1fe2f82c
Remove hard torch dependency at install (#1166)
|
3 年之前 |
eltonzheng
|
71ecf7e625
Add Windows support in README, use c++17 on Windows to support latest VC & cuda build tool (#1151)
|
3 年之前 |
Reza Yazdani
|
ed3de0c21b
Quantization + inference release (#1091)
|
3 年之前 |