Arash Bakhtiari
|
8b2a63717a
Add support of OPT models (#2205)
|
2 年之前 |
Jeff Rasley
|
776e36988d
delay torch import for inference compatability check (#2167)
|
2 年之前 |
Jeff Rasley
|
46401b3884
[zero-3] shutdown zero.Init from within ds.init (#2150)
|
2 年之前 |
Jeff Rasley
|
63f470eeb6
prevent cuda 10 builds of inference kernels on ampere (#2157)
|
2 年之前 |
Reza Yazdani
|
8164ea9e6d
Fixing several bugs in the inference-api and the kernels (#1951)
|
2 年之前 |
Jeff Rasley
|
b4fcd98ff0
Inference PP changes for neox (#1899)
|
2 年之前 |
Reza Yazdani
|
d3cad05105
fixing the inference build path when pre-building the inference op (#1755)
|
2 年之前 |
Reza Yazdani
|
289c3f9ba4
GPT-J inference support (#1670)
|
2 年之前 |
Jeff Rasley
|
da1fe2f82c
Remove hard torch dependency at install (#1166)
|
3 年之前 |
eltonzheng
|
71ecf7e625
Add Windows support in README, use c++17 on Windows to support latest VC & cuda build tool (#1151)
|
3 年之前 |
Reza Yazdani
|
ed3de0c21b
Quantization + inference release (#1091)
|
3 年之前 |