Reza Yazdani 468882fb68 Add the policy to run llama model from the official repo (#4313) 1 年之前
..
__init__.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 年之前
base.py 1f72082fc0 [CPU] Support Intel CPU inference (#3041) 1 年之前
gelu_gemm.py 69d1b9f978 DeepSpeed-Triton for Inference (#3748) 1 年之前
linear.py 69d1b9f978 DeepSpeed-Triton for Inference (#3748) 1 年之前
mlp_gemm.py 367d6f9cec Support InternLM (#4137) 1 年之前
qkv_gemm.py 367d6f9cec Support InternLM (#4137) 1 年之前
residual_add.py ce535945e6 fix: change ==NONE to is (#3923) 1 年之前
softmax.py 1f72082fc0 [CPU] Support Intel CPU inference (#3041) 1 年之前
softmax_context.py 468882fb68 Add the policy to run llama model from the official repo (#4313) 1 年之前
vector_matmul.py 69d1b9f978 DeepSpeed-Triton for Inference (#3748) 1 年之前