Reza Yazdani bc7778ea5b Fix the workspace allocation for the transformer kernel (#1397) 3 years ago
..
adam f28432441b DeepSpeed MoE (#1310) 3 years ago
aio be789b1665 Fix many typos (#1423) 3 years ago
includes bc7778ea5b Fix the workspace allocation for the transformer kernel (#1397) 3 years ago
lamb ed3de0c21b Quantization + inference release (#1091) 3 years ago
quantization be789b1665 Fix many typos (#1423) 3 years ago
sparse_attention e5bbc2e559 Sparse attn + ops/runtime refactor + v0.3.0 (#343) 4 years ago
transformer bc7778ea5b Fix the workspace allocation for the transformer kernel (#1397) 3 years ago
utils 31f46feee2 DeepSpeed JIT op + PyPI support (#496) 4 years ago