Connor Holmes 01080fc30e Merge branch 'master' into lokoppak/ln_schedule_update 1 年之前
..
StopWatch.h 734d8991c8 Transformer kernel release (#242) 4 年之前
Timer.h a10e4811fe force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598) 2 年之前
compat.h 8abdaee243 Add cpu adagrad (#1358) 3 年之前
context.h ee1ffe2e88 CPU-Adam fix for scalar mode (#735) 3 年之前
conversion_utils.h 9aa7b638b7 Kernel Data Conversion Utility (#2327) 2 年之前
cpu_adagrad.h 74493b2bee support CPU Adam and Adagrad on Windows with SDK 10.0.22000 (#1634) 2 年之前
cpu_adam.h a04480e192 Fix the half-precision version of CPU-Adam (#2032) 2 年之前
cublas_wrappers.h c3c8d5dd93 AMD support (#1430) 2 年之前
custom_cuda_layers.h ef869377e9 DeepSpeed Data Efficiency Library (#2585) 1 年之前
dequantization_utils.h 30c8d8a881 Initial dequant library implementation (#2521) 1 年之前
dropout.h a10e4811fe force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598) 2 年之前
ds_kernel_utils.h edd17bdaad format fixes 1 年之前
ds_transformer_cuda.h bc7778ea5b Fix the workspace allocation for the transformer kernel (#1397) 3 年之前
feed_forward.h c3c8d5dd93 AMD support (#1430) 2 年之前
gelu.h a10e4811fe force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598) 2 年之前
gemm_test.h c3c8d5dd93 AMD support (#1430) 2 年之前
general_kernels.h c3c8d5dd93 AMD support (#1430) 2 年之前
memory_access_utils.h be4ffb82ad Reduction Kernel Utility (#2436) 2 年之前
normalize_layer.h a10e4811fe force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598) 2 年之前
quantization.h 30c8d8a881 Initial dequant library implementation (#2521) 1 年之前
quantization_utils.h 30c8d8a881 Initial dequant library implementation (#2521) 1 年之前
quantizer.h ed3de0c21b Quantization + inference release (#1091) 3 年之前
reduction_utils.h 30c8d8a881 Initial dequant library implementation (#2521) 1 年之前
simd.h a04480e192 Fix the half-precision version of CPU-Adam (#2032) 2 年之前
softmax.h a10e4811fe force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598) 2 年之前
strided_batch_gemm.h c3c8d5dd93 AMD support (#1430) 2 年之前
type_shim.h 648f7bfa50 Bfloat16 zero2 (#1398) 3 年之前