Reza Yazdani 2c5bba6dc1 Transformer kernel - fix unit test (#1503) 3 年之前
..
inference ee6a92c066 Fixing the transformer APIs to return tuple as the output (if needed) (#1491) 3 年之前
cublas_wrappers.cu c78c29f938 supporting different hidden dimensions (#559) 3 年之前
dropout_kernels.cu e2dfcadf3b Fix the bias-add and add the layer-norm-eps parameter (#791) 3 年之前
ds_transformer_cuda.cpp bc7778ea5b Fix the workspace allocation for the transformer kernel (#1397) 3 年之前
gelu_kernels.cu e721cb691f Supporting different hidden dimensions for transformer kernels-v2 (#934) 3 年之前
general_kernels.cu 937c5ceec1 issue with the implementation of column_sum_reduce (#804) 3 年之前
normalize_kernels.cu 2c5bba6dc1 Transformer kernel - fix unit test (#1503) 3 年之前
softmax_kernels.cu bfe7f0db2a Fix cudaErrorInvalidConfiguration in attn_softmax() for large seq_length*heads values (#1239) 3 年之前
transform_kernels.cu e2dfcadf3b Fix the bias-add and add the layer-norm-eps parameter (#791) 3 年之前