Reza Yazdani aca7fc549a Add local attention for GPT-Neo model architecture (#1114) 3 years ago
..
context.h ed3de0c21b Quantization + inference release (#1091) 3 years ago
cublas_wrappers.h ed3de0c21b Quantization + inference release (#1091) 3 years ago
custom_cuda_layers.h aca7fc549a Add local attention for GPT-Neo model architecture (#1114) 3 years ago