Reza Yazdani aca7fc549a Add local attention for GPT-Neo model architecture (#1114) | 3 年之前 | |
---|---|---|
.. | ||
context.h | ed3de0c21b Quantization + inference release (#1091) | 3 年之前 |
cublas_wrappers.h | ed3de0c21b Quantization + inference release (#1091) | 3 年之前 |
custom_cuda_layers.h | aca7fc549a Add local attention for GPT-Neo model architecture (#1114) | 3 年之前 |