.. |
features
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
__init__.py
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
base.py
|
0acf7e9c48
[RFC] add device abstraction to allow other device than CUDA be used (#2221)
|
1 year ago |
base_moe.py
|
0acf7e9c48
[RFC] add device abstraction to allow other device than CUDA be used (#2221)
|
1 year ago |
bert.py
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
bloom.py
|
dc01cee5ca
using container when loading inference checkpoints (#2875)
|
1 year ago |
clip.py
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
distil_bert.py
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
gpt2.py
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
gptj.py
|
dc01cee5ca
using container when loading inference checkpoints (#2875)
|
1 year ago |
gptneo.py
|
dc01cee5ca
using container when loading inference checkpoints (#2875)
|
1 year ago |
gptneox.py
|
dc01cee5ca
using container when loading inference checkpoints (#2875)
|
1 year ago |
megatron_gpt.py
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
megatron_gpt_moe.py
|
da84e60d98
add missing license info to top of all source code (#2889)
|
1 year ago |
opt.py
|
dc01cee5ca
using container when loading inference checkpoints (#2875)
|
1 year ago |
unet.py
|
87eaf8f99a
Check for local CUDA graphs when enable_cuda_graph=True (#2941)
|
1 year ago |
vae.py
|
87eaf8f99a
Check for local CUDA graphs when enable_cuda_graph=True (#2941)
|
1 year ago |