Lev Kurilenko 3798e60519 Fix Meta Tensor checkpoint load for OPT models (#2990) 1 year ago
..
containers 0acf7e9c48 [RFC] add device abstraction to allow other device than CUDA be used (#2221) 1 year ago
__init__.py da84e60d98 add missing license info to top of all source code (#2889) 1 year ago
auto_tp.py 2ede0d942a AutoTP Assert Kernel Injection Support (#2939) 1 year ago
inject.py da84e60d98 add missing license info to top of all source code (#2889) 1 year ago
layers.py da84e60d98 add missing license info to top of all source code (#2889) 1 year ago
load_checkpoint.py 3798e60519 Fix Meta Tensor checkpoint load for OPT models (#2990) 1 year ago
module_quantize.py da84e60d98 add missing license info to top of all source code (#2889) 1 year ago
policy.py 0acf7e9c48 [RFC] add device abstraction to allow other device than CUDA be used (#2221) 1 year ago
replace_module.py dc01cee5ca using container when loading inference checkpoints (#2875) 1 year ago
replace_policy.py 867da307d0 Inference Refactor (replace_with_policy, model_implementations) (#2554) 1 year ago
utils.py da84e60d98 add missing license info to top of all source code (#2889) 1 year ago