Reza Yazdani 2afa1c7f2f Communication Optimization for Large-Scale Training (#4695) 11 months ago
..
__init__.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
experts.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
layer.py 2afa1c7f2f Communication Optimization for Large-Scale Training (#4695) 11 months ago
mappings.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago
sharded_moe.py b354c28b76 polishing timers and log_dist (#3996) 1 year ago
utils.py b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) 1 year ago