Sam Ade Jacobs 8b191d7ccf Long sequence parallelism (Ulysses) integration with HuggingFace (#5774) 2 months ago
__init__.py a855405e0b DeepSpeed Ulysses release (#4198) 1 year ago
cross_entropy.py 8b191d7ccf Long sequence parallelism (Ulysses) integration with HuggingFace (#5774) 2 months ago
layer.py ffe0af2357 Fix the bug of deepspeed sequence parallel working with batch size larger than 1 (#5823) 2 months ago