Sam Ade Jacobs 8b191d7ccf Long sequence parallelism (Ulysses) integration with HuggingFace (#5774) | 2 月之前 | |
---|---|---|
.. | ||
__init__.py | a855405e0b DeepSpeed Ulysses release (#4198) | 1 年之前 |
cross_entropy.py | 8b191d7ccf Long sequence parallelism (Ulysses) integration with HuggingFace (#5774) | 2 月之前 |
layer.py | ffe0af2357 Fix the bug of deepspeed sequence parallel working with batch size larger than 1 (#5823) | 2 月之前 |