Michael Wyatt | b361c72761 Update DeepSpeed copyright license to Apache 2.0 (#3111) | 1 year ago
Jeff Rasley | 91d63e0228 update formatter version and style settings (#3098) | 1 year ago
Alex Hedges | 316c4a43e0 Add flake8 to pre-commit checks (#2051) | 2 years ago
Alex Hedges | 4cf970e6bb Add codespell to pre-commit checks (#1717) | 2 years ago
Baizhou Huang | 76847f42cf Add warmup_type arguments in WarmupLR and WarmupDecayLR (#1530) | 2 years ago
Anes Benmerzoug | db5d8ba2fd Fix OneCycleLR zero division error (#1498) | 3 years ago
junxu | c64a03db77 Fix docstrings for lr_schedules.py (#1455) | 3 years ago
Alex Hedges | be789b1665 Fix many typos (#1423) | 3 years ago
Stas Bekman | 18a26f3f60 [WarmupDecayLR] fix log(0) & 1/log(1) bugs (#772) | 3 years ago
Olatunji Ruwase | da5563a9c1 LR scheduler unit tests (#429) | 3 years ago
Stas Bekman | 9f8e8f3829 implement missing get_last_lr (#595) | 3 years ago
Olatunji Ruwase | 0e942df008 Add Linear warmup+decay lr schedule (#414) | 4 years ago
Jeff Rasley | e5bbc2e559 Sparse attn + ops/runtime refactor + v0.3.0 (#343) | 4 years ago