Olatunji Ruwase
|
db207d872e
Merge branch 'master' into olruwase/restore_from_bit16_weights
|
2 年之前 |
Justin Chiu
|
4912e0ad7e
Various ZeRO Stage3 Optimizations + Improvements (including bfloat16 support) (#1453)
|
2 年之前 |
Olatunji Ruwase
|
6b9fee4629
Fix checkpoint api
|
2 年之前 |
Jeff Rasley
|
2d51f6171b
preserve cuda visible devices order (#1712)
|
2 年之前 |
Reza Yazdani
|
94de0229fb
Fix inference api & add more description on inference engine tutorial (#1711)
|
2 年之前 |
Jeff Rasley
|
2662fded2d
add logo and move news (#1709)
|
2 年之前 |
Ammar Ahmad Awan
|
af074de349
Reorganize MoE news and tutorials. (#1708)
|
2 年之前 |
Reza Yazdani
|
e27a60a879
Add more context for the MoE Inference tutorial (#1707)
|
2 年之前 |
Zhewei Yao
|
53fdadfb9a
pr moe tutorial creation (#1704)
|
2 年之前 |
Reza Yazdani
|
38e16c696d
add moe-inference tutorial (#1706)
|
2 年之前 |
Jeff Rasley
|
e46d808a1b
MoE inference + PR-MoE model support (#1705)
|
2 年之前 |
Jeff Rasley
|
3293cf72a0
[ZeRO] Default disable elastic ckpt in stage 1+2 and reduce CPU memory overhead during ckpt load (#1525)
|
2 年之前 |
Jeff Rasley
|
e4cf40d617
force clear stashed tensors (#1698)
|
2 年之前 |
liamcli
|
fead387f78
support module and no python args for launcher (#1690)
|
2 年之前 |
Jeff Rasley
|
a85dce0728
add -lcurand to fix torch-nightly issue w. JIT (#1688)
|
2 年之前 |
Jeff Rasley
|
3a4cb04243
[docs] switch to transparent dark logo
|
2 年之前 |
Reza Yazdani
|
762e697a03
fix the half-precision version of rotary_pos_emb kernel (#1683)
|
2 年之前 |
Reza Yazdani
|
289c3f9ba4
GPT-J inference support (#1670)
|
2 年之前 |
Jeff Rasley
|
7e857aab9a
[docs] add gh-dark-mode logo
|
2 年之前 |
Jeff Rasley
|
9c5cf3a5d4
[docs] add light-mode logo
|
2 年之前 |
Jeff Rasley
|
2422ec4885
add segfault guard for cpu-adam/adagrad (#1681)
|
2 年之前 |
Olatunji Ruwase
|
cef116f82c
Copy grads to cpu in z1-offload (#1679)
|
2 年之前 |
Jeff Rasley
|
c2735996c0
[docs] add logo (#1676)
|
2 年之前 |
Conglong Li
|
aca647991f
update results about public Pile dataset (#1675)
|
2 年之前 |
Olatunji Ruwase
|
4354c3cc67
Fix largest param numel calculation (#1623)
|
2 年之前 |
Victor
|
74493b2bee
support CPU Adam and Adagrad on Windows with SDK 10.0.22000 (#1634)
|
2 年之前 |
Jeff Rasley
|
b6f0ac97ae
bump to 0.5.10
|
2 年之前 |
Manuel R. Ciosici
|
d0ab722427
Various small documentation text improvements (#1665)
|
2 年之前 |
Reza Yazdani
|
559c4ce11a
Convert the fp16_params to group of parameters (#1651)
|
2 年之前 |
Manuel R. Ciosici
|
40ce131caa
Replace calls to print() with calls to logger (#1664)
|
2 年之前 |