Olatunji Ruwase
|
cef116f82c
Copy grads to cpu in z1-offload (#1679)
|
2 years ago |
Jeff Rasley
|
c2735996c0
[docs] add logo (#1676)
|
2 years ago |
Conglong Li
|
aca647991f
update results about public Pile dataset (#1675)
|
2 years ago |
Olatunji Ruwase
|
4354c3cc67
Fix largest param numel calculation (#1623)
|
2 years ago |
Victor
|
74493b2bee
support CPU Adam and Adagrad on Windows with SDK 10.0.22000 (#1634)
|
2 years ago |
Jeff Rasley
|
b6f0ac97ae
bump to 0.5.10
|
2 years ago |
Manuel R. Ciosici
|
d0ab722427
Various small documentation text improvements (#1665)
|
2 years ago |
Reza Yazdani
|
559c4ce11a
Convert the fp16_params to group of parameters (#1651)
|
2 years ago |
Manuel R. Ciosici
|
40ce131caa
Replace calls to print() with calls to logger (#1664)
|
2 years ago |
Stas Bekman
|
317400eafc
[save_fp16_model] return status (#1663)
|
2 years ago |
Jeff Rasley
|
d93d924a77
follow-up to #1652, resolved a100-80gb issue (#1655)
|
2 years ago |
Jeff Rasley
|
cbd68dc480
add backup cpu-arch detection if py-cpuinfo fails (#1652)
|
2 years ago |
Conglong Li
|
752319c782
New feature contribution guideline (#1646)
|
2 years ago |
Alex Hedges
|
8bbf081ad8
Add torchvision to requirements-dev.txt (#1642)
|
2 years ago |
Minjia Zhang
|
f2c433f03e
Updating autotuner readme file to add hyperparameter adjustment suggestions (#1641)
|
2 years ago |
Reza Yazdani
|
259936a76c
Fix cpu-adam AVX performance (#1637)
|
2 years ago |
Cheng Li
|
082f392a93
Add tensor methods in flops counting and separate macs and flops (#1591)
|
2 years ago |
Jeff Rasley
|
7f58853c2e
[testing] 3x faster unit tests (#1636)
|
2 years ago |
Jeff Rasley
|
1d295ff5f8
Refactor ZeRO naming to reduce confusion (#1607)
|
2 years ago |
Gary Miguel
|
07887f6630
sharded_moe: make top1gating ONNX-exportable (#1578)
|
2 years ago |
Victor
|
64c2946a23
use py-cpuinfo to detect SIMD_WIDTH in platform-independent way (#1616)
|
2 years ago |
Ammar Ahmad Awan
|
feb6afb049
remove print (#1626)
|
2 years ago |
Conglong Li
|
c6ace162c4
MoE for NLG tutorial (#1633)
|
2 years ago |
Jeff Rasley
|
88d26e0b8d
[docs] update readme (#1632)
|
2 years ago |
Jeff Rasley
|
df9e064d6b
[docs] add MoE NLG announcement to news
|
2 years ago |
Jeff Rasley
|
3ffeaa4999
MoE for NLG announcement (#1628)
|
2 years ago |
Olatunji Ruwase
|
91e15593ea
Control ds_report output (#1622)
|
2 years ago |
Jeff Rasley
|
3488b8cdd3
[readme] remove stats badge until PyPI is fixed
|
2 years ago |
Pierce Stegman
|
cda7c71895
Sparse Attention: Fix Triton errors (#1608)
|
2 years ago |
Jeff Rasley
|
4b854a37cb
[zero-3] set default device during zero.Init (#1605)
|
2 years ago |