Gary Miguel
|
07887f6630
sharded_moe: make top1gating ONNX-exportable (#1578)
|
2 年之前 |
alexandremuzio
|
1bc13fe83f
Removing `ImportError` from tutel import try/except (#1583)
|
2 年之前 |
alexandremuzio
|
2887349cd4
Adding Tutel to MoE layer (#1528)
|
3 年之前 |
Ammar Ahmad Awan
|
56635d5b6c
enable/disable moe token dropping. (#1492)
|
3 年之前 |
Gani Nazirov
|
20bf1cc120
Switch to use or not einsum op. Needed for ORT (#1456)
|
3 年之前 |
Ammar Ahmad Awan
|
1fc74cb9c8
Add basic MoE timing breakdown (#1428)
|
3 年之前 |
Alex Hedges
|
be789b1665
Fix many typos (#1423)
|
3 年之前 |
Ammar Ahmad Awan
|
f28432441b
DeepSpeed MoE (#1310)
|
3 年之前 |