Lev Kurilenko
|
8186da3217
Add megatron_v2 class variable to megatron policy
|
1 year ago |
Lev Kurilenko
|
5651f9e3de
move _transpose() to base container
|
1 year ago |
Lev Kurilenko
|
d190f68f00
Update megatron container to work with MoE==1 text-generation example
|
1 year ago |
Lev Kurilenko
|
731965db33
Fix MegatronLayerPolicy to have megatron_v2=True (#2579)
|
1 year ago |
Reza Yazdani
|
35b350b28c
Fix quantized-inference & Add generic support of checkpoint loading (#2547)
|
1 year ago |
Connor Holmes
|
b841628207
Drop Maxwell Support (#2574)
|
1 year ago |
Ma, Guokai
|
06938835eb
Support fp32 gradaccum for bf16 model (#2566)
|
1 year ago |
Hayden
|
2d8f3f564d
Add Determined to open-source DL frameworks (#2573)
|
1 year ago |
Jeff Rasley
|
d4cab2cec2
use get_global_rank if available (#2567)
|
1 year ago |
Jeongseok Kang
|
bbe030c5c9
docs: Update the recent url for Megatron-LM (#2564)
|
1 year ago |
Jeff Rasley
|
c77d42dc7b
bump to 0.7.7
|
1 year ago |
AGUL
|
aeda7f9f8c
Fix invalid check of recorded parameter orders in zero stage3. (#2550)
|
1 year ago |
Ma, Guokai
|
ffcf384673
Abstract accelerator (step 1) (#2504)
|
1 year ago |
Lev Kurilenko
|
9b24af9182
TODO comments
|
1 year ago |
Lev Kurilenko
|
742334378b
Merge branch 'master' into inference-refactor-v1
|
1 year ago |
Lev Kurilenko
|
09f37abb66
Clean up
|
1 year ago |
Lev Kurilenko
|
9a36258df9
Add ds_megatron_gpt model implementation + make explicit megatron is GPT specific
|
1 year ago |
Lev Kurilenko
|
75003bed47
Megatron container working w/ moe=1
|
1 year ago |
Michael Wyatt
|
c5f85858a8
add missing moe deprecated fields to inference config (#2556)
|
1 year ago |
Cheng Li
|
abe4fc6b55
encoded ds config into command line argument when launching child processes in autotuning (#2524)
|
1 year ago |
ShijieZZZZ
|
340fc0cf19
Report progress at gradient accumulation boundary (#2553)
|
1 year ago |
Lev Kurilenko
|
8ca77499c6
Container based OPT is working
|
1 year ago |
Ammar Ahmad Awan
|
3f511079cd
remove notes.
|
1 year ago |
Ammar Ahmad Awan
|
102a1bdb23
Public inference v2 (#387)
|
1 year ago |
Joe Mayer
|
21c2802964
Adding Gradient Accumulation Data Type Config (#2512)
|
1 year ago |
Ammar Ahmad Awan
|
90ae688442
Pass down the new DS inference config to replace_transformer_layer. (#2539)
|
1 year ago |
Connor Holmes
|
5df1eea742
Change Where DS/Triton is Used in Stable Diffusion (#2536)
|
1 year ago |
Alex Hedges
|
4abf637f96
Remove mutable default parameter in init_inference() (#2540)
|
1 year ago |
Michael Wyatt
|
c5ee27f737
Add MII tests (#2533)
|
1 year ago |
Michael Wyatt
|
8b4318b950
Make DS-Inference config readable from JSON (#2537)
|
1 year ago |