Commit history

Author SHA1 Message Commit date
  Lev Kurilenko 8186da3217 Add megatron_v2 class variable to megatron policy 1 year ago
  Lev Kurilenko 5651f9e3de move _transpose() to base container 1 year ago
  Lev Kurilenko d190f68f00 Update megatron container to work with MoE==1 text-generation example 1 year ago
  Lev Kurilenko 731965db33 Fix MegatronLayerPolicy to have megatron_v2=True (#2579) 1 year ago
  Reza Yazdani 35b350b28c Fix quantized-inference & Add generic support of checkpoint loading (#2547) 1 year ago
  Connor Holmes b841628207 Drop Maxwell Support (#2574) 1 year ago
  Ma, Guokai 06938835eb Support fp32 gradaccum for bf16 model (#2566) 1 year ago
  Hayden 2d8f3f564d Add Determined to open-source DL frameworks (#2573) 1 year ago
  Jeff Rasley d4cab2cec2 use get_global_rank if available (#2567) 1 year ago
  Jeongseok Kang bbe030c5c9 docs: Update the recent url for Megatron-LM (#2564) 1 year ago
  Jeff Rasley c77d42dc7b bump to 0.7.7 1 year ago
  AGUL aeda7f9f8c Fix invalid check of recorded parameter orders in zero stage3. (#2550) 1 year ago
  Ma, Guokai ffcf384673 Abstract accelerator (step 1) (#2504) 1 year ago
  Lev Kurilenko 9b24af9182 TODO comments 1 year ago
  Lev Kurilenko 742334378b Merge branch 'master' into inference-refactor-v1 1 year ago
  Lev Kurilenko 09f37abb66 Clean up 1 year ago
  Lev Kurilenko 9a36258df9 Add ds_megatron_gpt model implementation + make explicit megatron is GPT specific 1 year ago
  Lev Kurilenko 75003bed47 Megatron container working w/ moe=1 1 year ago
  Michael Wyatt c5f85858a8 add missing moe deprecated fields to inference config (#2556) 1 year ago
  Cheng Li abe4fc6b55 encoded ds config into command line argument when launching child processes in autotuning (#2524) 1 year ago
  ShijieZZZZ 340fc0cf19 Report progress at gradient accumulation boundary (#2553) 1 year ago
  Lev Kurilenko 8ca77499c6 Container based OPT is working 1 year ago
  Ammar Ahmad Awan 3f511079cd remove notes. 1 year ago
  Ammar Ahmad Awan 102a1bdb23 Public inference v2 (#387) 1 year ago
  Joe Mayer 21c2802964 Adding Gradient Accumulation Data Type Config (#2512) 1 year ago
  Ammar Ahmad Awan 90ae688442 Pass down the new DS inference config to replace_transformer_layer. (#2539) 1 year ago
  Connor Holmes 5df1eea742 Change Where DS/Triton is Used in Stable Diffusion (#2536) 1 year ago
  Alex Hedges 4abf637f96 Remove mutable default parameter in init_inference() (#2540) 1 year ago
  Michael Wyatt c5ee27f737 Add MII tests (#2533) 1 year ago
  Michael Wyatt 8b4318b950 Make DS-Inference config readable from JSON (#2537) 1 year ago