master
Updated 1 day ago by GitHub
gma/xpu_compile_analysis
Updated 1 week ago by Ma, Guokai
jomayeri/aio-file-offset
Updated 3 days ago by Ubuntu
jomayeri/aio-locked-tensor
Updated 1 week ago by GitHub
jomayeri/aio-mem-fix
Updated 4 days ago by GitHub
jomayeri/deepnvme-perf-debug
Updated 3 weeks ago by jomayeri
jomayeri/lr-step-init
Updated 1 week ago by Ubuntu
jomayeri/lr-step-move
Updated 3 weeks ago by GitHub
jomayeri/swap-with-locked
Updated 6 days ago by Ubuntu
loadams/add-contributing-release-md-files
Updated 2 weeks ago by GitHub
loadams/fix-mpi4py
Updated 3 weeks ago by Logan Adams
loadams/fix-no-torch-failure-mlu
Updated 3 weeks ago by Logan Adams
loadams/fix-triggers-no-torch-workflow
Updated 3 weeks ago by Logan Adams
loadams/transformers-fixes
Updated 3 weeks ago by GitHub
loadams/update-nv-lightning-test-cu-ver
Updated 2 weeks ago by Logan Adams
olruwase/dnvme_docs
Updated 3 weeks ago by GitHub
olruwase/set_zero_opt_grad
Updated 1 week ago by GitHub
olruwase/zero_multi_models
Updated 2 weeks ago by Olatunji Ruwase
tohtana/allocate_test_port
Updated 1 week ago by Masahiro Tanaka
tohtana/autocast_only_floating_values
Updated 1 week ago by Masahiro Tanaka
tohtana/clean_all_param_coordinators
Updated 2 weeks ago by Masahiro Tanaka
tohtana/clean_up_prefetch_param
Updated 1 week ago by GitHub
tohtana/consistent_zero_grad
Updated 3 weeks ago by GitHub
tohtana/debug_semaphore_leak
Updated 1 week ago by GitHub
tohtana/file_store_for_tests
Updated 4 days ago by Masahiro Tanaka
tohtana/get_offload_state_api
Updated 1 week ago by GitHub
tohtana/ignore_reuse_dist_env
Updated 1 week ago by Masahiro Tanaka
tohtana/lock_hf_cache_update
Updated 6 days ago by GitHub
tohtana/log_run_tests
Updated 5 days ago by Masahiro Tanaka
tohtana/offload_zero_buffers
Updated 3 weeks ago by GitHub
tohtana/test_with_pt25
Updated 5 days ago by Masahiro Tanaka
AutoPR/0.12.2
Updated 11 months ago by GitHub
AutoPR/0.14.0
Updated 7 months ago by GitHub
CUDA-Graph-support
Updated 2 years ago by Reza Yazdani
HeyangQin/deepspeed-ulysses-chinese-blog
Updated 1 year ago by GitHub
HeyangQin/enable_hpz_nograd
Updated 1 year ago by HeyangQin
HeyangQin/fix_hpz_nograd
Updated 1 year ago by HeyangQin
HeyangQin/fix_issue_3062
Updated 1 year ago by GitHub
HeyangQin/fix_issue_3068
Updated 1 year ago by GitHub
HeyangQin/fix_issue_3156
Updated 1 year ago by HeyangQin
HeyangQin/fix_issue_5205
Updated 6 months ago by HeyangQin
HeyangQin/fix_pr_3462_standalone
Updated 1 year ago by GitHub
HeyangQin/hpz_convergence
Updated 10 months ago by GitHub
HeyangQin/inference_t5_phase1
Updated 1 year ago by HeyangQin
HeyangQin/mixed_precision_lora_sam
Updated 1 year ago by HeyangQin
HeyangQin/mixz_tutorial
Updated 1 year ago by GitHub
HeyangQin/skip_bias_quant
Updated 1 year ago by HeyangQin
HeyangQin/staging-zero-pp-v1
Updated 1 year ago by HeyangQin
HeyangQin/ucp_blog_chinese
Updated 3 months ago by Heyang Qin
HeyangQin/ulysses_fp8
Updated 6 months ago by GitHub
Megtron-Kernel-Integration
Updated 4 years ago by Reza Yazdani
SA_feature_tag
Updated 4 years ago by arashashari
SA_tutorial_update
Updated 4 years ago by arashashari
SA_update_tutorial_link
Updated 4 years ago by GitHub
add-bfp16-support
Updated 2 years ago by Reza Yazdani
add-comm-layout
Updated 11 months ago by Reza Yazdani
add-inference-comm
Updated 2 years ago by Reza Yazdani
add-llama2-support
Updated 1 year ago by GitHub
add-quantizer
Updated 2 years ago by Reza Yazdani
add-shared-lib
Updated 2 years ago by Reza Yazdani
adk9/phi3-inference
Updated 3 months ago by GitHub
adk9/phi3-small
Updated 3 months ago by GitHub
adk9/update-minor-cuda
Updated 4 months ago by Abhishek Kulkarni
amawa/1-bit-alltoall
Updated 3 years ago by Ammar Ahmad Awan
amawa/1bit-adam-nccl
Updated 3 years ago by Ammar Ahmad Awan
amawa/add-moe-container
Updated 1 year ago by Ammar Ahmad Awan
amawa/aml-get-hosts
Updated 1 year ago by GitHub
amawa/auto-save-ckpt
Updated 3 years ago by Ammar Ahmad Awan
amawa/config-pass-down
Updated 1 year ago by Ammar Ahmad Awan
amawa/debug
Updated 3 years ago by Ammar Ahmad Awan
amawa/fix-amd-rocm
Updated 1 year ago by Ammar Ahmad Awan
amawa/fix-auto-tp-load-ckpt
Updated 1 year ago by Ammar Ahmad Awan
amawa/fix-tracer-zero3
Updated 2 years ago by Ammar Ahmad Awan
amawa/fix-z3-for-hf-accelerate
Updated 1 year ago by Ammar Ahmad Awan
amawa/fix-z3-warn-print-v2
Updated 2 years ago by Ammar Ahmad Awan
amawa/inference-fix
Updated 2 years ago by Ammar Ahmad Awan
amawa/remove-deepcopy
Updated 1 year ago by Jeff Rasley
amawa/split-a2a
Updated 3 years ago by Ammar Ahmad Awan
amawa/zero-inf-refactor
Updated 3 years ago by GitHub
amd-jiting
Updated 1 year ago by GitHub
aml-autotuner
Updated 1 year ago by Cheng Li
arashb-patch-1
Updated 9 months ago by GitHub
arashb/fix-phi-2
Updated 9 months ago by GitHub
arpan/auto-check
Updated 2 years ago by Arpan Jain
autocast-fix
Updated 1 year ago by Jeff Rasley
awan-10-patch-1
Updated 1 year ago by GitHub
awan-10-patch-2
Updated 1 year ago by GitHub
awan-10-patch-3
Updated 1 year ago by GitHub
azure
Updated 2 years ago by Ammar Ahmad Awan
big-science
Updated 3 years ago by Jeff Rasley
big-science-v2
Updated 3 years ago by Jeff Rasley
bing/debugging
Updated 1 year ago by Bing Xie
bing/ds-adam
Updated 1 year ago by Bing Xie
bing/formatting-correction
Updated 1 year ago by Bing Xie
bing/io-tutorial
Updated 1 year ago by GitHub
bing/modify-ds-optimizer
Updated 1 year ago by Bing Xie
bing/optimizer-naming
Updated 1 year ago by GitHub
bloom-debug
Updated 2 years ago by GitHub
chatgpt-chinese-blog
Updated 1 year ago by Ammar Ahmad Awan
check-linear-sizes
Updated 1 year ago by Reza Yazdani
cholmes/activation-utils
Updated 2 years ago by GitHub
cholmes/checkpoints-inference-v2-2
Updated 11 months ago by GitHub
cholmes/fix-asym-quant
Updated 1 year ago by Connor Holmes
cholmes/fix_reduction_utils_amd
Updated 1 year ago by GitHub
cholmes/isolate-src-code
Updated 11 months ago by GitHub
cholmes/kv-cache-flexibility
Updated 11 months ago by Connor Holmes
cholmes/mem-access-predicated-load
Updated 2 years ago by GitHub
cholmes/migrate-to-dequant-lib
Updated 1 year ago by GitHub
cholmes/pipelined-quant
Updated 1 year ago by GitHub
cholmes/reduce-quantized-gpus
Updated 1 year ago by GitHub
cholmes/sd-extension
Updated 1 year ago by Connor Holmes
cholmes/ts-builder
Updated 2 years ago by cmikeh2
cholmes/unique-cuda-graphs
Updated 2 years ago by cmikeh2
ckpt-fix-unfused
Updated 3 years ago by GitHub
clean-llama
Updated 1 year ago by Molly Smith
clean-llama-v2
Updated 1 year ago by Molly Smith
clean-opt
Updated 1 year ago by GitHub
clean-opt-base
Updated 1 year ago by GitHub
clean-opt-v2
Updated 1 year ago by Lev Kurilenko
clean-opt-v2-base
Updated 1 year ago by Ammar Ahmad Awan
codegen-inference
Updated 1 year ago by GitHub
comm-opt2
Updated 10 months ago by Reza Yazani
costineseanu/windows_inference_build
Updated 4 months ago by GitHub
cpu-adam/optional_CUDA-copy
Updated 3 years ago by GitHub
debug-base-attn
Updated 1 year ago by Ammar Ahmad Awan
debug-ds-inf
Updated 1 year ago by Ammar Ahmad Awan
debug-ds-inf-torch-matmul
Updated 1 year ago by Ammar Ahmad Awan
ds-chat-blog-8-31
Updated 1 year ago by GitHub
ds-chat-clean-opt
Updated 1 year ago by Ammar Ahmad Awan
ds-chat-news
Updated 1 year ago by Ammar Ahmad Awan
ds-chat-release
Updated 1 year ago by GitHub
ds-inference/add-falcon-support
Updated 1 year ago by Reza Yazdani
ds-inference/bloom-support-meta
Updated 2 years ago by Jeff Rasley
ds-inference/fix-generation
Updated 1 year ago by GitHub
ds-inference/fix-mp
Updated 2 years ago by GitHub
ds-inference/remove-randgen
Updated 2 years ago by Reza Yazdani
ds-inference/simplify
Updated 2 years ago by GitHub
ds-inference/support-large-token-length
Updated 2 years ago by Reza Yazdani
ds-seq-tutorial
Updated 1 year ago by Ammar Ahmad Awan
ds-vchat-blog-v1
Updated 1 year ago by GitHub
ds-vchat-blog-v2
Updated 1 year ago by GitHub
duli/capability
Updated 6 months ago by GitHub
duli/cuda_op_builder
Updated 4 months ago by Du Li
duli/op_builder
Updated 4 months ago by Du Li
duli/pre_post
Updated 1 year ago by Du Li
duli/zero_debugging
Updated 3 months ago by Du Li
elastic-ckpt-refresh
Updated 2 years ago by Jeff Rasley
elasticity-v2
Updated 3 years ago by Jeff Rasley
eltonz/copy_grad_stream
Updated 3 years ago by Tunji Ruwase
enable-neox
Updated 2 years ago by Jeff Rasley
encoded-ds-config
Updated 1 year ago by GitHub
fairseq-moe
Updated 2 years ago by Ammar Ahmad Awan
fairseq-moe-debug
Updated 2 years ago by Ammar Ahmad Awan
falcon-180b
Updated 1 year ago by Reza Yazdani
fastgen-blog
Updated 11 months ago by GitHub
fastgen-blog-2
Updated 9 months ago by GitHub
features/rebase-quant-fp6
Updated 7 months ago by GitHub
fix-MoQ
Updated 2 years ago by Reza Yazdani
fix-autotuning-docs
Updated 2 years ago by Cheng Li
fix-autotuning-exit
Updated 1 year ago by Cheng Li
fix-autotuning-reqs
Updated 2 years ago by GitHub
fix-flops-profiler
Updated 2 years ago by GitHub
fix-fp16-test
Updated 2 years ago by GitHub
fix-injection
Updated 1 year ago by GitHub
fix-max_train_batch_size
Updated 2 years ago by Cheng Li
fix-misaligned-grad
Updated 3 years ago by Samyam
fix-moe-top1gating
Updated 2 years ago by Reza Yazdani
fix-sp-dense
Updated 1 year ago by GitHub
fix-sparse-attn
Updated 2 years ago by GitHub
fix-tuner-prescale_gradients
Updated 1 year ago by GitHub
fix-tuner-scheduler-bug
Updated 1 year ago by GitHub
fix-twitter
Updated 1 year ago by GitHub
fix-typos
Updated 2 years ago by Cheng Li
fix_mpu_ckpt
Updated 7 months ago by Logan Adams
flash-attention
Updated 2 years ago by Reza Yazdani
flops-profiler-skip-unused-args
Updated 1 year ago by GitHub
fp6-blog
Updated 7 months ago by GitHub
fs-82
Updated 2 years ago by Jeff Rasley
fs-soft-kernel
Updated 2 years ago by Reza Yazdani
fs-z2-fix
Updated 2 years ago by GitHub
fs/soft-kernel
Updated 1 year ago by Reza Yazdani Aminabadi
gcooper/make_optimizer_optional
Updated 3 years ago by Shaden Smith
generic-ckpt-loading
Updated 1 year ago by Reza Yazdani
gh-pages
Updated 4 years ago by Shaden Smith
gh-readonly-queue/master/pr-3852-3491e32d72746ec3d990108a23e67b2666b3e0e0
Updated 1 year ago by GitHub
gh-readonly-queue/master/pr-3852-adb9bc14b780115fd54f3f1234abcb7ab52fa975
Updated 1 year ago by GitHub
gh-readonly-queue/master/pr-3854-85503dab878875175b6d5eb6a39125878c172273
Updated 1 year ago by GitHub
gh-readonly-queue/master/pr-3892-548451ba4e8ea71029d738c33f639e0439aad1dd
Updated 1 year ago by GitHub
gh-readonly-queue/master/pr-3892-9f8817b2425bb82d9b6355caa6d2d0ebd036885d
Updated 1 year ago by GitHub
gh-readonly-queue/master/pr-3893-cc71eec8c85c4437d8139e53372da7f22224fed5
Updated 1 year ago by GitHub
gh-readonly-queue/master/pr-3928-82115d9059ce8271229c8f63153a02f2d323cfc1
Updated 1 year ago by GitHub
gh-readonly-queue/master/pr-4163-5e16eb2c939707d0d0062a458d77998fccb3afad
Updated 1 year ago by GitHub
good-moe
Updated 2 years ago by GitHub
gpt2-debug
Updated 1 year ago by Molly Smith
guanhua/adam-timer
Updated 1 year ago by GuanhuaWang
guanhua/adam-timer2
Updated 1 year ago by GuanhuaWang
guanhua/check-bf16
Updated 6 months ago by GuanhuaWang
guanhua/h2d-offload
Updated 6 months ago by GitHub
guanhua/kernel-test
Updated 2 years ago by GuanhuaWang
guanhua/mics-fix
Updated 10 months ago by GitHub
guanhua/overflow-check
Updated 6 months ago by GitHub
guanhua/quant-dequant-test
Updated 2 years ago by GitHub
guanhua/quant-test
Updated 2 years ago by GitHub
guanhua/rocm-cpu-adam
Updated 1 year ago by GuanhuaWang
guanhua/v14.0-bf16-check
Updated 6 months ago by GuanhuaWang
hf-workaround
Updated 2 years ago by Jeff Rasley
hp-sam
Updated 2 years ago by Sam Ade Jacobs
hpzero-preview
Updated 1 year ago by GitHub
inference-api/tutorial
Updated 2 years ago by Reza Yazdani
inference-read-checkpoint
Updated 2 years ago by Reza Yazdani
inference-refactor-v1-mro-test
Updated 1 year ago by Michael Wyatt
inference/ElutherAI-GPTJ
Updated 3 years ago by Reza Yazdani
inference/TP-general-support
Updated 2 years ago by GitHub
inference/add-bf16-support
Updated 1 year ago by Connor Holmes
inference/engine-api
Updated 2 years ago by Reza Yazdani
inference/fix-masking
Updated 3 years ago by GitHub
inference/fix-mp-init
Updated 3 years ago by GitHub
inference/support-encoder-decoder
Updated 2 years ago by Reza Yazdani
injection-fixes
Updated 1 year ago by Jeff Rasley
jeff-test
Updated 2 years ago by GitHub
jeffra-patch-2
Updated 2 years ago by GitHub
jeffra/1node-launcher-fix
Updated 2 years ago by Jeff Rasley
jeffra/2904
Updated 1 year ago by Jeff Rasley
jeffra/auto-bucket
Updated 2 years ago by Ammar Ahmad Awan
jeffra/available_memory
Updated 1 year ago by Jeff Rasley
jeffra/bf16-updates
Updated 2 years ago by Jeff Rasley
jeffra/bf16-updates-v2
Updated 2 years ago by Jeff Rasley
jeffra/ci-updates
Updated 2 years ago by Jeff Rasley
jeffra/ckpt-barrier
Updated 2 years ago by GitHub
jeffra/docker-update
Updated 3 years ago by GitHub
jeffra/engine-xthru
Updated 3 years ago by Jeff Rasley
jeffra/engine-xthru-v2
Updated 3 years ago by GitHub
jeffra/engine-xthru-v2-no-padding
Updated 2 years ago by GitHub
jeffra/external-skip
Updated 1 year ago by Jeff Rasley
jeffra/fix-1416
Updated 3 years ago by GitHub
jeffra/fs-diverge
Updated 3 years ago by GitHub
jeffra/fs-gas-fix
Updated 2 years ago by Jeff Rasley
jeffra/fs-gas-fix-v2
Updated 2 years ago by Jeff Rasley
jeffra/fs-support
Updated 2 years ago by Jeff Rasley
jeffra/fs-z3
Updated 2 years ago by Jeff Rasley
jeffra/fs-z3-v0510
Updated 2 years ago by Jeff Rasley
jeffra/gptj-fixes
Updated 2 years ago by Jeff Rasley
jeffra/inf-engine-refactor
Updated 11 months ago by Jeff Rasley
jeffra/inf-tests
Updated 2 years ago by GitHub
jeffra/jit-fix
Updated 3 years ago by Jeff Rasley
jeffra/latest-hf
Updated 10 months ago by Logan Adams
jeffra/op-build-api
Updated 1 year ago by GitHub
jeffra/prepost_fwd_and_generate
Updated 1 year ago by Jeff Rasley
jeffra/saksham-zero1-fixes
Updated 3 years ago by Jeff Rasley
jeffra/savepid2
Updated 2 years ago by Jeff Rasley
jeffra/shm-report
Updated 1 year ago by Jeff Rasley
jeffra/staging-comms-logging-v1
Updated 2 years ago by Jeff Rasley
jeffra/turn-on-opt-test
Updated 1 year ago by GitHub
jeffra/update-z3-check
Updated 2 years ago by Jeff Rasley
jeffra/z1-refresh
Updated 3 years ago by Jeff Rasley
jeffra/z1-refresh-2
Updated 3 years ago by Jeff Rasley
jeffra/z1-refresh-3
Updated 3 years ago by Jeff Rasley
jeffra/z3-fix
Updated 2 years ago by GitHub
jeffra/z3-new-param
Updated 2 years ago by GitHub
jeffra/zero-1-fix
Updated 3 years ago by Jeff Rasley
jeffra/zero-1-fix-test
Updated 3 years ago by GitHub
jeffra/zero-ckpt-fixes
Updated 3 years ago by Jeff Rasley
jeffra/zero-moe-noCG
Updated 1 year ago by Jeff Rasley
jeffra/zero1-grad-norm
Updated 3 years ago by Jeff Rasley
jerasley/mac
Updated 1 year ago by GitHub
jomayeri/bf16-zero-check
Updated 1 year ago by GitHub
jomayeri/debug-2361
Updated 2 years ago by GitHub
jomayeri/destroy-zero
Updated 1 year ago by GitHub
jomayeri/fp8-init
Updated 7 months ago by Joe Mayer
jomayeri/h100-unittest
Updated 1 year ago by GitHub
jomayeri/he-mp-assert
Updated 1 year ago by GitHub
jomayeri/issue-3367
Updated 1 year ago by GitHub
jomayeri/issue-3560
Updated 1 year ago by Joe Mayer
jomayeri/issue-3598
Updated 1 year ago by GitHub
jomayeri/issue-3769
Updated 1 year ago by Michael Wyatt
jomayeri/issue-4083
Updated 1 year ago by Joe Mayer
jomayeri/issue-4095
Updated 1 year ago by GitHub
jomayeri/issue-4183
Updated 1 year ago by GitHub
jomayeri/issue-5087
Updated 8 months ago by Joe Mayer
jomayeri/model-param-list
Updated 1 year ago by GitHub
jomayeri/new-zero-accum
Updated 1 year ago by GitHub
jomayeri/zero-grad-accum
Updated 1 year ago by GitHub
kv-cache-reset
Updated 1 year ago by Jeff Rasley
landing-training
Updated 2 years ago by GitHub
landing-updates
Updated 4 years ago by Shaden Smith
lekurile/add_ds_chat_workflow
Updated 1 year ago by Lev Kurilenko
lekurile/add_hip_abstraction
Updated 8 months ago by Lev Kurilenko
lekurile/clean_up_params
Updated 1 year ago by GitHub
lekurile/container_param_cleanup
Updated 1 year ago by Lev Kurilenko
lekurile/ds_chat_attn_mlp_base
Updated 1 year ago by Lev Kurilenko
lekurile/ds_chat_fix_test
Updated 6 months ago by Lev Kurilenko
lekurile/ds_chat_gh_wf
Updated 8 months ago by Lev Kurilenko
lekurile/ds_chat_mlp_debug
Updated 1 year ago by Lev Kurilenko
lekurile/ds_chat_revert_54c06872
Updated 6 months ago by Lev Kurilenko
lekurile/ds_chat_test_54c06872
Updated 6 months ago by GitHub
lekurile/ds_chat_test_7b5b0660
Updated 6 months ago by GitHub
lekurile/ds_chat_test_exit_first
Updated 1 year ago by Lev Kurilenko
lekurile/ds_chat_test_f69f8840
Updated 6 months ago by GitHub
lekurile/fix_ds_chat_bloom
Updated 1 year ago by Lev Kurilenko
lekurile/fix_formatting
Updated 1 year ago by Lev Kurilenko
lekurile/fix_he_print
Updated 10 months ago by Lev Kurilenko
lekurile/fix_issue_2330
Updated 2 years ago by Lev Kurilenko
lekurile/fix_opt_meta_tensor
Updated 1 year ago by Lev Kurilenko
lekurile/fix_phi_2
Updated 9 months ago by Lev Kurilenko
lekurile/fix_sd
Updated 1 year ago by GitHub
lekurile/fix_sd_ci
Updated 9 months ago by Lev Kurilenko
lekurile/fix_unet_vae
Updated 11 months ago by GitHub
lekurile/general_local_cg
Updated 1 year ago by Lev Kurilenko
lekurile/infv2_lm_eval
Updated 8 months ago by Lev Kurilenko
lekurile/kernel_hip_amd
Updated 8 months ago by Lev Kurilenko
lekurile/load_ckpt_inf_eng
Updated 1 year ago by Lev Kurilenko
lekurile/mlp_functions
Updated 1 year ago by Lev Kurilenko
lekurile/offload_fix_test
Updated 5 months ago by Nadav Elyahu
lekurile/sd_min_ver
Updated 8 months ago by Lev Kurilenko
lekurile/test_rearrange_ops
Updated 5 months ago by GitHub
lekurile/update_ds_chat_ci
Updated 11 months ago by GitHub
lekurile/update_ds_chat_ci_2
Updated 5 months ago by GitHub
lekurile/update_ds_chat_ci_test
Updated 11 months ago by Lev Kurilenko
lekurile/update_dschat_wf
Updated 5 months ago by GitHub
lekurile/update_inf_ckpt_load
Updated 1 year ago by Lev Kurilenko
lf-test
Updated 2 years ago by GitHub
loadams/add-gaudi-badge-readme
Updated 7 months ago by Logan Adams
loadams/add-scheduled-open-issue-check-ds-chat
Updated 1 year ago by Logan Adams
loadams/add-torch-2-support
Updated 1 year ago by Logan Adams
loadams/amd-57
Updated 5 months ago by GitHub
loadams/amd-mi200-tests
Updated 1 year ago by GitHub
loadams/amd-pre-compile
Updated 1 year ago by GitHub
loadams/amd-updates
Updated 1 year ago by Logan Adams
loadams/auto-stage3-prefetch-bucket-size
Updated 6 months ago by Logan Adams
loadams/auto-task-open-failure
Updated 1 year ago by Logan Adams
loadams/build-for-cpu
Updated 11 months ago by Logan Adams
loadams/changes-to-op-builder
Updated 1 year ago by Logan Adams
loadams/cpu-inf
Updated 1 year ago by Logan Adams
loadams/cpu-inf-triggers
Updated 9 months ago by GitHub
loadams/cpu-inf-v0-docker
Updated 8 months ago by Logan Adams
loadams/cpu-inference-shorten
Updated 1 year ago by Logan Adams
loadams/cpu-torch
Updated 11 months ago by GitHub
loadams/cu118
Updated 11 months ago by Logan Adams
loadams/debug-torch
Updated 9 months ago by Logan Adams
loadams/disable-h100-ci
Updated 1 year ago by Logan Adams
loadams/disable-windows-ops-build-script
Updated 5 months ago by Logan Adams
loadams/dot-deepspeed_env-test
Updated 1 year ago by Logan Adams
loadams/dpkg-libaio
Updated 1 year ago by Logan Adams
loadams/empty-env-var-setup
Updated 1 year ago by GitHub
loadams/enable-amdmi200
Updated 1 year ago by GitHub
loadams/enable-workflow-dispatch-nv-torch-nightly-v100
Updated 9 months ago by Logan Adams
loadams/engine-pos-args
Updated 6 months ago by Logan Adams
loadams/fix-check-valid-version
Updated 1 year ago by GitHub
loadams/fix-cpu-inf-test-time
Updated 1 year ago by Logan Adams
loadams/fix-cuda-build-ops
Updated 1 year ago by Logan Adams
loadams/fix-fp16-bf16-logging-issue
Updated 1 year ago by Logan Adams
loadams/fix-hpu
Updated 6 months ago by Logan Adams
loadams/fix-lightning-pytorch2
Updated 1 year ago by Logan Adams
loadams/fix-nccl-comm-torch-check
Updated 1 year ago by Logan Adams
loadams/fix-nv-inference
Updated 11 months ago by GitHub
loadams/fix-nv-inference-hang
Updated 8 months ago by Logan Adams
loadams/fix-nv-torch-latest-v100
Updated 3 months ago by Logan Adams
loadams/fix-onebit-skip
Updated 1 year ago by Logan Adams
loadams/fix-torch-2
Updated 1 year ago by Logan Adams
loadams/fix-torch-compiler-hasattr
Updated 8 months ago by Logan Adams
loadams/get-amd-team-ci
Updated 1 year ago by Logan Adams
loadams/gh-cpu-inf
Updated 1 year ago by Logan Adams
loadams/gh-release-version-update
Updated 1 year ago by GitHub
loadams/hf-transformers-ci-fix
Updated 1 year ago by Logan Adams
loadams/hpu-uts
Updated 7 months ago by GitHub
loadams/ignore-unused-params-default
Updated 9 months ago by Logan Adams
loadams/libaio
Updated 1 year ago by GitHub
loadams/low-cpu-mem-ut
Updated 1 year ago by GitHub
loadams/lsb-release
Updated 1 year ago by Logan Adams
loadams/megatron
Updated 1 year ago by Logan Adams
loadams/megatron-lm-112
Updated 1 year ago by Logan Adams
loadams/megatron-new-pypi
Updated 1 year ago by GitHub
loadams/megatron-version
Updated 1 year ago by Logan Adams
loadams/more-torch-2-support
Updated 1 year ago by Logan Adams
loadams/nv-inf-jobs-test
Updated 8 months ago by Logan Adams
loadams/nv-inf-test
Updated 9 months ago by Logan Adams
loadams/nv-inference-revert
Updated 11 months ago by Logan Adams
loadams/nv-nightly
Updated 1 year ago by Logan Adams
loadams/nv-nightly-fix-transformers
Updated 6 months ago by Logan Adams
loadams/nv-sd-badge
Updated 10 months ago by Logan Adams
loadams/openmpi-eth0
Updated 1 year ago by Logan Adams
loadams/pin-torch-latest-ver
Updated 6 months ago by Logan Adams
loadams/py36
Updated 6 months ago by Logan Adams
loadams/pynvml
Updated 5 months ago by GitHub
loadams/recurse-flops-profiler
Updated 11 months ago by GitHub
loadams/reenable-cpu-inference
Updated 11 months ago by Logan Adams
loadams/remove-dead-code
Updated 6 months ago by GitHub
loadams/remove-modeling
Updated 1 year ago by Logan Adams
loadams/remove-python-36-check
Updated 3 months ago by Logan Adams
loadams/rename-fp-quantize-cu
Updated 4 months ago by Logan Adams
loadams/rename-nv-torch-latest-cpu-workflow
Updated 7 months ago by Logan Adams
loadams/revert-4660
Updated 10 months ago by Logan Adams
loadams/revert-5608
Updated 3 months ago by Logan Adams
loadams/revert-cpu-inf
Updated 1 year ago by Logan Adams
loadams/revert-loss
Updated 10 months ago by Logan Adams
loadams/revert-nv-inference-changes
Updated 1 year ago by GitHub
loadams/revert-pr-5608
Updated 3 months ago by Logan Adams
loadams/revert-userwarning
Updated 8 months ago by Logan Adams
loadams/rocm-fixes
Updated 1 year ago by GitHub
loadams/rocm57
Updated 9 months ago by Logan Adams
loadams/rocm6
Updated 9 months ago by GitHub
loadams/sd-paths
Updated 9 months ago by GitHub
loadams/setup-h100-triggers
Updated 7 months ago by GitHub
loadams/sigterm
Updated 1 year ago by GitHub
loadams/skip-nv-inference
Updated 11 months ago by Logan Adams
loadams/sparse-attn-fix
Updated 1 year ago by GitHub
loadams/sparse-attn-torch-2
Updated 1 year ago by Logan Adams
loadams/stablediffusion-test-triton2
Updated 1 year ago by GitHub
loadams/switch-modeling-compression
Updated 1 year ago by Logan Adams
loadams/switch-python-versions
Updated 3 months ago by GitHub
loadams/tar-vuln
Updated 1 year ago by Logan Adams
loadams/test-b421e8c8f31af254b63ad6e9839f617ab6d9c060
Updated 3 months ago by GitHub
loadams/test-compile
Updated 6 months ago by Logan Adams
loadams/test-cpu
Updated 8 months ago by Logan Adams
loadams/test-cpu-inf-fix
Updated 8 months ago by Logan Adams
loadams/test-f0e3f01d7c7a3d8748212e61eaf487fab41168a7
Updated 3 months ago by Logan Adams
loadams/test-fix-nv-inference
Updated 8 months ago by GitHub
loadams/test-glibc228
Updated 3 months ago by Logan Adams
loadams/test-merged-changes
Updated 6 months ago by Logan Adams
loadams/test-model-task
Updated 1 year ago by Logan Adams
loadams/test-nv-ds-chat-failure-mode
Updated 1 year ago by Logan Adams
loadams/test-nv-latest-cpu
Updated 8 months ago by Logan Adams
loadams/test-pytest-ordering
Updated 8 months ago by Logan Adams
loadams/test-runsc
Updated 10 months ago by Logan Adams
loadams/test-torch-2.3.0
Updated 6 months ago by Logan Adams
loadams/torch-cpu-mismatch-cudaopbuilder
Updated 11 months ago by GitHub
loadams/torch-nightly-debug
Updated 1 year ago by GitHub
loadams/transformers-torch
Updated 1 year ago by Logan Adams
loadams/transformers-torch-update
Updated 9 months ago by GitHub
loadams/transformers-workflow-dispatch
Updated 10 months ago by Logan Adams
loadams/try-bump-pydantic
Updated 1 year ago by GitHub
loadams/unpin-nv-torch-latest
Updated 5 months ago by GitHub
loadams/unpin-transformers
Updated 1 year ago by GitHub
loadams/update-2004-checkout-actions
Updated 6 months ago by Logan Adams
loadams/update-accelerate
Updated 10 months ago by Logan Adams
loadams/update-amd-required-paths
Updated 6 months ago by GitHub
loadams/update-conda-pydantic
Updated 1 year ago by Logan Adams
loadams/update-container-a6000
Updated 6 months ago by GitHub
loadams/update-docker
Updated 10 months ago by Logan Adams
loadams/update-dockerfile
Updated 1 year ago by Logan Adams
loadams/update-hpu-docker-container
Updated 5 months ago by Logan Adams
loadams/update-hpu-docker-image
Updated 4 months ago by Logan Adams
loadams/update-nodejs-reate-pr-action
Updated 6 months ago by Logan Adams
loadams/update-nv-accelerate
Updated 8 months ago by GitHub
loadams/update-nv-inference-torch-ver
Updated 8 months ago by GitHub
loadams/update-nv-torch-latest-cpu-torch-ver
Updated 8 months ago by Logan Adams
loadams/update-nv-torch-latest-cpu-version
Updated 8 months ago by Logan Adams
loadams/update-pydantic
Updated 1 year ago by Logan Adams
loadams/update-pytest
Updated 6 months ago by GitHub
loadams/update-pytest-error-codes
Updated 8 months ago by Logan Adams
loadams/update-real-latest
Updated 8 months ago by Logan Adams
loadams/update-sd-triton
Updated 1 year ago by Logan Adams
loadams/update-torch-113
Updated 5 months ago by GitHub
loadams/update-transformers
Updated 10 months ago by GitHub
loadams/update-transformers-cu116
Updated 1 year ago by Logan Adams
loadams/update-version-txt-post-release
Updated 8 months ago by Logan Adams
loadams/update-website-sidebar
Updated 5 months ago by Logan Adams
loadams/x86-accelerator
Updated 8 months ago by Michael Wyatt
loadams/xpu-readme
Updated 6 months ago by Logan Adams
loadams/xpu-test
Updated 6 months ago by Logan Adams
loadams/xpu-yml
Updated 6 months ago by Logan Adams
lokoppak/ln_schedule_update
Updated 1 year ago by GitHub
lokoppak/low_cpu_mem_usage_ut
Updated 1 year ago by Logan Adams
lokoppak/new_pt_binding
Updated 2 years ago by Lok Chand Koppaka
lokoppak/quantization_3d
Updated 1 year ago by GitHub
lokoppak/ref_ln
Updated 1 year ago by Lok Chand Koppaka
lsh
Updated 4 years ago by Elton Zheng
master-test
Updated 2 years ago by GitHub
megatron2.4-3d
Updated 3 years ago by Jeff Rasley
minjiaz/ds-seq-tutorial
Updated 1 year ago by Ammar Ahmad Awan
minjiaz/moe-comm
Updated 1 year ago by Minjia Zhang
minjiaz/moe-sharing
Updated 2 years ago by GitHub
moe-full-tp
Updated 2 years ago by GitHub
moe-inference-tutorial
Updated 2 years ago by GitHub
moe-inference-tutorial1
Updated 2 years ago by Jeff Rasley
moe-inference/add-tutorial
Updated 2 years ago by Jeff Rasley
moe-pipelining
Updated 2 years ago by GitHub
moe-timing
Updated 2 years ago by Siddharth Singh
mosm/autotp-he
Updated 1 year ago by Molly Smith
mosm/autotp_llama
Updated 1 year ago by Molly Smith
mosm/bloom_dev
Updated 1 year ago by Molly Smith
mosm/codegen
Updated 1 year ago by Molly Smith
mosm/debug-ds-attn
Updated 1 year ago by Ammar Ahmad Awan
mosm/debugger
Updated 1 year ago by GitHub
mosm/dschat-news
Updated 1 year ago by Molly Smith
mosm/inf-refactor
Updated 1 year ago by Molly Smith
mosm/llama2
Updated 1 year ago by GitHub
mosm/matmul_test
Updated 1 year ago by Molly Smith
mosm/module_parser
Updated 1 year ago by molly-smith
mosm/mp_tutorial
Updated 1 year ago by molly-smith
mosm/opt-kernel
Updated 1 year ago by Molly Smith
mosm/softmax
Updated 1 year ago by GitHub
mosm/softmax-longseq
Updated 1 year ago by Molly Smith
mosm/t5
Updated 1 year ago by Molly Smith
mosm/test
Updated 2 years ago by GitHub
mosm/tp_dev
Updated 1 year ago by molly-smith
mosm/wb-param
Updated 1 year ago by Molly Smith
mrwyattii/expand-fp16-tests
Updated 1 year ago by Michael Wyatt
mrwyattii/fix-for-mii-UT
Updated 11 months ago by Michael Wyatt
mrwyattii/fix-inference-skipped-tests
Updated 1 year ago by Michael Wyatt
mrwyattii/fix-launcher-user-args
Updated 1 year ago by Michael Wyatt
mrwyattii/fix-multi-node-checks
Updated 1 year ago by Michael Wyatt
mrwyattii/pin-datasets
Updated 1 year ago by Michael Wyatt
mrwyattii/remove-symlinks
Updated 1 year ago by Michael Wyatt
mrwyattii/rename-cpu-accelerator
Updated 8 months ago by Logan Adams
mrwyattii/safetensor
Updated 11 months ago by Michael Wyatt
mrwyattii/silence-backend-warning
Updated 1 year ago by Michael Wyatt
mrwyattii/update-GH-permission
Updated 9 months ago by GitHub
mrwyattii/update-MII-tests-infV2
Updated 11 months ago by GitHub
multi-z3-prs
Updated 3 years ago by Jeff Rasley
multi-z3-prs-r2
Updated 3 years ago by Jeff Rasley
mz/llama-support
Updated 1 year ago by Michael Wyatt
neox-q-int8
Updated 1 year ago by GitHub
niumanar/gan_optimizer
Updated 4 years ago by Niranjan Uma Naresh
offloadpp-news
Updated 11 months ago by GitHub
olruwase/accelerator_abstraction
Updated 2 years ago by GitHub
olruwase/adam_types
Updated 3 years ago by Olatunji Ruwase
olruwase/align_rrg_rs_param_order
Updated 3 years ago by GitHub
olruwase/all_gather_profiling
Updated 2 years ago by Tunji Ruwase
olruwase/amd_configurable_pp_rtol
Updated 3 years ago by Olatunji Ruwase
olruwase/assert_unused_parameters
Updated 3 years ago by Tunji Ruwase
olruwase/b16-debugging
Updated 2 years ago by Olatunji Ruwase
olruwase/bf16-updates-2
Updated 2 years ago by Olatunji Ruwase
olruwase/bf16_tied_weights_reduce
Updated 2 years ago by Olatunji Ruwase
olruwase/bf16_update_hp_params
Updated 2 years ago by Olatunji Ruwase
olruwase/bloom-support
Updated 2 years ago by Tunji Ruwase
olruwase/bloom_176b_checkpoint_bc
Updated 2 years ago by GitHub
olruwase/ci_pytorch_1x
Updated 1 year ago by Olatunji Ruwase
olruwase/disable_prefetch_profiler
Updated 1 year ago by GitHub
olruwase/disable_z3_prefetcher
Updated 1 year ago by Tunji Ruwase
olruwase/ds_2449
Updated 1 year ago by Tunji Ruwase
olruwase/ds_2921
Updated 1 year ago by Tunji Ruwase
olruwase/ds_3481
Updated 1 year ago by Tunji Ruwase
olruwase/ds_3680_2
Updated 1 year ago by Tunji Ruwase
olruwase/ds_3948
Updated 1 year ago by GitHub
olruwase/dynamic_graph_activation_checkpoint
Updated 3 years ago by Olatunji Ruwase
olruwase/elastic-ckpt-refresh
Updated 2 years ago by GitHub
olruwase/engine_destroy
Updated 2 years ago by Olatunji Ruwase
olruwase/fix_kernel_memory_bloat
Updated 3 years ago by Tunji Ruwase
olruwase/frozen_weights_unit_test
Updated 1 year ago by Tunji Ruwase
olruwase/fs-zero3_trace_fix
Updated 2 years ago by Olatunji Ruwase
olruwase/fs_z3_trace_error_disable
Updated 2 years ago by Olatunji Ruwase
olruwase/fs_z3_trace_log
Updated 2 years ago by Olatunji Ruwase
olruwase/fuse_torch_adam_w
Updated 1 year ago by GitHub
olruwase/gpt3-finetuning
Updated 3 years ago by Tunji Ruwase
olruwase/grad_accum_loss
Updated 3 years ago by Tunji Ruwase
olruwase/issue_3062
Updated 1 year ago by Olatunji Ruwase
olruwase/llama2_empty_group
Updated 1 year ago by GitHub
olruwase/local_storage_checkpoint
Updated 2 years ago by Olatunji Ruwase
olruwase/lr_warmup_decay
Updated 3 years ago by Olatunji Ruwase
olruwase/non_tensor_activation_checkpoint
Updated 3 years ago by Olatunji Ruwase
olruwase/nvme_finetune
Updated 2 years ago by GitHub
olruwase/nvme_offload_bug
Updated 3 years ago by GitHub
olruwase/nvme_perf_sweep
Updated 3 years ago by GitHub
olruwase/nvme_testsuite
Updated 3 years ago by Tunji Ruwase
olruwase/override_module_apply
Updated 1 year ago by Tunji Ruwase
olruwase/refactor_universal_checkpoint
Updated 2 years ago by GitHub
olruwase/restore_from_bit16_weights
Updated 2 years ago by GitHub
olruwase/round_robin_gradient_option
Updated 3 years ago by Olatunji Ruwase
olruwase/save_checkpoint_latest_false
Updated 3 years ago by Tunji Ruwase
olruwase/save_zero3_fp16_weights
Updated 3 years ago by Tunji Ruwase
olruwase/setup_env_libaio
Updated 1 year ago by GitHub
olruwase/trainable_parameters
Updated 3 years ago by Tunji Ruwase
olruwase/z3_perf_tune
Updated 2 years ago by Olatunji Ruwase
olruwase/z3_suppress_warning
Updated 2 years ago by Olatunji Ruwase
olruwase/zcode_model_expert
Updated 3 years ago by Tunji Ruwase
olruwase/zero1_non_tensor_checkpoint
Updated 3 years ago by Tunji Ruwase
olruwase/zero2_grad_accum_bug
Updated 3 years ago by Tunji Ruwase
olruwase/zero2_offload_keyerror
Updated 3 years ago by Tunji Ruwase
olruwase/zero2_offload_rrb_divergence
Updated 3 years ago by GitHub
olruwase/zero2_offload_slowdown
Updated 3 years ago by Tunji Ruwase
olruwase/zero2_trainable_parameters
Updated 2 years ago by GitHub
olruwase/zero2_trainable_parameters_v0.5.7
Updated 2 years ago by Tunji Ruwase
olruwase/zero2_unbalanced_grad_reduction
Updated 3 years ago by Tunji Ruwase
olruwase/zero3_amp_autocast
Updated 3 years ago by Olatunji Ruwase
olruwase/zero3_broken_tracing
Updated 3 years ago by GitHub
olruwase/zero3_dp_norm_allreduce
Updated 3 years ago by GitHub
olruwase/zero3_profile_fetch
Updated 2 years ago by Olatunji Ruwase
olruwase/zero3_unboundlocal_bug
Updated 3 years ago by Olatunji Ruwase
olruwase/zero_inference_tokgen
Updated 2 years ago by Olatunji Ruwase
olruwase/zero_inference_torch_version
Updated 1 year ago by Tunji Ruwase
olruwase/zero_offload_e2e
Updated 4 years ago by Tunji Ruwase
olruwase/zero_offload_fix_corner_case
Updated 4 years ago by Tunji Ruwase
olruwase/zero_offload_v3
Updated 4 years ago by Tunji Ruwase
olruwase/zero_optional_reduce_scatter
Updated 3 years ago by GitHub
olruwase/zero_stage1_checkpoint_layout
Updated 3 years ago by Tunji Ruwase
olruwase/zero_stage1_elastic_checkpoint
Updated 3 years ago by Olatunji Ruwase
olruwase/zinf_none_swapper
Updated 2 years ago by GitHub
paper
Updated 1 year ago by GitHub
patch-z1-cont-grad
Updated 2 years ago by GitHub
pr_moe_tutorial
Updated 2 years ago by GitHub
preserve-CVDs
Updated 2 years ago by Jeff Rasley
profiler-add-shape
Updated 2 years ago by Cheng Li
qanthony/bigbird
Updated 2 years ago by GitHub
qanthony/comms-bench
Updated 2 years ago by Quentin Anthony
qanthony/nccl-backend
Updated 2 years ago by Quentin Anthony
quantization-refresh
Updated 1 year ago by GitHub
quantize-inference
Updated 1 year ago by Reza Yazdani
refine-quantizer
Updated 2 years ago by Reza Yazdani
remotes/origin/dev/tput
Updated 1 year ago by Shijie Zhou
remove-tbx
Updated 2 years ago by Jeff Rasley
remove-unused-quantize-settings
Updated 1 year ago by GitHub
reyazda/adam-scalar-fix
Updated 3 years ago by GitHub
reyazda/cpu_adam_jit_v2
Updated 4 years ago by Jeff Rasley
reyazda/fix-inference-api
Updated 3 years ago by GitHub
reyazda/pytorch-workspace-allocate
Updated 3 years ago by GitHub
reyazda/remove_bertid
Updated 4 years ago by Reza Yazdani
reyazda/support_AVX2_by_default
Updated 4 years ago by Reza Yazdani
reyazda/test-hidden-dimension
Updated 3 years ago by Reza Yazdani
reyazda/test-sparse
Updated 3 years ago by Jeff Rasley
reyazda/test-sparse-v2
Updated 3 years ago by Jeff Rasley
reyazda/test-transformer
Updated 3 years ago by Reza Yazdani
reyazda/testing_embedding
Updated 3 years ago by Reza Yazdani
reyazda/triton-new-sparse
Updated 3 years ago by Reza Yazdani
reza/deepspeed_adam_merge_v3
Updated 4 years ago by Reza Yazdani
reza/fix-adam-copyfp16
Updated 4 years ago by Reza Yazdani
reza/fix_adam_corner_case
Updated 4 years ago by Reza Yazdani
reza/fix_adam_perf
Updated 4 years ago by Reza Yazdani
reza/megatron_kernel_integration
Updated 4 years ago by Reza Yazdani
saksham-zero1-fixes
Updated 3 years ago by GitHub
samyam-overlap-comm
Updated 4 years ago by GitHub
samyamr/elasticity
Updated 3 years ago by Samyam Rajbhandari
samyamr/fix-for-fragmented-linear-inputs
Updated 3 years ago by GitHub
samyamr/gpt3-finetuning
Updated 3 years ago by Samyam Rajbhandari
samyamr/gpt3-finetuning-mixed-precision
Updated 3 years ago by Samyam Rajbhandari
samyamr/stage3-alignment-fix
Updated 3 years ago by GitHub
samyamr/zero-2-debug
Updated 3 years ago by GitHub
security-patch
Updated 3 years ago by Jeff Rasley
shaden/textgen
Updated 2 years ago by Shaden Smith
smartreply_hotfix
Updated 4 years ago by Jeff Rasley
sp/comm-opt
Updated 10 months ago by Reza Yazdani
sparse-attn-cuda11
Updated 3 years ago by GitHub
sparse-attn/support-latest-triton
Updated 3 years ago by GitHub
staging-amd
Updated 3 years ago by Jeff Rasley
staging-amd-port
Updated 2 years ago by Jeff Rasley
staging-amd-v2
Updated 3 years ago by Jeff Rasley
staging-amd-v3
Updated 3 years ago by Jeff Rasley
staging-comms-next-v2
Updated 2 years ago by GitHub
staging-comms-v1
Updated 2 years ago by Quentin Anthony
staging-demo-feature-v0
Updated 1 year ago by GitHub
staging-ds-chat-blog-v1
Updated 1 year ago by Ammar Ahmad Awan
staging-ds-seq-v1
Updated 1 year ago by GitHub
staging-inference-v2-5
Updated 11 months ago by GitHub
staging-mii-update
Updated 2 years ago by Jeff Rasley
staging-moe-next-v1
Updated 2 years ago by Jeff Rasley
staging-oaas
Updated 2 years ago by Elton Zheng
staging-pld-v1
Updated 4 years ago by Tunji Ruwase
staging-pp
Updated 2 years ago by Du Li
staging-test
Updated 2 years ago by GitHub
staging-zero-dual-v2
Updated 4 years ago by GitHub
staging-zero-dual-v3
Updated 4 years ago by GitHub
staging-zero-dual-v5
Updated 4 years ago by GitHub
staging-zero-inference-v1
Updated 1 year ago by GitHub
stale-issues
Updated 3 years ago by GitHub
styoun/triton-flash2
Updated 1 year ago by GitHub
styoun/triton2.1
Updated 1 year ago by GitHub
styoun/triton2.1-autotune
Updated 1 year ago by GitHub
styoun/zero-inf-8bit-q
Updated 1 year ago by GitHub
subprocess-test
Updated 2 years ago by Jeff Rasley
test-ac
Updated 4 years ago by Jeff Rasley
test-cuda-11.7
Updated 1 year ago by Reza Yazdani
tmp
Updated 3 years ago by GitHub
tmp-old
Updated 1 year ago by GitHub
tohtana/add_slides_meetup_japan
Updated 4 months ago by GitHub
tohtana/bcast_warning_z3
Updated 8 months ago by Masahiro Tanaka
tohtana/cache_kv_requirements
Updated 9 months ago by Masahiro Tanaka
tohtana/compile-zero
Updated 8 months ago by GitHub
tohtana/compile_no_grad
Updated 6 months ago by Masahiro Tanaka
tohtana/debug_compile_backends
Updated 7 months ago by Masahiro Tanaka
tohtana/fix-save-checkpoint-step
Updated 1 year ago by Masahiro Tanaka
tohtana/fix_bf16_opt_update_hp
Updated 8 months ago by Masahiro Tanaka
tohtana/fix_chkpt_alignment
Updated 8 months ago by Masahiro Tanaka
tohtana/fix_sort_dp_univ_ckpt
Updated 6 months ago by GitHub
tohtana/fix_univ_chkpt_load
Updated 7 months ago by Masahiro Tanaka
tohtana/model_declaration_in_init_context
Updated 1 year ago by GitHub
tohtana/pipeline_with_compiled_module
Updated 7 months ago by Masahiro Tanaka
tohtana/remove_step_on_init
Updated 8 months ago by GitHub
tohtana/univ_ckpt_custom_shape
Updated 6 months ago by GitHub
tohtana/z3_multi_dtypes
Updated 11 months ago by GitHub
token-drop
Updated 3 years ago by Ammar Ahmad Awan
transformer-injection
Updated 2 years ago by GitHub
transformer-kernel/support-arbitrary-hidden
Updated 3 years ago by Reza Yazdani
transformer/fix-layer-norm
Updated 3 years ago by Reza Yazdani
transformer/injection
Updated 3 years ago by Reza Yazdani
transformer/large-seq-support
Updated 3 years ago by Reza Yazdani
transformer/triangular-mask
Updated 3 years ago by Reza Yazdani
triton-fix
Updated 1 year ago by Jeff Rasley
ucp_blog
Updated 3 months ago by Sam Ade Jacobs
umchand/test_compiler
Updated 4 months ago by Umesh Chand
umchand/triton/bias_act
Updated 8 months ago by Umesh Chand
unify-benchmark-knowledge
Updated 2 years ago by Michael Wyatt
update-flops-profiler-doc
Updated 1 year ago by GitHub
update-flops-profiler-pool-compute
Updated 2 years ago by Cheng Li
workaround-zero3
Updated 2 years ago by GitHub
z1-offload-multigpu
Updated 2 years ago by Jeff Rasley
z3-mem-leak
Updated 1 year ago by Jeff Rasley
zero-ckpt-cpu-issue-v2
Updated 2 years ago by Jeff Rasley
zhenyzhang-data
Updated 1 year ago by GitHub
zheweiyao/quantize_update
Updated 3 years ago by GitHub