.. |
__init__.py
|
05faa0b3c8
add quantization script for cpu
|
2 years ago |
config.json
|
a798ea04a6
add minimalistic benchmarks
|
2 years ago |
convert_model.py
|
a2634001e9
Reduce vocabulary size in test model, fix bug in routing when overlapped (#45)
|
2 years ago |
deploy_server.sh
|
11a424837f
integrate mixed-8bit model (#39)
|
2 years ago |
inference_one_block.py
|
4695071ad2
WIP: make DistributedBloom compliant with HF interface
|
2 years ago |
local_server_config_example.cfg
|
f60a7dd183
deploy swarm on local & remote machines
|
2 years ago |
remote_server_config_example.cfg
|
f60a7dd183
deploy swarm on local & remote machines
|
2 years ago |
run_local_servers.sh
|
11a424837f
integrate mixed-8bit model (#39)
|
2 years ago |
run_remote_servers.sh
|
6573076883
Sequential and parallel forward / backward (#36)
|
2 years ago |
run_server.py
|
11a424837f
integrate mixed-8bit model (#39)
|
2 years ago |
speed_test.py
|
e2711a033b
Add automated tests (#23)
|
2 years ago |