Alexander Borzunov
|
c6e1b5a8e5
Add various server timeouts, lower --max_batch_size and --inference_max_length defaults (#97)
|
1 year ago |
Alexander Borzunov
|
d8ef09146e
Improve server's logging (#96)
|
1 year ago |
Alexander Borzunov
|
fef7257fe0
Try to fix protobuf versions once again (#95)
|
1 year ago |
Aleksandr Borzunov
|
1b51703444
Revert protobuf version change
|
1 year ago |
Alexander Borzunov
|
b26b0b7121
Require hivemind with fixed compression and protobuf working on Colab (#94)
|
1 year ago |
Alexander Borzunov
|
11d6ba683c
Make inference, forward, and backward fully fault-tolerant (#91)
|
1 year ago |
Alexander Borzunov
|
57e8d2e721
Implement exponential backoff for forward & backward (#85)
|
2 years ago |
Alexander Borzunov
|
f64eb3a665
Update hivemind to 1.1.2, mark `model` argument as required (#81)
|
2 years ago |
justheuristic
|
fef48d7d99
Use bitsandbytes==0.34.0, update readme (#76)
|
2 years ago |
justheuristic
|
8caf1145a8
Quality of life changes: update readme, simplify run_server interface (#75)
|
2 years ago |
justheuristic
|
3fdcc55a56
fix protobuf version (#74)
|
2 years ago |
justheuristic
|
e92487e5d2
Update dependency versions (#71)
|
2 years ago |
justheuristic
|
d271b75dd4
Let users specify sequence length instead of assuming 2048 (#52)
|
2 years ago |
Dmitry Baranchuk
|
11a424837f
integrate mixed-8bit model (#39)
|
2 years ago |
Dmitry Baranchuk
|
04a2b6f5e3
Support various backend dtypes & async serialization (#38)
|
2 years ago |
justheuristic
|
e2711a033b
Add automated tests (#23)
|
2 years ago |
justheuristic
|
99059ae667
install script
|
2 years ago |
justheuristic
|
b370b43110
freeze hivemind and transformers versions
|
2 years ago |