Commit History

Author SHA1 Message Date
  Alexander Borzunov c6e1b5a8e5 Add various server timeouts, lower --max_batch_size and --inference_max_length defaults (#97) 1 year ago
  Alexander Borzunov d8ef09146e Improve server's logging (#96) 1 year ago
  Alexander Borzunov fef7257fe0 Try to fix protobuf versions once again (#95) 1 year ago
  Aleksandr Borzunov 1b51703444 Revert protobuf version change 1 year ago
  Alexander Borzunov b26b0b7121 Require hivemind with fixed compression and protobuf working on Colab (#94) 1 year ago
  Alexander Borzunov 11d6ba683c Make inference, forward, and backward fully fault-tolerant (#91) 1 year ago
  Alexander Borzunov 57e8d2e721 Implement exponential backoff for forward & backward (#85) 2 years ago
  Alexander Borzunov f64eb3a665 Update hivemind to 1.1.2, mark `model` argument as required (#81) 2 years ago
  justheuristic fef48d7d99 Use bitsandbytes==0.34.0, update readme (#76) 2 years ago
  justheuristic 8caf1145a8 Quality of life changes: update readme, simplify run_server interface (#75) 2 years ago
  justheuristic 3fdcc55a56 fix protobuf version (#74) 2 years ago
  justheuristic e92487e5d2 Update dependency versions (#71) 2 years ago
  justheuristic d271b75dd4 Let users specify sequence length instead of assuming 2048 (#52) 2 years ago
  Dmitry Baranchuk 11a424837f integrate mixed-8bit model (#39) 2 years ago
  Dmitry Baranchuk 04a2b6f5e3 Support various backend dtypes & async serialization (#38) 2 years ago
  justheuristic e2711a033b Add automated tests (#23) 2 years ago
  justheuristic 99059ae667 install script 2 years ago
  justheuristic b370b43110 freeze hivemind and transformers versions 2 years ago