Commit History

Author SHA1 Message Date
  justheuristic 1a85ba99ee black-isort 2 years ago
  justheuristic 0e5e93af7c Update convert_8bit.py 2 years ago
  justheuristic 8f34b92b68 Update server.py 2 years ago
  justheuristic 95551d78b4 Update requirements.txt 2 years ago
  justheuristic c5c747fc8c bnb version 2 years ago
  justheuristic 1a765cfff9 Update README.md 2 years ago
  justheuristic 8caf1145a8 Quality of life changes: update readme, simplify run_server interface (#75) 2 years ago
  Artem Chumachenko 1046911dea Add prompt tuning example on Personachat dataset (#69) 2 years ago
  justheuristic 3fdcc55a56 fix protobuf version (#74) 2 years ago
  justheuristic e92487e5d2 Update dependency versions (#71) 2 years ago
  Pavel Samygin 50535a8435 Priority tasks (#47) 2 years ago
  justheuristic 892d18fea7 Build cpuonly from bitsandbytes main (#70) 2 years ago
  justheuristic f3984b192a Make attention cache wait until memory is freed (#53) 2 years ago
  justheuristic 8a0c056929 Fix calling rpc_info multiple times (#60) 2 years ago
  Artem Chumachenko ada98a1b37 Add deep prompt inference (#66) 2 years ago
  Alexander Borzunov 54ad745bed Warn that current instructions involve 6B model but we will replace them soon (#63) 2 years ago
  Alexander Borzunov 5f0c5329d4 Update readme with arxiv link and more discussions (#62) 2 years ago
  Alexander Borzunov 9bea7b9ea8 Update bullet points with feedback from Tim and other people (#61) 2 years ago
  Alexander Borzunov 7653562aa1 Use latest version of Petals scheme, shrink Petals logo (#59) 2 years ago
  Alexander Borzunov 2eb5843852 Update readme for the 1st public release (#57) 2 years ago
  Pavel Samygin 0be21775af remove transformer block, implement as sequential of size 1 (#54) 2 years ago
  Artem Chumachenko 77220c718c Add shallow prefix-tuned inference (#55) 2 years ago
  justheuristic d271b75dd4 Let users specify sequence length instead of assuming 2048 (#52) 2 years ago
  Dmitry Baranchuk 948877149c Fix recovering for sequential_backward (#50) 2 years ago
  Dmitry Baranchuk 24ba3433e4 [Fix] make distributed seq cls to not create the full bloom model (#49) 2 years ago
  justheuristic f12d0deee9 [quickfix 1/n] remove expensive assertions in inference code (#48) 2 years ago
  Dmitry Baranchuk 0fd2caa4be Convert actual model weights (#46) 2 years ago
  justheuristic a2634001e9 Reduce vocabulary size in test model, fix bug in routing when overlapped (#45) 2 years ago
  Dmitry Baranchuk 5745882c67 fix rpc_forward_stream 2 years ago
  Dmitry Baranchuk 6095f58681 Deep distributed prompt tuning (#42) 2 years ago