Alexander Borzunov
|
c07a7e0812
Add "Terms of Use"
|
1 year ago |
Artem Chumachenko
|
0d9c7de0bd
Add sst-2 ipynb example (#86)
|
1 year ago |
Alexander Borzunov
|
57e8d2e721
Implement exponential backoff for forward & backward (#85)
|
2 years ago |
Alexander Borzunov
|
ee4e69c254
Enable rebalancing by default (#84)
|
2 years ago |
Artem Chumachenko
|
2cb82dd648
Add colab-related changes (#80)
|
2 years ago |
Alexander Borzunov
|
87fd6a4f08
Fix "Too many open files" during rebalancing (#83)
|
2 years ago |
Alexander Borzunov
|
f64eb3a665
Update hivemind to 1.1.2, mark `model` argument as required (#81)
|
2 years ago |
Alexander Borzunov
|
149f433763
Rebalance swarm when necessary (#34)
|
2 years ago |
Alexander Borzunov
|
640bbc38a9
Make even smaller readability changes
|
2 years ago |
Alexander Borzunov
|
d1b012b479
Make small readability & style changes to the instructions (#77)
|
2 years ago |
justheuristic
|
fef48d7d99
Use bitsandbytes==0.34.0, update readme (#76)
|
2 years ago |
justheuristic
|
8caf1145a8
Quality of life changes: update readme, simplify run_server interface (#75)
|
2 years ago |
Artem Chumachenko
|
1046911dea
Add prompt tuning example on Personachat dataset (#69)
|
2 years ago |
justheuristic
|
3fdcc55a56
fix protobuf version (#74)
|
2 years ago |
justheuristic
|
e92487e5d2
Update dependency versions (#71)
|
2 years ago |
Pavel Samygin
|
50535a8435
Priority tasks (#47)
|
2 years ago |
justheuristic
|
892d18fea7
Build cpuonly from bitsandbytes main (#70)
|
2 years ago |
justheuristic
|
f3984b192a
Make attention cache wait until memory is freed (#53)
|
2 years ago |
justheuristic
|
8a0c056929
Fix calling rpc_info multiple times (#60)
|
2 years ago |
Artem Chumachenko
|
ada98a1b37
Add deep prompt inference (#66)
|
2 years ago |
Alexander Borzunov
|
54ad745bed
Warn that current instructions involve 6B model but we will replace them soon (#63)
|
2 years ago |
Alexander Borzunov
|
5f0c5329d4
Update readme with arxiv link and more discussions (#62)
|
2 years ago |
Alexander Borzunov
|
9bea7b9ea8
Update bullet points with feedback from Tim and other people (#61)
|
2 years ago |
Alexander Borzunov
|
7653562aa1
Use latest version of Petals scheme, shrink Petals logo (#59)
|
2 years ago |
Alexander Borzunov
|
2eb5843852
Update readme for the 1st public release (#57)
|
2 years ago |
Pavel Samygin
|
0be21775af
remove transformer block, implement as sequential of size 1 (#54)
|
2 years ago |
Artem Chumachenko
|
77220c718c
Add shallow prefix-tuned inference (#55)
|
2 years ago |
justheuristic
|
d271b75dd4
Let users specify sequence length instead of assuming 2048 (#52)
|
2 years ago |
Dmitry Baranchuk
|
948877149c
Fix recovering for sequential_backward (#50)
|
2 years ago |
Dmitry Baranchuk
|
24ba3433e4
[Fix] make distributed seq cls to not create the full bloom model (#49)
|
2 years ago |