Sven Mika
|
199dc8cff0
Revert "Revert "[RLlib] @deprecate(error=True|False) escalation."" (#28807)
|
2 years ago |
Amog Kamsetty
|
e501654925
Revert "[RLlib] @deprecate(error=True|False) escalation. (#28733)" (#28795)
|
2 years ago |
Sven Mika
|
c4348c1889
[RLlib] @deprecate(error=True|False) escalation. (#28733)
|
2 years ago |
Sven Mika
|
7cca7782f1
[RLlib] OPE (off policy estimator) API. (#24384)
|
2 years ago |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 years ago |
Sven Mika
|
ea2bea7e30
[RLlib; Docs overhaul] Docstring cleanup: Offline. (#19808)
|
3 years ago |
Sven Mika
|
c4a3e1589b
[RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761)
|
3 years ago |
Felipe Antunes
|
4c0f0ce3a9
[RLlib] In OffPolicyEstimators (Offline RL): Include last step of trajectory (#12619)
|
3 years ago |
Sven Mika
|
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
4 years ago |
Michael Luo
|
b51ab2af66
[RLlib] Offline Type Annotations (#9676)
|
4 years ago |
Sven Mika
|
d537e9f0d8
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
4 years ago |
Sven
|
60d4d5e1aa
Remove future imports (#6724)
|
4 years ago |
Eric Liang
|
5d7afe8092
[rllib] Try moving RLlib to top level dir (#5324)
|
5 years ago |