Sven Mika
|
a931076f59
[RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981)
|
3 years ago |
Sven Mika
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 years ago |
Sven Mika
|
b213565783
[RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693)
|
3 years ago |
gjoliver
|
c3c42278e4
[RLlib] clean up all the SampleBatch['is_training'] deprecation warnings (#19652)
|
3 years ago |
Sven Mika
|
61a1274619
[RLlib] No Preprocessors (part 2). (#18468)
|
3 years ago |
Sven Mika
|
8a066474d4
[RLlib] No Preprocessors; preparatory PR #1 (#18367)
|
3 years ago |
Sven Mika
|
494ddd98c1
[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928)
|
3 years ago |
Sven Mika
|
7eb1a29426
[RLlib] Fix ModelV2 custom metrics for torch. (#16734)
|
3 years ago |
Sven Mika
|
e973b726c2
[RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273)
|
3 years ago |
Sven Mika
|
bb8a286cbc
[RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684)
|
3 years ago |
Sven Mika
|
cecfc3b43b
[RLlib] Multi-GPU support for Torch algorithms. (#14709)
|
3 years ago |
Sven Mika
|
69202c6a7d
[RLlib] Obsolete usage tracking dict via sample batch. (#13065)
|
3 years ago |
Sven Mika
|
ee4b6e7e3b
[RLlib] Unity3D example broken due to change in ML-Agents API. Attention-net prev-n-a/r. Attention-wrapper works with images. (#14569)
|
3 years ago |
Sven Mika
|
95ef04b71a
[RLlib] Implement `TorchPolicy.export_model`. (#13989)
|
3 years ago |
Sven Mika
|
52c94b7ee9
[RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522)
|
3 years ago |
Sven Mika
|
6f342a2221
[RLlib] Preparatory PR for: Documentation on Model Building. (#13260)
|
3 years ago |
Sven Mika
|
391cdfae8c
[RLlib] Trajectory view API docs. (#12718)
|
3 years ago |
Sven Mika
|
a5318961de
[RLlib] Preprocessor fixes (multi-discrete) and tests. (#13083)
|
3 years ago |
Sven Mika
|
99ae7bae05
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
3 years ago |
Sven Mika
|
b2bcab711d
[RLlib] Attention Nets: tf (#12753)
|
3 years ago |
Sven Mika
|
99c81c6795
[RLlib] Attention Net prep PR #3. (#12450)
|
3 years ago |
Sven Mika
|
3ad9365e1d
[RLlib] Attention Net prep PR #2: Smaller cleanups. (#12449)
|
3 years ago |
Sven Mika
|
62c7ab5182
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
|
4 years ago |
Sven Mika
|
d9f1874e34
[RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609)
|
4 years ago |
Sven Mika
|
36bda8432b
[RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056)
|
4 years ago |
Sven Mika
|
805dad3bc4
[RLlib] SAC algo cleanup. (#10825)
|
4 years ago |
Sven Mika
|
e968b52cb7
[RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950)
|
4 years ago |
Sven Mika
|
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
4 years ago |
Sven Mika
|
57690a3a9f
[RLlib] Trajectory view API - 02 actual API scaffold (#9753)
|
4 years ago |
Eric Liang
|
590943a499
[rllib] Type annotations for model classes (#9646)
|
4 years ago |