码涯-AIGC代码仓库-openoker/ray: 一个针对强化学习和深度学习所设计的大规模分布式计算框架。 @ data-dashboard

simonsays1980 94ed6f99a3 [RLlib] BC RLModule. (#39542)		1 年之前
..
a2c	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
a3c	1c636e7c30 [RLlib] DreamerV3: Learner API classes for tf-keras, loss functions, additional update method. (#35385)	1 年之前
alpha_star	adfdbbdfa2 [RLlib] APPO+new-stack (Atari benchmark) - Preparatory PR 03 - PyTorch. (#34779)	1 年之前
alpha_zero	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
apex_ddpg	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
apex_dqn	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
appo	960032a15f [RLlib][RLModules] RNNs and RLModules (#32723)	1 年之前
ars	8427de2776 [RLlib] Fix ARS release test (#35608)	1 年之前
bandits	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
bc	94ed6f99a3 [RLlib] BC RLModule. (#39542)	1 年之前
cql	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
crr	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
ddpg	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
ddppo	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
dqn	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
dreamer	b52a81b3de [RLlib] Preparation for gymnasium/gym0.26 upgrade: Deprecate `horizon` and `soft_horizon` settings. (#30583)	1 年之前
dreamerv3	b0045799e3 [RLlib] DreamerV3: Make 200M (XL model) work; mixed float16 option (#38461)	1 年之前
dt	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
es	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
impala	78b58a959a [RLlib] Learner API: Policies using RLModules (for sampler only) do not need loss/stats/mixins. (#34445)	1 年之前
leela_chess_zero	f80badcdb0 [RLlib] Remove leela chess from release tests (#32325)	1 年之前
maddpg	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
maml	b52a81b3de [RLlib] Preparation for gymnasium/gym0.26 upgrade: Deprecate `horizon` and `soft_horizon` settings. (#30583)	1 年之前
marwil	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
mbmpo	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
pg	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
ppo	8d80377445 [RLlib] Fixed 'rollout_fragment_length' in pong-example by setting it to 'auto'. (#39552)	1 年之前
qmix	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
r2d2	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
sac	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
simple_q	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
slateq	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
td3	2dbd5fbeac [RLlib] Add throughput per-second metrics (env/agent steps trained and -sampled) to Algorithm. (#34526)	1 年之前
__init__.py	7243d56185 [RLlib] python-based training from cli (to make tuned examples more pythonic) (#29459)	2 年之前
cleanup_experiment.py	7f1bacc7dc [CI] Format Python code with Black (#21975)	2 年之前
compact-regression-test.yaml	8e680c483c [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)	1 年之前
create_plots.py	baa053496a [RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414)	4 年之前