On-Policy Deep Reinforcement Learning for the Average-Reward Criterion | NobleID

Main

Mint Work Search Explore Works Knowledge Graph Citation Graph Bibliometrics Receipts

v1

On-Policy Deep Reinforcement Learning for the Average-Reward Criterion

Identifier:nobleid.org/w1/20260515/404AFFC2

Type:Preprint

0 views

Support unavailableClaim Your Authorship

View Original Paper Source

Embeddable Badge

[![NobleID](https://www.nobleid.org/api/badge/404AFFC2.svg?ark=w1%2F20260515%2F404AFFC2)](https://nobleid.org/work/w1/20260515/404AFFC2)

ARK Inflections

Metadata Policy

Metadata Formats

Bibliometric Analysis

Impact metrics, research fronts, co-authorship networks →

Authors & Claims

Paper Authors

Claim authorship of this work

info@nobleid.org

About FAQ Terms Policy Persistence & Resolver Service Policy Pricing