Deep reinforcement learning (final version) received a peer review by Wikipedia editors, which was archived on 4 January 2021. It may contain ideas you can use to improve this article.
This article is rated Start-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects:
This article was reviewed by member(s) of WikiProject Articles for creation. The project works to allow users to contribute quality articles and media files to the encyclopedia and track their progress as they are developed. To participate, please visit the project page for more information.
This article is within the scope of WikiProject Computer science, a collaborative effort to improve the coverage of Computer science related articles on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article is within the scope of WikiProject Science, a collaborative effort to improve the coverage of Science on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
This article is within the scope of WikiProject Engineering, a collaborative effort to improve the coverage of engineering on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
The current "training" section is a mixture of a lot of different but very specific topics. It would make more sense to have it be an overview of deep RL algorithms, and then have a separate section on broad research directions that are being investigated: off-policy RL, inverse RL, meta-RL, goal-conditioned RL. Happy to do this myself if there is agreement. Anair13 (talk) 20:36, 24 November 2020 (UTC)