User:Turingsk/Books/RL
This user book is a user-generated collection of Wikipedia articles that can be easily saved, rendered electronically, and ordered as a printed book.
- Bellman equation
- Dopamine
- Hamilton–Jacobi–Bellman equation
- Hidden Markov model
- Leabra
- Marcus Hutter
- Markov decision process
- Multi-armed bandit
- Optimal control
- Partially observable Markov decision process
- Predictive state representation
- PVLV
- Q-learning
- Reinforcement learning
- Rescorla–Wagner model
- Reward system
- SARSA
- Temporal difference learning
- Dynamic treatment regime