Programs via Reinforcement
Programs via Reinforcement
The learning of cognitive programs faces many technical difficulties, but the most important is the valuation of programs that have delayed rewards. The algorithms of Reinforcement Learning that the brain uses tackle this head-on are logically situated in the Basal Ganglia, which represent the abstract sequential components of motor and cognitive plans. Such sequences are evaluated in terms of their expected reward and risk, which in turn are coded, by using the neurotransmitters dopamine and serotonin respectively, which serves as a common evaluative currency. Reinforcement learning algorithms learn by adjusting deviations in expected reward, a signal, which can also be used to program the Cortex’s memory representations. Tesauro’s use of reinforcement to learn the game of Backgammon provides a superb example of the putative integration of the process between the two forebrain subsystems.
Keywords: Cognitive Programs, Reinforcement Learning, Basal Ganglia, Forebrain
MIT Press Scholarship Online requires a subscription or purchase to access the full text of books within the service. Public users can however freely search the site and view the abstracts and keywords for each book and chapter.
Please, subscribe or login to access full text content.
If you think you should have access to this title, please contact your librarian.
To troubleshoot, please check our FAQs, and if you can't find the answer there, please contact us.