Neural Modulation for Reinforcement Learning in Developmental Networks Facing an Exponential No. of States

Hao Ye,Xuanjing Huang,Juyang Weng
2013-01-01
Abstract:Suppose that a developmental agent (animal or machine) has c concepts to learn and each concept has v possible values. The number of states is then vc, exponential in the number of possible concepts. This computational complexity is well known to be intractable. In artificial intelligence (AI), human handcrafting of symbolic states has been adopted to reduce the number of states, relying on human intuition about the required states of a given task. This paradigm has resulted in the well-known high brittleness because of the inability of the human designer to check the validity of his state reduction for the system to correctly go through an exponential number of paths of state transitions (eg, in graphic models). In this reported work, we study how a Developmental Network (DN) as an emergent and probabilistic finite automaton (FA) that enables its states to emerge automatically—only those that are experienced in its “life”—greatly reducing the number of actual states. In order to avoid the requirement for the human teacher to specify every state in online teaching (ie, action in DN), we allow the human teacher to give scores to evaluate the displayed actions (ie, reinforcement learning), modeling the serotonin system for punishments and the dopamine system for rewards. Due to the need of ground truth for performance evaluation which is hard to come by in the real world, we used a simulation environment described as a game setting, but the methodology is applicable to a real-world developmental robot and also our computational understanding how an animal develops its skills.
What problem does this paper attempt to address?