Learning is its Own Reward: Exploring Worlds with Curiosity-driven Spiking Neural Networks
Abstract/Contents
- Abstract
While curiosity is an important part of human learning, it is rarely explicitly a part of machine learning: given some objective, machine learning algorithms optimize directly to get the best possible score. This has led to numerous impressive results but needs many examples and fails in situations where performance feedback is limited. Approaches attempting to incorporate curiosity generally still use a top-down optimization paradigm.
However, spiking neural networks (SNNs), which aim to better mimic biological neurons, present an alternative. Research has shown that local, spike-timing-dependent learning rules allow for the objective-free and bottom-up learning of complex patterns in data. But, limited work has been done applying spiking neural networks in contexts where they must interact with a simulated world making a series of actions.
We show that by combining bottom-up and top-down learning mechanisms in an SNN, namely spike-timing dependent plasticity (STDP) and backpropagation, and rewarding the SNN agent for the amount that it learns, as measured by its STDP update before modulation, our agent quickly learns to explore its environment in increasingly complex ways and seek increasingly novel situations. We demonstrate that, in terms of extrinsic rewards, this approach often results in better performance much more quickly (Up to 1,510 times faster with better performance for some environments and state-of-the-art models) than other curiosity-driven and general reinforcement-learning methods.
Description
Type of resource | text |
---|---|
Date created | June 2020 |
Creators/Contributors
Author | Zelikman, Eric | |
---|---|---|
Degree granting institution | Stanford University, Department of Symbolic Systems | |
Primary advisor | Haber, Nicholas | |
Advisor | Dean, Thomas L. |
Subjects
Subject | reinforcement learning |
---|---|
Subject | symbolic systems |
Subject | curiosity |
Subject | spiking neural networks |
Subject | neural networks |
Subject | sample-efficiency |
Subject | generalization |
Subject | machine learning |
Genre | Thesis |
Bibliographic information
Access conditions
- Use and reproduction
- User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
- License
- This work is licensed under an Open Data Commons Attribution License v1.0.
Preferred citation
- Preferred Citation
- Zelikman, Eric (2020). Learning is its Own Reward: Exploring Worlds with Curiosity-driven Spiking Neural Networks. Stanford Digital Repository. Available at: https://purl.stanford.edu/pb563ty3328
Collection
Undergraduate Honors Theses, Symbolic Systems Program, Stanford University
View other items in this collection in SearchWorksContact information
- Contact
- ezelikman@cs.stanford.edu
Also listed in
Loading usage metrics...