Learning is its Own Reward: Exploring Worlds with Curiosity-driven Spiking Neural Networks

Zelikman, Eric

Learning is its Own Reward: Exploring Worlds with Curiosity-driven Spiking Neural Networks

<a href="https://embed.stanford.edu/iframe/?url=https%3A%2F%2Fpurl.stanford.edu%2Fpb563ty3328" class="su-underline">Show Content</a>

Abstract/Contents

Abstract

While curiosity is an important part of human learning, it is rarely explicitly a part of machine learning: given some objective, machine learning algorithms optimize directly to get the best possible score. This has led to numerous impressive results but needs many examples and fails in situations where performance feedback is limited. Approaches attempting to incorporate curiosity generally still use a top-down optimization paradigm.

However, spiking neural networks (SNNs), which aim to better mimic biological neurons, present an alternative. Research has shown that local, spike-timing-dependent learning rules allow for the objective-free and bottom-up learning of complex patterns in data. But, limited work has been done applying spiking neural networks in contexts where they must interact with a simulated world making a series of actions.

We show that by combining bottom-up and top-down learning mechanisms in an SNN, namely spike-timing dependent plasticity (STDP) and backpropagation, and rewarding the SNN agent for the amount that it learns, as measured by its STDP update before modulation, our agent quickly learns to explore its environment in increasingly complex ways and seek increasingly novel situations. We demonstrate that, in terms of extrinsic rewards, this approach often results in better performance much more quickly (Up to 1,510 times faster with better performance for some environments and state-of-the-art models) than other curiosity-driven and general reinforcement-learning methods.

Description

Type of resource	text
Date created	June 2020

Creators/Contributors

Author	Zelikman, Eric
Degree granting institution	Stanford University, Department of Symbolic Systems
Primary advisor	Haber, Nicholas
Advisor	Dean, Thomas L.

Subjects

Subject	reinforcement learning
Subject	symbolic systems
Subject	curiosity
Subject	spiking neural networks
Subject	neural networks
Subject	sample-efficiency
Subject	generalization
Subject	machine learning
Genre	Thesis

Bibliographic information

Location	https://purl.stanford.edu/pb563ty3328

Access conditions

Use and reproduction: User agrees that, where applicable, content will not be used to identify or to otherwise infringe the privacy or confidentiality rights of individuals. Content distributed via the Stanford Digital Repository may be subject to additional license and use restrictions applied by the depositor.
License: This work is licensed under an Open Data Commons Attribution License v1.0.

Preferred citation

Preferred Citation: Zelikman, Eric (2020). Learning is its Own Reward: Exploring Worlds with Curiosity-driven Spiking Neural Networks. Stanford Digital Repository. Available at: https://purl.stanford.edu/pb563ty3328

Collection

Undergraduate Honors Theses, Symbolic Systems Program, Stanford University

View other items in this collection in SearchWorks

Contact information

Contact: ezelikman@cs.stanford.edu

Also listed in

View in SearchWorks

Loading usage metrics...