Document Type

Article

Publication Date

5-24-2019

Publication Title

Machine Learning & Knowledge Extracting

DOI

10.3390/make1020041

ISSN

2504-4990

Abstract

In many statistical and machine learning applications, without-replacement sampling is considered superior to with-replacement sampling. In some cases, this has been proven, and in others the heuristic is so intuitively attractive that it is taken for granted. In reinforcement learning, many count-based exploration strategies are justified by reliance on the aforementioned heuristic. This paper will detail the non-intuitive discovery that when measuring the goodness of an exploration strategy by the stochastic shortest path to a goal state, there is a class of processes for which an action selection strategy based on without-replacement sampling of actions can be worse than with-replacement sampling. Specifically, the expected time until a specified goal state is first reached can be provably larger under without-replacement sampling. Numerical experiments describe the frequency and severity of this inferiority.

Comments

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Recommended Citation

Carden, Stephen W., S. Dalton Walker. 2019. "Exploration Using Without-Replacement Sampling of Actions Is Sometimes Inferior." Machine Learning & Knowledge Extracting, 1 (2): 698-714: MDPI. doi: 10.3390/make1020041
https://digitalcommons.georgiasouthern.edu/math-sci-facpubs/766

Download

Included in

Mathematics Commons

COinS

Department of Mathematical Sciences Faculty Publications

Exploration Using Without-Replacement Sampling of Actions Is Sometimes Inferior

Document Type

Publication Date

Publication Title

DOI

ISSN

Abstract

Comments

Recommended Citation

Included in

Search GS Commons

Browse GS Commons

About GS Commons

Links

Department of Mathematical Sciences Faculty Publications

Exploration Using Without-Replacement Sampling of Actions Is Sometimes Inferior

Authors

Document Type

Publication Date

Publication Title

DOI

ISSN

Abstract

Comments

Recommended Citation

Included in

Share

Search GS Commons

Browse GS Commons

About GS Commons

Links