Sequential Choice Under Ambiguity: Intuitive Solutions to the Armed-Bandit Problem

Bibliographic Details
Published in	Management science Vol. 41; no. 5; pp. 817 - 834
Main Authors	Meyer, Robert J; Shi, Yong
Format	Journal Article
Language	English
Published	Linthicum, MD: INFORMS (Institute for Operations Research and the Management Sciences), 01.05.1995
Series	Management Science
Subjects	Airlines; Ambiguity; Applied sciences; Base interest rates; Civil aviation; Decision analysis; Decision making models; decision making under uncertainty; Decision theory; Entscheidung; Exact sciences and technology; Experiment; Experimentation; Heuristic; Heuristics; heuristics and biases; Konsumentenverhalten; learning and risk taking; Lernprozess; Management science; Normativity; Observed choices; Operational research and scientific management; Operational research. Management science; Passagierluftverkehr; Rational choice theory; Risk theory. Actuarial science; sequential decision analysis; Studies; Theorie; Learning; Uncertainty; Intuitionistic logic; Decision making; Heuristic methods; Sequential decision; Risk taking; Modeling; Method study
ISSN	0025-1909 1526-5501
DOI	10.1287/mnsc.41.5.817

Summary:	The process by which individuals learn from feedback when making recurrent choices among ambiguous alternatives is explored. We describe an experiment in which subjects solve a variant of the classic armed-bandit problem of dynamic decision theory, set in the context of airline choice. Subjects are asked to make repeated choices between two hypothetical airlines, one having an on-time departure probability that is known a priori and the other having an ambiguous probability whose true value can only be discovered by making sample trips on the airline. Subjects attempt to make choices in such a way as to maximize the total number of on-time departures over a fixed planning horizon. We examine the extent to which actual choice patterns over time are consistent with those that would be made by a decision maker acting as an optimal Bernoulli sampler. The data offer support for a number of expected (and some unexpected) departures from optimality, including a tendency to underexperiment with promising options and overexperiment with unpromising options, and a tendency to switch between airlines more often as the average base rate of departures decreases. Implications of the work for the descriptive validity of normative dynamic decision models are explored, as well as for the generalizability of previous findings about choice under ambiguity to dynamic settings.
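The normative benchmark mentioned in the summary, a decision maker who samples the ambiguous airline optimally over a fixed horizon, can be illustrated with a small backward-induction sketch. This is not the authors' procedure. It assumes a Beta prior over the ambiguous airline's on-time probability (uniform by default), and the function name `plan` and its parameters are hypothetical, chosen only for illustration.

```python
from functools import lru_cache


def plan(horizon, p_known, prior_successes=1, prior_failures=1):
    """Bayes-optimal plan for a two-airline problem (illustrative sketch).

    Assumption (not from the source): beliefs about the ambiguous airline
    follow a Beta(prior_successes, prior_failures) distribution.
    Returns (expected on-time departures, best first choice).
    """

    @lru_cache(maxsize=None)
    def value(trips_left, s, f):
        # s, f: Beta parameters after the on-time / late trips observed so far.
        if trips_left == 0:
            return 0.0, None

        # Known airline: fixed on-time rate, beliefs do not change.
        known = p_known + value(trips_left - 1, s, f)[0]

        # Ambiguous airline: posterior mean is the chance of an on-time trip,
        # and the outcome updates the belief used for the remaining trips.
        mean = s / (s + f)
        ambiguous = (
            mean * (1.0 + value(trips_left - 1, s + 1, f)[0])
            + (1.0 - mean) * value(trips_left - 1, s, f + 1)[0]
        )

        if ambiguous > known:
            return ambiguous, "ambiguous"
        return known, "known"

    return value(horizon, prior_successes, prior_failures)


if __name__ == "__main__":
    expected, first = plan(horizon=2, p_known=0.55)
    print(first, round(expected, 4))
```

Under these assumptions, with a known on-time rate of 0.55 and two trips remaining, the sketch picks the ambiguous airline first even though its prior mean is only 0.5, because a favorable first trip can still be exploited on the second. That value of experimentation is what the summary's notions of under- and overexperimenting are measured against.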