1 Department of Marketing & Management, Faculty of Business and Social Sciences, SDU2 Strategic Organizational Design, Department of Marketing & Management, Faculty of Business and Social Sciences, SDU3 Ca' Foscari University of Venice4 Department of Marketing & Management, Faculty of Business and Social Sciences, SDU
This paper revisits a recent study by Posen and Levinthal (Manag Sci 58:587–601, 2012) on the exploration/exploitation tradeoff for a multi-armed bandit problem, where the reward probabilities undergo random shocks. We show that their analysis suffers two shortcomings: it assumes that learning is based on stale evidence, and it overlooks the steady state. We let the learning rule endogenously discard stale evidence, and we perform the long run analyses. The comparative study demonstrates that some of their conclusions must be qualified.
Lecture Notes in Economics and Mathematical Systems: Agent-based Approaches To Economics and Social Systems, 2014, p. 205-216
Main Research Area:
Lecture Notes in Economics and Mathematical Systems