Abstract
In a previous paper, Amman et al. (Macroecon Dyn, 2018) compare the two dominant approaches for solving models with optimal experimentation (also called active learning): the value function approach and the approximation method. Using the same model and dataset as Beck and Wieland (J Econ Dyn Control 26:1359–1377, 2002), they find that the approximation method produces solutions close to those generated by the value function approach and identify elements of the model specification that affect the difference between the two solutions. They conclude that the differences are small when the effects of learning are limited. However, the dataset used in their experiment describes a situation in which the controller faces a nonstationary process and there is no penalty on the control. The goal of this paper is to determine whether their conclusions hold in the more commonly studied case of a controller facing a stationary process with a positive penalty on the control.
Publisher
Springer Science and Business Media LLC
References (36 articles)
1. Aghion, P., Bolton, P., Harris, C., & Jullien, B. (1991). Optimal learning by experimentation. Review of Economic Studies, 58, 621–654.
2. Amman, H., & Tucci, M. (2017). The dual approach in an infinite horizon model. Quaderni del Dipartimento di Economia Politica 766. Università di Siena, Siena, Italy.
3. Amman, H. M. (1996). Numerical methods for linear-quadratic models. In H. M. Amman, D. A. Kendrick, & J. Rust (Eds.), Handbook of Computational Economics (Handbooks in Economics, Vol. 13, pp. 579–618). Amsterdam: North-Holland Publishers.
4. Amman, H. M., Kendrick, D. A., & Tucci, M. P. (2018). Approximating the value function for optimal experimentation. Macroeconomic Dynamics (forthcoming).
5. Amman, H. M., & Neudecker, H. (1997). Numerical solution methods of the algebraic matrix Riccati equation. Journal of Economic Dynamics and Control, 21, 363–370.