Template:MAE839-Biblio
Bibliography for MDPs:
- Bertsekas, D. P., Dynamic Programming and Optimal Control, vol. I and II, Athena Scientific, 1995. (Later editions: vol. I, 2017 and vol. II, 2012)
 - Bäuerle, N., Rieder, U. (2011). Markov decision processes with applications to finance. Springer Science & Business Media.
 - Boucherie, R. J., & van Dijk, N. M. (Eds.) (2017). Markov Decision Processes in Practice. (International Series in Operations Research & Management Science; Vol. 248). Springer. https://doi.org/10.1007/978-3-319-47766-4
 - Chakravorty, J., & Mahajan, A. (2014). Multi-Armed Bandits, Gittins Index, and its Calculation. Methods and applications of statistics in clinical trials: Planning, analysis, and inferential methods, 2, 416-435.
 - Feinberg, E. A., & Shwartz, A. (Eds.). (2012). Handbook of Markov decision processes: methods and applications (Vol. 40). Springer Science & Business Media.
 - Koole, G. (2007). Monotonicity in Markov reward and decision chains: Theory and applications. Foundations and Trends® in Stochastic Systems, 1(1), 1-76.
 - Puterman, M. L. (2014). Markov decision processes: discrete stochastic dynamic programming. John Wiley & Sons.
 - Ross, S. M. (2013). Applied probability models with optimization applications. Courier Corporation.
 - A concise introduction to MDPs can be found in Chapter 17 of M. Mohri, A. Rostamizadeh, and A. Talwalkar. Foundations of Machine Learning, MIT Press, 2018.
 - Sigaud, O., & Buffet, O. (Eds.). (2013). Markov decision processes in artificial intelligence. John Wiley & Sons.
 
Bibliography for RL:
- A. Agarwal, N. Jiang, S. Kakade, W. Sun. Reinforcement Learning: Theory and Algorithms, Working Book.
 - Bertsekas, D. P., Tsitsiklis, J. N. (1996). Neuro-dynamic programming. Athena Scientific.
 - Bertsekas, D.P. (2019). Reinforcement learning and optimal control. Athena Scientific.
 - Meyn, S.P. (2022). Control Systems and Reinforcement Learning, Cambridge University Press.
 - Powell, W. B. (2007). Approximate Dynamic Programming: Solving the curses of dimensionality (Vol. 703). John Wiley & Sons.
 - Sutton, R.S., Barto, A.G. (2018). Reinforcement Learning: An Introduction, MIT Press.
 
Related scientific journals:
- Operations Research (INFORMS)
 - Mathematics of Operations Research (INFORMS)
 - European Journal of Operational Research (Elsevier)