New Preprints

  • S. Cayci and A. Eryilmaz, Recurrent Natural Policy Gradient for POMDPs, arXiv preprint: 2405.18221, 2024.
  • S. Cayci and A. Eryilmaz, Convergence of Gradient Descent for Recurrent Neural Networks: A Nonasymptotic Analysis, arXiv preprint: 2402.12241, 2024.
Selected Publications

  • S. Cayci, N. He, R. Srikant, Finite-time analysis of entropy-regularized neural natural actor-critic algorithm, Transactions on Machine Learning Research, 2024.
  • S. Cayci, N. He, R. Srikant, Convergence of entropy-regularized natural policy gradient with linear function approximation, to appear in SIAM Journal on Optimization, 2024.
  • S. Cayci, N. He, R. Srikant, Finite-time analysis of natural actor-critic for POMDPs, to appear in SIAM Journal on Mathematics of Data Science, 2024.
  • S. Cayci and A. Eryilmaz, Provably Robust Temporal Difference Learning for Heavy-Tailed Rewards, NeurIPS 2023.
  • S. Cayci, S. Satpathi, N. He, R. Srikant, Sample Complexity and Overparameterization Bounds for Temporal Difference Learning with Neural Network Approximation, IEEE Transactions on Automatic Control, 2023.
  • B. Yardim, S. Cayci, M. Geist, N. He, Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games, ICML 2023.