On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning

Durmus A.; E. Moulines; A. Naumov; S. Samsonov; Wai H.

Публикации

?

On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning

P. 1711–1752.

Durmus A., Мулине Э. Ф., Наумов А. А., Самсонов С. В., Wai H.

Язык: английский

Полный текст

Текст на другом сайте

Ключевые слова: Markov chains stability of random matrix product linear stochastic approximation TD-learning

ПУБЛИКАЦИЯ ПОДГОТОВЛЕНА ПО РЕЗУЛЬТАТАМ ПРОЕКТА:

Анализ неопределенности в алгоритмах машинного обучения (2021)

В книге

Proceedings of Machine Learning Research

Vol. 134: Conference on Learning Theory. , PMLR, 2021.

High-Order Error Bounds for Markovian LSA with Richardson–Romberg Extrapolation

Левин И. В., Наумов А. А., Самсонов С. В., , in: Proceedings of the AAAI Conference on Artificial Intelligence. AAAI-26: AAAI Technical Track on Planning, Routing, and Scheduling; AAAI Technical Track on Reasoning under Uncertainty; AAAI Technical Track on Search and Optimization. Main Track, volume 40 no. 43.: American Association for Artificial Intelligence (AAAI) Press, 2026. P. 36696–36704.

Добавлено: 17 апреля 2026 г.

Об одном применении теоремы А.Н. Колмогорова

Соболев В. Н., Фролов А. А., Чебышевский сборник 2025 Т. 26 № 5 С. 203–220

В статье на классе K бесконечных двоичных последовательностей без 1-серий строится согласованное распределение вероятностей P, которое индуцируется однородной цепью Маркова с матрицей перехода за один шаг P𝜑 , и полностью определяемой золотым сечением 𝜑. Использование цепи Маркова при построении вероятностной меры P позволяет применить теорему А.Н. Колмогорова о продолжении меры. Асимптотическое распределение подкласса K 0 ...

Добавлено: 11 февраля 2026 г.

Statistical inference for Linear Stochastic Approximation with Markovian Noise

Самсонов С. В., Шешукова М. Е., Moulines E. и др., , in: 39th Conference on Neural Information Processing Systems (NeurIPS 2025).: NeurIPS, 2025. P. 174565–174626.

Добавлено: 26 января 2026 г.

A Revival of Conservative Ideology and the Projected Religious Landscape in Russia

Скоробогатов А. С., Economics of Transition and Institutional Change 2026 Vol. 34 No. 2 P. 387–409

Добавлено: 3 ноября 2025 г.

Nonasymptotic Analysis of Stochastic Gradient Descent with the Richardson–Romberg Extrapolation

Шешукова М. Е., Беломестный Д. В., Durmus A. и др., , in: Proceedings of the 13th International Conference on Learning Representations (ICLR 2025).: ICLR, 2025.

Добавлено: 15 августа 2025 г.

SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD Learning

Mangold P., Самсонов С. В., Labbi S. и др., , in: 38th Conference on Neural Information Processing Systems (NeurIPS 2024).: [б.и.], 2024. Ch. 37 P. 13927–13981.

Добавлено: 11 февраля 2025 г.

Nonasymptotic Analysis of Stochastic Gradient Descent with the Richardson-Romberg Extrapolation

Шешукова М. Е., Беломестный Д. В., Durmus A. и др., / Series arXiv "math". 2024.

Добавлено: 13 октября 2024 г.

Improved High-Probability Bounds for the Temporal Difference Learning Algorithm via Exponential Stability

Самсонов С. В., Тяпкин Д. Н., Наумов А. А. и др., , in: Proceedings of Machine Learning Research. Volume 247: The Thirty Seventh Annual Conference on Learning Theory, 30-3 July 2023, Edmonton, Canada.: PMLR, 2024. Ch. 247 P. 4511–4547.

Добавлено: 13 октября 2024 г.

Rosenthal-type inequalities for linear statistics of Markov chains

Durmus A., Мулине Э. Ф., Наумов А. А. и др., / Series arXiv "math". 2023.

Добавлено: 18 июня 2023 г.

Local Limit Theorems and Strong Approximations for Robbins-Monro Procedures

Конаков В. Д., Mammen E., / Series arXiv "math". 2023. No. 2304.10673.

Добавлено: 24 апреля 2023 г.

Local-Global MCMC kernels: the best of both worlds

Самсонов С. В., Лагутин Е. М., Gabrie M. и др., , in: Thirty-Sixth Conference on Neural Information Processing Systems : NeurIPS 2022.: Curran Associates, Inc., 2022. P. 5178–5193.

Добавлено: 1 февраля 2023 г.

BR-SNIS: Bias Reduced Self-Normalized Importance Sampling

Cardoso G., Самсонов С. В., Thin A. и др., , in: Thirty-Sixth Conference on Neural Information Processing Systems : NeurIPS 2022.: Curran Associates, Inc., 2022. P. 716–729.

Добавлено: 1 февраля 2023 г.

Mathematical Model for Assessing the Reliability of Water Supply Networks

Рунев Е. В., Springer Nature Switzerland 2022 Vol. 402 No. 1 P. 343–351

Добавлено: 1 ноября 2022 г.

Finite-Time High-Probability Bounds for Polyak–Ruppert Averaged Iterates of Linear Stochastic Approximation

Durmus A., Мулине Э. Ф., Наумов А. А. и др., Mathematics of Operations Research 2025 Vol. 50 No. 2 P. 935–964

Добавлено: 13 июля 2022 г.

Об улучшенных оценках и условиях сходимости для цепей Маркова

Веретенников А. Ю., Веретенникова М. А., Известия РАН. Серия математическая 2022 Т. 86 № 1 С. 98–133

В настоящей работе продолжено исследование улучшенной скорости сходимости для эргодических однородных цепей Маркова. Постановка задачи расширена по сравнению с предыдущими работами на данную тему: удалось отказаться от предположения о единой доминирующей мере, рассмотрен случай неоднородных цепей Маркова; также рассмотрен случай более общего фазового пространства. Приведены примеры, когда новая оценка скорости сходимости такая же, и когда она оказывается лучше ...

Добавлено: 14 марта 2022 г.

Tight High Probability Bounds for Linear Stochastic Approximation with Fixed Stepsize

Durmus A., Мулине Э. Ф., Наумов А. А. и др., , in: Advances in Neural Information Processing Systems 34 (NeurIPS 2021).: Curran Associates, Inc., 2021. P. 30063–30074.

This paper provides a non-asymptotic analysis of linear stochastic approximation (LSA) algorithms with fixed stepsize. This family of methods arises in many machine learning tasks and is used to obtain approximate solutions of a linear system $\bar{A}\theta = \bar{b}$ for which $\bar{A}$ and $\bar{b}$ can only be accessed through random estimates $\{({\bf A}_n, {\bf b}_n): ...

Добавлено: 17 февраля 2022 г.

Probability and moment inequalities for additive functionals of geometrically ergodic Markov chains

Durmus A., Мулине Э. Ф., Наумов А. А. и др., Journal of Theoretical Probability 2024 Vol. 37 P. 2184–2233

In this paper, we establish moment and Bernstein-type inequalities for additive functionals of geometrically ergodic Markov chains. These inequalities extend the corresponding inequalities for independent random variables. Our conditions cover Markov chains converging geometrically to the stationary distribution either in V-norms or in weighted Wasserstein distances. Our inequalities apply to unbounded functions and depend explicitly on ...

Добавлено: 7 сентября 2021 г.

Macdonald polynomials and extended Gelfand–Tsetlin graph

Ольшанский Г. И., Selecta Mathematica, New Series 2021 Vol. 27 Article 41

Добавлено: 4 июня 2021 г.