An accelerated directional derivative method for smooth stochastic convex optimization

P. Dvurechensky; Eduard Gorbunov; A. Gasnikov

doi:10.1016/j.ejor.2020.08.027

Publications

?

An accelerated directional derivative method for smooth stochastic convex optimization

European Journal of Operational Research. 2021. Vol. 290. No. 2. P. 601–621.

Dvurechensky P., Eduard Gorbunov, Gasnikov A.

We consider smooth stochastic convex optimization problems in the context of algorithms which are based on directional derivatives of the objective function. This context can be considered as an intermediate one between derivative-free optimization and gradient-based optimization. We assume that at any given point and for any given direction, a stochastic approximation for the directional derivative of the objective function at this point and in this direction is available with some additive noise. The noise is assumed to be of an unknown nature, but bounded in the absolute value. We underline that we consider directional derivatives in any direction, as opposed to coordinate descent methods which use only derivatives in coordinate directions. For this setting, we propose a non-accelerated and an accelerated directional derivative method and provide their complexity bounds. Our non-accelerated algorithm has a complexity bound which is similar to the gradient-based algorithm, that is, without any dimension-dependent factor. Our accelerated algorithm has a complexity bound which coincides with the complexity bound of the accelerated gradient-based algorithm up to a factor of square root of the problem dimension. We extend these results to strongly convex problems.

Research target: Computer Science

Priority areas: IT and mathematics

Language: English

DOI

Text on another site

Keywords: convex optimization

The Exact Circuit Complexity of Boolean Functions in an Infinite Basis

Mikhailovich A., Kochergin V., Mathematical notes 2025 Vol. 117 No. 3-4 P. 579–594

The exact value of the complexity of the circuit implementation of an arbitrary Boolean function in a certain basis consisting of negation and all monotone Boolean functions is found. The complexity of a function is defined as the least number of basis elements sufficient to construct a circuit implementation of this function. ...

Added: February 28, 2026

SSL-MEPR: A Semi-Supervised Multi-Task Cross-Domain Learning Framework for Multimodal Emotion and Personality Recognition

Ryumina E., Aksenov A., Koryakovskaya D. et al., Machine Learning and Knowledge Extraction 2026 No. 8 P. 1–41

The growing demand for personalized human-computer interaction calls for methods that jointly model emotional states and personality traits. However, large-scale multimodal corpora annotated for both tasks are still lacking. This challenge stems from integrating diverse, task-specific corpora with divergent modality informativeness and domain characteristics. To address it, we propose SSL-MEPR, a semi-supervised multi-task cross-domain learning ...

Added: February 27, 2026

Моделирование информационного сетевого взаимодействия в киберсоциальных системах

Maltseva S. V., Голубцов П. В., Барахнин В. Б., Вычислительные технологии 2026 Т. 31 № 1 С. 5–22

The issues of macro-level monitoring of the manufacturing system in the implementation of the concepts of Industry 4.0 and 5.0 based on the study of information flows in manufacturing network structures are considered. The numerical models of three types of network interaction, that taking into account the influence of the number of objects, external influences, ...

Added: February 26, 2026

HoTPP benchmark: Are we good at the long horizon events forecasting?

Karpukhin I., Shipilov F., Savchenko A., Neurocomputing 2026 Vol. 672 Article 132771

Forecasting multiple future events within a given time horizon is essential for applications in finance, retail, social networks, and healthcare. This problem is typically addressed using Marked Temporal Point Processes (MTPP), which provide a principled framework for modeling both event timing and event labels. While most existing research focuses on predicting only the next event, forecasting distant future ...

Added: February 25, 2026

Comparative analysis of the characteristics of promising apsk modulation schemes in wireless telecommunications

Kazakov G. N., Nguyen H. T., Shevgunov T. et al., T-Comm: Telecommunications and transport 2025 Vol. 19 No. 9 P. 59–76

The growing requirements for the use of high-speed and energy-efficient high-capacity data transmission channels in modern and future telecommunication networks have led to an increasing interest in the formation and application of signals with new constellations. Requirements for the shape of signal constellations in connection with the emergence of new technologies of wireless telecommunications are ...

Added: February 25, 2026

Метод оценки частно-временной плотности вероятности цифрового сигнала с использованием линейной интерполяции

Shevgunov T., T-Comm: Телекоммуникации и транспорт 2024 Т. 18 № 7 С. 4–12

В работе представлена разработка нового инструмента частно-временного (fraction-of-time) подхода, в рамках которого случайный процесс описывается с использованием функциональных моделей, синтезируемых по его единственной наблюдаемой реализации, без необходимости построения абстрактных вероятностных моделей в условиях отсутствия достоверной априорной информации о проявлении процессом свойства эргодичности. На основе полученной ранее аналитической формулы, выражающей частно-временную плотность непрерывного сигнала в явной ...

Added: February 25, 2026

Proceedings of the Ninth International Scientific Conference “Intelligent Information Technologies for Industry”

Cham: Springer Publishing Company, 2026.

This book contains the works connected with the key advances in Intelligent Information Technologies for Industry presented at IITI 2025, the Ninth International Scientific Conference on Intelligent Information Technologies for Industry held on November 5-7, 2025 in Sirius Federal Territory, Russia. The book is written by the experts in the field of applied artificial intelligence ...

Added: February 25, 2026

Measuring External Conflict in Dempster-Shafer Theory Based on Kantorovich Problems

Bronevich A., Lepskiy A., International Journal of Approximate Reasoning 2026 Vol. 190 Article 109597

In the paper, we consider three possible types of external conflict in Dempster-Shafer theory and propose its measurement based on functionals evaluating intersection, inclusion and distance between random sets. All proposed functionals can be viewed as extensions of known functionals like Jaccard metric, Jaccard index, and Dice coefficient from usual sets to random sets based ...

Added: February 25, 2026

Proceeding of international Conference on Data Mining (ICDM 2022)

Sergei O. Kuznetsov, Buzmakov A., Makhalova T. et al., IEEE, 2022.

In this paper, we revisit pattern mining and study the distribution underlying a binary dataset thanks to the closure structure which is based on passkeys, i.e., minimum generators in equivalence classes robust to noise. We introduce △-closedness, a generalization of the closure operator, where △ measures how a closed set differs from its upper neighbors ...

Added: February 25, 2026

АНАЛИЗ ТОНАЛЬНОСТИ РУССКОЙ ДРАМЫ XVIII–XX ВВ. КАК ИНСТРУМЕНТ МОДЕЛИРОВАНИЯ ХУДОЖЕСТВЕННОЙ СТРУКТУРЫ

Anisimova K., Цифровые гуманитарные исследования 2025 № 2 С. 24–47

Исследование посвящено описанию эмоциональной динамики как проявления художественной структуры русской драмы XVIII–XX вв. на основе автоматической разметки тональности реплик с использованием нейросетевых моделей BERT-архитектуры. Такие модели, дообученные даже на нехудожественных текстах, показывают удовлетворительные результаты при анализе тональности драматических реплик, что было проверено на вручную размеченной тестовой выборке. На основе такой автоматической эмоциональной разметки было показано, ...

Added: February 24, 2026

Explainable artificial intelligence for smart and ethical healthcare

Avdoshin S. M., Elena Yu. Pesotskaya, Advanced SmartHealth 2026 Vol. 1 No. 1 P. 1–15

SmartHealth technologies are evolving rapidly, and the emerging Medicine 5.0 paradigm highlights the need for artificial intelligence that pairs high performance with explainability, transparency, and ethical soundness. However, many neural network approaches remain “black boxes,” limiting their uptake in clinical practice, where justification and trust are essential. This article reviews their applications in diagnosis, monitoring ...

Added: February 24, 2026

Advanced SmartHealth

Avdoshin S. M., Pesotskaya E. Y., Singapore: AccScience Publishing, 2026.

Added: February 24, 2026

Explainable artificial intelligence for smart and ethical healthcare

Avdoshin S. M., Elena Yu. P., Advanced Journal of SmartHealth 2026 Vol. X P. 1–15

SmartHealth technologies are evolving rapidly, and the emerging Medicine 5.0 paradigm highlights the need for artificial intelligence that pairs high performance with explainability, transparency, and ethical soundness. However, many neural-network approaches remain “black boxes,” limiting their uptake in clinical practice, where justification and trust are essential. This article reviews their applications in diagnosis, monitoring of ...

Added: February 24, 2026

ФУНДАМЕНТАЛЬНАЯ МОДЕЛЬ ДЛЯ ВРЕМЕННЫХ РЯДОВ И КАК ЕЕ (НЕ) ОБУЧАТЬ НА СИНТЕТИКЕ

Temirkhanov A., Костромина А. М., Цымбой О. А. et al., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2025 Т. 527 № S С. 485–494

The industry is rich in cases when we are required to make forecasting for large amounts of time series at once. However, we might be in a situation where we can not afford to train a separate model for each of them. Such issue in time series modeling remains without due attention. The remedy for ...

Added: February 24, 2026

ГрафиКон 2025 : материалы 35-й Международной конференции по компьютерной графике и машинному зрению (Россия, Йошкар-Ола, 30 сентября – 2 октября 2025 г.)

Йошкар-Ола: Поволжский государственный технологический университет, 2025.

Представлены материалы 35-й Международной конференции «ГрафиКон 2025», проходившей на базе Поволжского государственного технологического университета. В сборник вошли доклады участников конференции, посвященные методам и технологиям компьютерного анализа изображений, визуальной и когнитивной аналитики, 3D-реконструкции, визуальной навигации и человеко-машинного взаимодействия, виртуальной и дополненной реальности, распознавания образов и др. Издание адресовано сотрудникам научно-исследовательских и образовательных организаций, специалистам предприятий ИТ-индустрии, аспирантам, студентам. ...

Added: February 21, 2026

BIG DATA и анализ высокого уровня = BIG DATA and Advanced Analytics: сб. науч. ст. XI Междунар. науч.-практ. конф. (Республика Беларусь, Минск, 23–24 апреля 2025 года)

Мн.: БГУИР, 2025.

BIG DATA и анализ высокого уровня = BIG DATA and Advanced Analytics : сб. науч. ст. XI Междунар. науч.-практ. конф. (Республика Беларусь, Минск, 23–24 апреля 2025 года) / редкол.: В. А. Богуш [и др.]. – Минск : БГУИР, 2025. – 498 с. ISBN 978-985-543-814-5. Опубликованы результаты научных исследований и разработок в области BIG DATA and Advanced Analytics для оптимизации ...

Added: February 21, 2026

Iterative Ricci-Foster Curvature Flow with GMM-Based Edge Pruning: A Novel Approach to Community Detection

Sorokin K., Beketov M., Онучин А. et al., / arxiv.org. Серия cs.SI "Social and Information Networks ". 2025.

Community detection in complex networks is a fundamental problem, open to new approaches in various scientific settings. We introduce a novel community detection method, based on Ricci flow on graphs. Our technique iteratively updates edge weights (their metric lengths) according to their (combinatorial) Foster version of Ricci curvature computed from effective resistance distance between the ...

Added: January 15, 2026

Implementing Transport Coding in OMNeT++ for Message Delay Reduction

Petrovanov I., Sergeev A., / Series Computer Science "arxiv.org". 2025. No. 2512.18332.

Transport coding reduces message delay in packet-switched networks by introducing controlled redundancy at the transport layer: original packets are encoded into coded packets, and the message is reconstructed after the first successful deliveries, effectively shifting latency from the maximum packet delay to the -th order statistic. We present a concise, reproducible discrete-event implementation of transport coding in OMNeT++, including ...

Added: December 24, 2025

Hessian-based lightweight neural network for brain vessel segmentation on a minimal training dataset

Меньшиков И. А., Бернадотт А. К., Elvimov N. S., / Series arXie "Statistical mechanics". 2025.

Accurate segmentation of blood vessels in brain magnetic resonance angiography (MRA) is essential for successful surgical procedures, such as aneurysm repair or bypass surgery. Currently, annotation is primarily performed through manual segmentation or classical methods, such as the Frangi filter, which often lack sufficient accuracy. Neural networks have emerged as powerful tools for medical image ...

Added: December 1, 2025

Determining the boundary of dynamical chaos in the generalized Chirikov map via machine learning

Чернышов Д. П., Satanin A., Shchur L., / Series arXiv "math". 2025.

We investigate the boundary separating regular and chaotic dynamics in the generalized Chirikov map, an extension of the standard map with phase-shifted secondary kicks. Lyapunov maps were computed across the parameter space (K,K(α, τ)) and used to train a convolutional neural network (ResNet18) for binary classification of dynamical regimes. The model reproduces the known critical ...

Added: November 21, 2025

On Linear Convergence in Smooth Convex-Concave Bilinearly-Coupled Saddle-Point Optimization: Lower Bounds and Optimal Algorithms

Borodich E., Gasnikov A., , in: Volume 267: International Conference on Machine Learning, 13-19 July 2025, Vancouver Convention Center, Vancouver, CanadaVol. 267.: [б.и.], 2025. P. 1–56.

We revisit the smooth convex-concave bilinearly-coupled saddle-point problem of the form . In the highly specific case where function is strongly convex and function is affine, or both functions are affine, there exist lower bounds on the number of gradient evaluations and matrix-vector multiplications required to solve the problem, as well as matching optimal algorithms. A notable aspect ...

Added: November 18, 2025

Эффективный алгоритм торговли на фондовом рынке: ретроспективный анализ, основанный на данных по S&P-500.

Rubchinskiy A., Chubarova D., / Series WP7 "Математические методы анализа решений в экономике, бизнесе и политике". 2025. No. WP7/2025/01.

The article examines one of the most famous examples of socio-economic systems, characterized by significant uncertainty – the S&P-500 stock market, where shares of 500 largest US companies are traded. No assumptions are made about the probabilistic characteristics of the stock market. A flexible algorithm for daily trading has been developed, based on both known fixed data ...

Added: November 9, 2025

Diffusion on language model embeddings for protein sequence generation

Meshchaninov V., Strashnov, P., Shevtsov A. et al., / Cornell University. Серия CoRR, arXiv:2403.03726 "Computing Research Repository,". 2025.

Protein design requires a deep understanding of the inherent complexities of the protein universe. While many efforts lean towards conditional generation or focus on specific families of proteins, the foundational task of unconditional generation remains underexplored and undervalued. Here, we explore this pivotal domain, introducing DiMA, a model that leverages continuous diffusion on embeddings derived ...

Added: October 5, 2025

Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation

Shabalin A., Meshchaninov V., Vetrov D., / Series cs.CL, arXiv:2505.18853 "Computation and Language". 2025.

Diffusion models have achieved state-of-the-art performance in generating images, audio, and video, but their adaptation to text remains challenging due to its discrete nature. Prior approaches either apply Gaussian diffusion in continuous latent spaces, which inherits semantic structure but struggles with token decoding, or operate in categorical simplex space, which respect discreteness but disregard semantic ...

Added: October 5, 2025