Trade-off between memory usage and optimization speed in a simple machine learning task

Proceedings of the 2025 9th Scientific School Dynamics of Complex Networks and their Applications (DCNA). IEEE, 2025. P. 42–44. Electronic ISSN: 2770-744X; Print on Demand (PoD) ISSN: 2770-7431.
Горюнов О. А., Соловьев И. А., Klinshov V.

In modern machine learning, the efficiency of training is heavily influenced by the choice of optimization algorithm, with preconditioned methods offering significant speed improvements at the cost of increased memory usage. This study explores the trade-off between memory consumption and optimization performance by generalizing the Shampoo preconditioning algorithm to support arbitrarily sized preconditioning matrices, rather than being limited by tensor dimensions. We evaluate this approach on a perceptron trained for a simple regression task, measuring how different preconditioner sizes affect the training speed. Our results reveal a strong positive correlation: larger preconditioners consistently lead to faster learning, although the performance gains scale sublinearly with memory. Our findings provide practical insights into balancing computational resources and training efficiency in adaptive optimization.
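The memory side of this trade-off can be illustrated with a small counting sketch. It is not the authors' implementation. It relies only on the standard storage costs for an m x n weight matrix: a diagonal preconditioner keeps one scalar per parameter, standard Shampoo keeps a left factor of size m x m and a right factor of size n x n, and a full-matrix preconditioner keeps one (mn) x (mn) matrix. The function `reshaped_shampoo_memory` is a hypothetical illustration of how preconditioner size could be decoupled from tensor shape, by viewing the same parameters as a p x q matrix. The abstract does not describe the generalization in detail, so that part is an assumption.

```python
def preconditioner_memory(m: int, n: int) -> dict:
    """Count stored preconditioner entries for an m x n weight matrix."""
    return {
        "diagonal": m * n,            # one scalar per parameter
        "shampoo": m * m + n * n,     # left factor (m x m) plus right factor (n x n)
        "full": (m * n) ** 2,         # a single (mn) x (mn) matrix
    }


def reshaped_shampoo_memory(num_params: int, p: int) -> int:
    """Hypothetical variant: view the parameters as a p x q matrix.

    Assumption for illustration only: q = num_params / p, and the two
    Shampoo-style factors then have sizes p x p and q x q.
    """
    if p <= 0 or num_params % p != 0:
        raise ValueError("p must be a positive divisor of num_params")
    q = num_params // p
    return p * p + q * q


if __name__ == "__main__":
    print(preconditioner_memory(4, 6))
    # Sweep every admissible shape for 24 parameters.
    for p in (1, 2, 3, 4):
        print(p, reshaped_shampoo_memory(24, p))
```

Under this assumed reshaping, more elongated shapes (small p) approach the full-matrix cost, while near-square shapes need the least memory, which gives a single knob for trading memory against preconditioner size.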

Research target: Computer Science, Mathematics
Language: English
Keywords: multistability, gradient descent, preconditioned optimization, performance trade-off
Publication based on the results of:
Machine learning and nonlinear dynamics: intersection, interaction, and synthesis (2027)
Similar publications
Upper bounds for Steklov eigenvalues of a hypersurface of revolution
Denis Seliutskii, Russian Journal of Mathematical Physics 2025 Vol. 32 No. 2 P. 399–407
In this paper, we find an upper bound for the first Steklov eigenvalue for a surface of revolution with boundary consisting of two spheres of different radii. Moreover, we prove that, in some cases, this boundary is sharp. ...
Added: May 19, 2026
ML-based Fast Simulation of FARICH Responses
Shipilov F., Barnyakov A., Ivanov A. et al., / Series Physics "arxiv.org". 2026.
A fast simulation of the detector response is a vital task in high-energy physics (HEP). Traditional Monte-Carlo methods form the backbone of modern particle physics simulation software but are computationally expensive. We present a machine-learning-based approach to fast simulation of the Focusing Aerogel Ring Imaging Cherenkov (FARICH) detector response. Given a particle track and momentum, ...
Added: May 19, 2026
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Association for Computational Linguistics, 2026.
Added: May 19, 2026
Dataset of solubility values for organic compounds in binary mixtures of solvents at various temperatures
Bezzubov S., Malikov D., Krasnov L. et al., Scientific data 2026 Vol. 13 Article 727
Solubility is a crucial property of organic compounds, impacting their potential applications in synthetic chemistry, materials science and drug design. Moreover, in technological processes mixtures of solvents are often utilized, making the solubility assessment more complicated. Predicting solubility values in mixtures of solvents from a molecular structure can help to address this issue, although a ...
Added: May 19, 2026
Aerokinesis: An IoT-Based Vision-Driven Gesture Control System for Quadcopter Navigation Using Deep Learning and ROS2
Pikalov V., Meshcheryakov V., Kondratev S. et al., Technologies 2026 Vol. 14 No. 1 P. 1–27
This paper presents Aerokinesis, an IoT-based software–hardware system for intuitive gesture-driven control of quadcopter unmanned aerial vehicles (UAVs), developed within the Robot Operating System 2 (ROS2) framework. The proposed system addresses the challenge of providing an accessible human–drone interaction interface for operators in scenarios where traditional remote controllers are impractical or unavailable. The architecture comprises ...
Added: May 19, 2026
Aerokinesis: An IoT-Based Vision-Driven Gesture Control System for Quadcopter Navigation Using Deep Learning and ROS2
Кондратьев С., Никитин Г. Э., Дырченкова Ю. А. et al., Technologies 2026 Vol. 14 No. 1 P. 1–27
This paper presents Aerokinesis, an IoT-based software–hardware system for intuitive gesture-driven control of quadcopter unmanned aerial vehicles (UAVs), developed within the Robot Operating System 2 (ROS2) framework. The proposed system addresses the challenge of providing an accessible human–drone interaction interface for operators in scenarios where traditional remote controllers are impractical or unavailable. The architecture comprises ...
Added: May 19, 2026
On smooth Fano threefolds with coregularity zero
Жакупов О. Б., European Journal of Mathematics 2025 Vol. 11 Article 84
We provide examples of smooth three-dimensional Fano complete intersections of degree 2, 4, 6, and 8 that have absolute coregularity 0. Considering the main theorem of Avilov, Loginov, and Przyjalkowski (CNTP 18:506–577, 2024) on the remaining 101 families of smooth Fano threefolds, our result implies that each family of smooth Fano threefolds has an element of absolute ...
Added: May 18, 2026
Parallel Computational Technologies. PCT 2025
Springer, 2025.
This book constitutes the refereed proceedings of the 19th International Conference on Parallel Computational Technologies, PCT 2025, held in Moscow, Russia, during April 8–10, 2025. The 31 full papers included in this volume were carefully reviewed and selected from 122 submissions. These papers were organized under the following topical sections: High Performance Architectures, Tools and Technologies; ...
Added: May 18, 2026
KMHCR: A Key-Controlled Signal-Domain Transformation for 5G IoT Security
Ronglin Z., Wei L., Jiahong C. et al., Journal of Signal Processing Systems 2026 Vol. 98 P. 1–15
To address the need for lightweight and low-latency protection in massive resource-constrained 5G Internet of Things (IoT) systems, this paper proposes Key-Controlled Modulation Hopping and Constellation Rotation (KMHCR). KMHCR is designed as a physical-layer confidentiality-enhancement mechanism that avoids bit-wise full-payload encryption in the protection pipeline. It uses a shared key derived from channel-reciprocity secret key ...
Added: May 16, 2026
DPN Verifier: A Toolkit for Faster Soundness Verification and Repair of Process Models with Data
Suvorov N. M., Proceedings of the Institute for System Programming of the RAS 2026 Vol. 38 No. 3(2) P. 49–66
Data Petri Nets (DPNs) extend classical Petri nets to model processes where data directly influences control-flow, enabling a comprehensive view of system behavior and possibility to detect failure points that could otherwise be hidden. Soundness is a correctness criterion that captures such failure points as deadlocks and livelocks as well as model boundedness and absence ...
Added: May 16, 2026
2-Elliptic Periodic Orbits near a Nonsimple Homoclinic Tangency in Four-Dimensional Symplectic Maps
Lerman L. M., Turaev D. V., Regular and Chaotic Dynamics 2026 Vol. 31 No. 3 P. 349–369
We show that bifurcations of four-dimensional symplectic diffeomorphisms with a quadratic homoclinic tangency to a saddle periodic orbit with real multipliers produce 2-elliptic periodic orbits if the tangency is not partially hyperbolic. We show that a normal form for the rescaled first-return maps near such tangency is given by a four-dimensional symplectic Hénon-like map and study bifurcations of the ...
Added: May 15, 2026
Bibliometric Analysis by Network Models
Aleskerov F. T., Yakuba V. I., Khutorskaya O. et al., Springer, 2026.
The book contains new models of bibliometric analysis based on centrality measures in network analysis, pattern analysis and stability analysis. A distinctive feature of these centrality measures is that they account for the parameters of vertices and group influence of vertices to a vertex. This reveals specific groups of publications, authors, terms, journals and affiliations ...
Added: May 15, 2026
Neural-network maps for two-parameter modeling of bistability and codimension-two bifurcations in two-dimensional flow dynamical systems
Kuptsov P., Panyushev A., Stankevich N., Chaos 2026 Vol. 36 No. 5 Article 053138
We develop a machine-learning approach to reproduce the behavior of two versions of the van der Pol oscillator exhibiting a subcritical Andronov–Hopf bifurcation, with or without a codimension-2 Bautin point. We construct a neural-network model that functions as a recurrent map and train it on short segments of oscillator trajectories. The results show that, ...
Added: May 15, 2026
Bifurcations and Structural Stability of Generic PC-HC Families
Dorovskiy A., / Series arXiv "math". 2026.
In this paper the structural stability of generic families of vector fields of the PC-HC class on the two-dimensional sphere is proved. A classification of these families up to moderate equivalence in neighborhoods of their large bifurcation supports is presented, based on such invariants as the configuration and the characteristic set. The realization lemma is proved. ...
Added: May 14, 2026
The Sobolev space W_2^{1/2}: Simultaneous improvement of functions by a homeomorphism of the circle
Lebedev V., Journal of Mathematical Analysis and Applications 2026 Vol. 563 No. 2 Article 130787
It is known that for every continuous real-valued function $f$ on the circle $\mathbb T=\mathbb R/2\pi\mathbb Z$ there exists a change of variable, i.e., a self-homeomorphism $h$ of $\mathbb T$, such that the superposition $f\circ h$ is in the Sobolev space $W_2^{1/2}(\mathbb T)$. We obtain new results on simultaneous improvement of functions by a single change of variable in relation ...
Added: May 14, 2026
ICC 2021-IEEE International Conference on Communications
Белоголовый А. В., IEEE, 2021.
In this paper, we present a gradient descent and Metropolis-Hastings (MH) principle based low-complexity, highly parallelizable, near-optimal MIMO demodulation framework. The optimal maximum-likelihood (ML) demodulation is a discrete optimization problem and is computationally infeasible for large MIMO dimensions. Many continuous-relaxed versions of ML, such as least-squares (LS) demodulation, have closed-form solutions but are highly suboptimal to ML. However, the continuous surface ...
Added: April 7, 2026
Different types of multistability in the Chialvo map
Panyushev A., Посненкова О. М., Stankevich N., Proceedings of 2024 8th Scientific School Dynamics of Complex Networks and their Applications (DCNA), Publisher IEEE 2024 P. 181–183
We study different types of multistability in a discrete-time neuron model, the Chialvo map. Multistability of the first type corresponds to the case when resonant and non-resonant invariant curves coexist in phase space. For other parameters of the model we have found coexistence between an invariant curve and chaos. We analyze mechanisms of ...
Added: December 1, 2024
Gradient descent clustering with regularization to recover communities in transformed attributed networks
Shalileh S., Social Network Analysis and Mining 2025 Vol. 15212 P. 137–148
Community detection in attributed networks aims to recover clusters in which the within-community nodes are as interconnected and as homogeneous as possible, while the between-communities nodes are as disconnected and as heterogeneous as possible. The current research proposes a straightforward data-driven model with an integrated regularization term to recover communities. For further improvement of the ...
Added: November 30, 2024
Least fractional order memristor nonlinearity to exhibits chaos in a hidden hyperchaotic system
S. Sabarathinam, Aravinthan D., Viktor Papov et al., Fractional Calculus and Applied Analysis 2024 Vol. 27 P. 2502–2520
In this article, we present least fractional nonlinearity for exhibiting chaos in a memristor-based hyper-chaotic multi-stable hidden system. When implementing memristor-based systems, distinct dimensions/order define the memristor nonlinearity. In this work, the memristor dimension has been changed fractionally to identify the lowest order of nonlinearity required to induce chaos in a proposed system. The two-parameter ...
Added: August 6, 2024
Spin Chaos of Exciton Polaritons in a Magnetic Field
S. S. Gavrilov, N. N. Ipatov, V. D. Kulakovskii, JETP Letters 2023 Vol. 118 No. 9 P. 637–643
The spin properties of exciton polaritons in a micropillar cavity placed in a static magnetic field and excited by a resonant light wave are studied theoretically. Owing to the Zeeman effect, a nonlinear polariton system has two branches of optical response that are characterized by opposite circular polarizations. An indirect mechanism of polarization reversal is ...
Added: March 15, 2024
Switching thresholds for multistable systems under strong external perturbation
Klinshov V., Некоркин В. И., Communications in Nonlinear Science and Numerical Simulation 2020 Vol. 83 P. 105067
Multistability is a common feature of various dynamical systems which manifests itself as the possibility to demonstrate various behaviors for the same parameter values. These behaviors or states of the system must be stable against weak perturbation in order to be observable in real life. However, a strong enough perturbation may destroy a certain state ...
Added: March 16, 2022
Dynamics of the Shapovalov mid-size firm model
Tatyana A. Alexeeva, Barnett W., Kuznetsov N. et al., Chaos, Solitons and Fractals 2020 Vol. 140 Article 110239
Forecasting and analyses of the dynamics of financial and economic processes such as deviations of macroeconomic aggregates (GDP, unemployment, and inflation) from their long-term trends, asset markets volatility, etc., are challenging because of the complexity of these processes. Important related research questions include, first, how to determine the qualitative properties of the dynamics of these ...
Added: October 21, 2020
Synchronous oscillations and symmetry breaking in a model of two interacting ultrasound contrast agents
Garashchuk I., Kazakov A., Sinelshchikov D., Nonlinear Dynamics 2020 Vol. 101 P. 1199–1213
We study nonlinear dynamics in a system of two coupled oscillators, describing the motion of two interacting microbubble contrast agents. In the case of identical bubbles, the corresponding symmetry of the governing system of equations leads to the possibility of existence of asymptotically stable synchronous oscillations. However, it may be difficult to create absolutely identical ...
Added: September 10, 2020
Hidden Attractors in a Model of a Bubble Contrast Agent Oscillating Near an Elastic Wall
Garashchuk I., Sinelshchikov D., Kudryashov N. A., European Journal of Physics: Web of Conference 2018 Vol. 173 P. 06006-1–06006-4
A model describing the dynamics of a spherical gas bubble in a compressible viscous liquid is studied. The bubble is oscillating close to an elastic wall of finite thickness under the influence of an external pressure field which simulates a contrast agent oscillating close to a blood vessel wall. Here we investigate numerically the coexistence ...
Added: December 16, 2019