Trade-off between memory usage and optimization speed in a simple machine learning task

Proceedings of 2025 9th Scientific School Dynamics of Complex Networks and their Applications (DCNA), Publisher IEEE, Electronic ISSN: 2770-744X Print on Demand(PoD) ISSN: 2770-7431. 2025. P. 42–44.

Горюнов О. А., Соловьев И. А., Klinshov V.

In modern machine learning, the efficiency of training is heavily influenced by the choice of optimization algorithm, with preconditioned methods offering significant speed improvements at the cost of increased memory usage. This study explores the trade-off between memory consumption and optimization performance by generalizing the Shampoo preconditioning algorithm to support arbitrarily sized preconditioning matrices, rather than being limited by tensor dimensions. We evaluate this approach on a perceptron trained for a simple regression task, measuring how different preconditioner sizes affect the training speed. Our results reveal a strong positive correlation: larger preconditioners consistently lead to faster learning, although the performance gains scale sublinearly with memory. Our findings provide practical insights into balancing computational resources and training efficiency in adaptive optimization.
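For orientation, the following is a minimal sketch of the standard Shampoo update for a single weight matrix, as it is commonly described in the literature. It is not the generalized variant studied in this paper, whose details the abstract does not give. The function names, the learning rate, the damping term `eps`, and the toy regression data are all illustrative assumptions. The helper `preconditioner_entries` only counts stored preconditioner entries, which is the memory side of the trade-off discussed above.

```python
import numpy as np

def inverse_root(mat, p, eps=1e-6):
    """(mat + eps*I)^(-1/p) for a symmetric PSD matrix, via eigendecomposition."""
    vals, vecs = np.linalg.eigh(mat + eps * np.eye(mat.shape[0]))
    return (vecs * vals ** (-1.0 / p)) @ vecs.T

def shampoo_step(W, G, L, R, lr=0.1):
    """One standard Shampoo update for an (m, n) weight matrix W with gradient G."""
    L = L + G @ G.T            # left statistics, shape (m, m)
    R = R + G.T @ G            # right statistics, shape (n, n)
    # Precondition the gradient on both sides with inverse fourth roots.
    W = W - lr * inverse_root(L, 4) @ G @ inverse_root(R, 4)
    return W, L, R

def preconditioner_entries(m, n):
    """Stored preconditioner entries for an (m, n) weight matrix."""
    return {
        "diagonal": m * n,            # AdaGrad-style, one entry per weight
        "shampoo": m * m + n * n,     # one matrix per tensor dimension
        "full_matrix": (m * n) ** 2,  # full-matrix preconditioner
    }

def loss_and_grad(W, X, Y):
    """Mean squared-error loss of the linear map x -> W x, and its gradient in W."""
    err = X @ W.T - Y                           # shape (N, m)
    loss = 0.5 * np.mean(np.sum(err ** 2, axis=1))
    grad = err.T @ X / len(X)                   # shape (m, n)
    return loss, grad

# Toy regression problem (hypothetical data, chosen only for illustration).
rng = np.random.default_rng(0)
X = rng.standard_normal((64, 3))
W_true = np.array([[1.0, -2.0, 0.5], [0.3, 0.8, -1.0]])
Y = X @ W_true.T

W = np.zeros((2, 3))
L = np.zeros((2, 2))
R = np.zeros((3, 3))
initial_loss, _ = loss_and_grad(W, X, Y)
for _ in range(20):
    _, G = loss_and_grad(W, X, Y)
    W, L, R = shampoo_step(W, G, L, R)
final_loss, _ = loss_and_grad(W, X, Y)
```

In this standard form the preconditioner sizes are fixed by the tensor dimensions (m and n). That is the restriction the abstract says the generalized algorithm removes.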

Research target: Computer Science, Mathematics

Publication based on the results of:

Machine learning and nonlinear dynamics: intersection, interaction, and synthesis (2027)

Upper bounds for Steklov eigenvalues of a hypersurface of revolution

Denis Seliutskii, Russian Journal of Mathematical Physics 2025 Vol. 32 No. 2 P. 399–407

In this paper, we find an upper bound for the first Steklov eigenvalue for a surface of revolution with boundary consisting of two spheres of different radii. Moreover, we prove that, in some cases, this bound is sharp. ...

Added: May 19, 2026

ML-based Fast Simulation of FARICH Responses

Shipilov F., Barnyakov A., Ivanov A. et al., / Series Physics "arxiv.org". 2026.

A fast simulation of the detector response is a vital task in high-energy physics (HEP). Traditional Monte-Carlo methods form the backbone of modern particle physics simulation software but are computationally expensive. We present a machine-learning-based approach to fast simulation of the Focusing Aerogel Ring Imaging Cherenkov (FARICH) detector response. Given a particle track and momentum, ...

Added: May 19, 2026

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations)

Association for Computational Linguistics, 2026.

Added: May 19, 2026

Dataset of solubility values for organic compounds in binary mixtures of solvents at various temperatures

Bezzubov S., Malikov D., Krasnov L. et al., Scientific data 2026 Vol. 13 Article 727

Solubility is a crucial property of organic compounds, impacting their potential applications in synthetic chemistry, materials science and drug design. Moreover, in technological processes mixtures of solvents are often utilized, making the solubility assessment more complicated. Predicting solubility values in mixtures of solvents from a molecular structure can help to address this issue, although a ...

Added: May 19, 2026

Aerokinesis: An IoT-Based Vision-Driven Gesture Control System for Quadcopter Navigation Using Deep Learning and ROS2

Pikalov V., Meshcheryakov V., Kondratev S. et al., Technologies 2026 Vol. 14 No. 1 P. 1–27

This paper presents Aerokinesis, an IoT-based software–hardware system for intuitive gesture-driven control of quadcopter unmanned aerial vehicles (UAVs), developed within the Robot Operating System 2 (ROS2) framework. The proposed system addresses the challenge of providing an accessible human–drone interaction interface for operators in scenarios where traditional remote controllers are impractical or unavailable. The architecture comprises ...

Added: May 19, 2026

Aerokinesis: An IoT-Based Vision-Driven Gesture Control System for Quadcopter Navigation Using Deep Learning and ROS2

Кондратьев С., Никитин Г. Э., Дырченкова Ю. А. et al., Technologies 2026 Vol. 14 No. 1 P. 1–27

Added: May 19, 2026

On smooth Fano threefolds with coregularity zero

Жакупов О. Б., European Journal of Mathematics 2025 Vol. 11 Article 84

We provide examples of smooth three-dimensional Fano complete intersections of degree 2, 4, 6, and 8 that have absolute coregularity 0. Considering the main theorem of Avilov, Loginov, and Przyjalkowski (CNTP 18:506–577, 2024) on the remaining 101 families of smooth Fano threefolds, our result implies that each family of smooth Fano threefolds has an element of absolute ...

Added: May 18, 2026

Parallel Computational Technologies. PCT 2025

Springer, 2025.

This book constitutes the refereed proceedings of the 19th International Conference on Parallel Computational Technologies, PCT 2025, held in Moscow, Russia, during April 8–10, 2025. The 31 full papers included in this volume were carefully reviewed and selected from 122 submissions. These papers were organized under the following topical sections: High Performance Architectures, Tools and Technologies; ...

Added: May 18, 2026

KMHCR: A Key-Controlled Signal-Domain Transformation for 5G IoT Security

Ronglin Z., Wei L., Jiahong C. et al., Journal of Signal Processing Systems 2026 Vol. 98 P. 1–15

To address the need for lightweight and low-latency protection in massive resource-constrained 5G Internet of Things (IoT) systems, this paper proposes Key-Controlled Modulation Hopping and Constellation Rotation (KMHCR). KMHCR is designed as a physical-layer confidentiality-enhancement mechanism that avoids bit-wise full-payload encryption in the protection pipeline. It uses a shared key derived from channel-reciprocity secret key ...

Added: May 16, 2026

DPN Verifier: A Toolkit for Faster Soundness Verification and Repair of Process Models with Data

Suvorov N. M., Proceedings of the Institute for System Programming of the RAS 2026 Vol. 38 No. 3(2) P. 49–66

Data Petri Nets (DPNs) extend classical Petri nets to model processes where data directly influences control flow, enabling a comprehensive view of system behavior and making it possible to detect failure points that could otherwise be hidden. Soundness is a correctness criterion that captures such failure points as deadlocks and livelocks as well as model boundedness and absence ...

Added: May 16, 2026

2-Elliptic Periodic Orbits near a Nonsimple Homoclinic Tangency in Four-Dimensional Symplectic Maps

Lerman L. M., Turaev D. V., Regular and Chaotic Dynamics 2026 Vol. 31 No. 3 P. 349–369

We show that bifurcations of four-dimensional symplectic diffeomorphisms with a quadratic homoclinic tangency to a saddle periodic orbit with real multipliers produce 2-elliptic periodic orbits if the tangency is not partially hyperbolic. We show that a normal form for the rescaled first-return maps near such tangency is given by a four-dimensional symplectic Hénon-like map and study bifurcations of the ...

Added: May 15, 2026

Bibliometric Analysis by Network Models

Aleskerov F. T., Yakuba V. I., Khutorskaya O. et al., Springer, 2026.

The book contains new models of bibliometric analysis based on centrality measures in network analysis, pattern analysis and stability analysis. A distinctive feature of these centrality measures is that they account for the parameters of vertices and group influence of vertices to a vertex. This reveals specific groups of publications, authors, terms, journals and affiliations ...

Added: May 15, 2026

Neural-network maps for two-parameter modeling of bistability and codimension-two bifurcations in two-dimensional flow dynamical systems

Kuptsov P., Panyushev A., Stankevich N., Chaos 2026 Vol. 36 No. 5 Article 053138

We develop a machine-learning approach to reproduce the behavior of two versions of the van der Pol oscillator exhibiting a subcritical Andronov–Hopf bifurcation, with or without a codimension-2 Bautin point. We construct a neural-network model that functions as a recurrent map and train it on short segments of oscillator trajectories. The results show that, ...

Added: May 15, 2026

Bifurcations and Structural Stability of Generic PC-HC Families

Dorovskiy A., / Series arXiv "math". 2026.

In this paper the structural stability of generic families of vector fields of the PC-HC class on the two-dimensional sphere is proved. A classification of these families up to moderate equivalence in neighborhoods of their large bifurcation supports is presented, based on such invariants as the configuration and the characteristic set. The realization lemma is proved. ...

Added: May 14, 2026

The Sobolev space W_2^{1/2}: Simultaneous improvement of functions by a homeomorphism of the circle

Lebedev V., Journal of Mathematical Analysis and Applications 2026 Vol. 563 No. 2 Article 130787

It is known that for every continuous real-valued function $f$ on the circle $\mathbb T=\mathbb R/2\pi\mathbb Z$ there exists a change of variable, i.e., a self-homeomorphism $h$ of $\mathbb T$, such that the superposition $f\circ h$ is in the Sobolev space $W_2^{1/2}(\mathbb T)$. We obtain new results on simultaneous improvement of functions by a single change of variable in relation ...

Added: May 14, 2026

ICC 2021-IEEE International Conference on Communications

Белоголовый А. В., IEEE, 2021.

In this paper, we present a gradient descent and Metropolis-Hastings (MH) principle based low-complexity, highly parallelizable, near-optimal MIMO demodulation framework. The optimal maximum-likelihood (ML) demodulation is a discrete optimization problem and is computationally infeasible for large MIMO dimensions. Many continuous-relaxed versions of ML, such as least-squares (LS) demodulation, have closed-form solutions but are highly suboptimal to ML. However, the continuous surface ...

Added: April 7, 2026

Different types of multistability in the Chialvo map

Panyushev A., Посненкова О. М., Stankevich N., Proceedings of 2024 8th Scientific School Dynamics of Complex Networks and their Applications (DCNA), Publisher IEEE 2024 P. 181–183

We study different types of multistability in a discrete-time neuron model, the Chialvo map. Multistability of the first type corresponds to the case when resonant and non-resonant invariant curves coexist in phase space. For other parameters of the model, we have found coexistence of an invariant curve and chaos. We analyze mechanisms of ...

Added: December 1, 2024

Gradient descent clustering with regularization to recover communities in transformed attributed networks

Shalileh S., Social Network Analysis and Mining 2025 Vol. 15212 P. 137–148

Community detection in attributed networks aims to recover clusters in which the within-community nodes are as interconnected and as homogeneous as possible, while the between-communities nodes are as disconnected and as heterogeneous as possible. The current research proposes a straightforward data-driven model with an integrated regularization term to recover communities. For further improvement of the ...

Added: November 30, 2024

Least fractional order memristor nonlinearity to exhibits chaos in a hidden hyperchaotic system

S. Sabarathinam, Aravinthan D., Viktor Papov et al., Fractional Calculus and Applied Analysis 2024 Vol. 27 P. 2502–2520

In this article, we present least fractional nonlinearity for exhibiting chaos in a memristor-based hyper-chaotic multi-stable hidden system. When implementing memristor-based systems, distinct dimensions/order define the memristor nonlinearity. In this work, the memristor dimension has been changed fractionally to identify the lowest order of nonlinearity required to induce chaos in a proposed system. The two-parameter ...

Added: August 6, 2024

Spin Chaos of Exciton Polaritons in a Magnetic Field

S. S. Gavrilov, N. N. Ipatov, V. D. Kulakovskii, JETP Letters 2023 Vol. 118 No. 9 P. 637–643

The spin properties of exciton polaritons in a micropillar cavity placed in a static magnetic field and excited by a resonant light wave are studied theoretically. Owing to the Zeeman effect, a nonlinear polariton system has two branches of optical response that are characterized by opposite circular polarizations. An indirect mechanism of polarization reversal is ...

Added: March 15, 2024

Switching thresholds for multistable systems under strong external perturbation

Klinshov V., Некоркин В. И., Communications in Nonlinear Science and Numerical Simulation 2020 Vol. 83 P. 105067

Multistability is a common feature of various dynamical systems which manifests itself as the possibility to demonstrate various behaviors for the same parameter values. These behaviors or states of the system must be stable against weak perturbation in order to be observable in real life. However, a strong enough perturbation may destroy a certain state ...

Added: March 16, 2022

Dynamics of the Shapovalov mid-size firm model

Tatyana A. Alexeeva, Barnett W., Kuznetsov N. et al., Chaos, Solitons and Fractals 2020 Vol. 140 Article 110239

Forecasting and analyses of the dynamics of financial and economic processes such as deviations of macroeconomic aggregates (GDP, unemployment, and inflation) from their long-term trends, asset markets volatility, etc., are challenging because of the complexity of these processes. Important related research questions include, first, how to determine the qualitative properties of the dynamics of these ...

Added: October 21, 2020

Synchronous oscillations and symmetry breaking in a model of two interacting ultrasound contrast agents

Garashchuk I., Kazakov A., Sinelshchikov D., Nonlinear Dynamics 2020 Vol. 101 P. 1199–1213

We study nonlinear dynamics in a system of two coupled oscillators, describing the motion of two interacting microbubble contrast agents. In the case of identical bubbles, the corresponding symmetry of the governing system of equations leads to the possibility of existence of asymptotically stable synchronous oscillations. However, it may be difficult to create absolutely identical ...

Added: September 10, 2020

Hidden Attractors in a Model of a Bubble Contrast Agent Oscillating Near an Elastic Wall

Garashchuk I., Sinelshchikov D., Kudryashov N. A., European Journal of Physics: Web of Conference 2018 Vol. 173 P. 06006-1–06006-4

A model describing the dynamics of a spherical gas bubble in a compressible viscous liquid is studied. The bubble is oscillating close to an elastic wall of finite thickness under the influence of an external pressure field which simulates a contrast agent oscillating close to a blood vessel wall. Here we investigate numerically the coexistence ...

Added: December 16, 2019