On the Influence of Layer Importance on LLM Fine-Tuning Acceleration and Quality

A. Demidovskij; Irina Novikova; Artyom Tugaryov; Vasilisa Blyudova; Olga Frolova; Ignatiev Y.; Igor Salnikov; Aleksei Trutnev; Egor Zharikov

doi:10.3233/FAIA251317

?

On the Influence of Layer Importance on LLM Fine-Tuning Acceleration and Quality

P. 4233–4240.

Demidovskij A., Irina Novikova, Artyom Tugaryov, Vasilisa Blyudova, Olga Frolova, Ignatiev Y., Igor Salnikov, Aleksei Trutnev, Egor Zharikov

Large Language Models (LLMs) have become central advancements in artificial intelligence, particularly in machine learning, natural language processing, and computer vision. Their ability to understand and generate human-like text has made them crucial in applications ranging from automated translation to text generation. Despite the vast capabilities of pre-trained LLMs, their deployment in specialized domains often requires fine-tuning, an adaptation process constrained by high resource demands and extensive computational time. At present, the most prominent approach for fine-tuning acceleration is LoRA, which involves inserting into the model trainable low-rank adapters while freezing the rest of the parameters. However, this state-of-the-art approach is significantly limited by the requirement to manually attach adapters to each Transformer block, leading to computational overhead. Developing novel fine-tuning strategies that overcome this limitation presents significant opportunities for reducing fine-tuning time without degrading the quality. This paper addresses this challenge by introducing an innovative fine-tuning solution that dynamically assigns LoRA adapters to Transformer blocks of the model based on the importance and convergence status, significantly enhancing the efficiency of the process. Our method improves upon existing techniques, providing a considerable fine-tuning acceleration on average of 55% without quality drop compared to LoRA.

Language: English

DOI

Text on another site

Keywords: ARTIFICIAL NEURAL NETWORKS fine-tuning acceleration weight importance

In book

Frontiers in Artificial Intelligence and Applications: 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy

Vol. 413. , IOS Press Ebooks, 2025.

Hebb-Inspired Low Rank Adapters for Large Language Models Fine-Tuning

Alexander Demidovskij, Artyom Tugaryov, Igor Salnikov et al., , in: PRICAI 2025: Trends in Artificial Intelligence: 22nd Pacific Rim International Conference on Artificial Intelligence, PRICAI 2025, Wellington, New Zealand, November 17–21, 2025, Proceedings, Part IIIVol. 16453.: Springer, 2026. P. 603–612.

The backpropagation method is the predominant method for pre-training and fine-tuning of Large Language models. At the same time, it is considerably demanding in terms of memory and hardware. Therefore, it makes fine-tuning and pre-training very expensive, harmful for the environment due to the large carbon footprint, and raises the blocks for the development of ...

Added: April 21, 2026

Performance Study of Modern Zeroth-Order Optimization Methods for LLM Fine-Tuning

A. V. Demidovskij, A. I. Trutnev, Optical Memory and Neural Networks (Information Optics) 2025 Vol. 34 No. Suppl. 1 P. S16–S29

Large Language Models (LLMs) are widely employed across a broad range of applications due to their versatility and state-of-the-art performance. However, as usage scenarios grow, there is a pressing demand for task-specific adaptation of LLMs through fine-tuning. While full fine-tuning (FT) remains the most preferred in terms of quality, its high memory and computation requirements ...

Added: December 22, 2025

Going Beyond LoRA Fine-Tuning with Hebb Learning: Blazingly Fast and Accurate

Demidovskij A., Igor Salnikov, Olga Frolova et al., , in: Frontiers in Artificial Intelligence and Applications: 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, ItalyVol. 413.: IOS Press Ebooks, 2025. P. 2426–2433.

Modern Multimodal Large Language Models have increased demands on computational resources required for both pretraining and fine-tuning procedures. This challenge is primarily attributed to the backpropagation step because the computation of gradients is time-consuming and memory-intensive. This paper aims to alleviate the presented issues, and introduces novel fine-tuning strategy. Low-Rank Adaptation with Hebb Rapid Optimization ...

Added: October 23, 2025

Comprehensive Weight Decomposition Analysis of Modern Parameter-Efficient Methods

A.V. Demidovskij, I.G. Salnikov, A.M. Tugaryov et al., Optical Memory and Neural Networks (Information Optics) 2024 Vol. 33 No. 3 P. S513–S522

Large Language Models fine-tuning is an essential part of modern artificial intelligent systems that solve numerous tasks, such as natural language processing and computer vision. Among the various fine-tuning strategies, the most prominent approach for Large Language Model fine-tuning is Parameter-Efficient Fine-Tuning (PEFT), as it allows to achieve state-of-the-art performance on multiple tasks while minimizing ...

Added: March 12, 2025

ALOE: Boosting Large Language Model Fine-Tuning with Aggressive Loss-Based Elimination of Samples

Demidovskij A., Трутнев А. И., Тугарев А. М. et al., , in: Frontiers in Artificial Intelligence and Applications: 27th European Conference on Artificial Intelligence, 19–24 October 2024, Santiago de Compostela, SpainVol. 392.: IOS Press Ebooks, 2024. P. 3980–3986.

As modern neural network training and fine-tuning requires a lot of computational resources, there is a huge demand for novel, specialized algorithms for efficient and cost-effective training procedures. Aggressive Loss-based Elimination of Samples (ALOE) is an innovative method that operates with training samples based on losses obtained from a currently trained model or a pre-trained ...

Added: November 5, 2024

Efficient Monocular Depth Estimation for Edge Computing Platforms

Saleh S., Saleh H., Dmitry Goncharov et al., , in: 2023 International Symposium ELMAR, 11-13 September 2023, Zadar, Croatia.: IEEE, 2023. P. 23–27.

Estimating depth is necessary to understand and navigate the environment surrounding us. Over the years, many active sensors have been developed to measure depth, but they are expensive and require additional space for mounting. A cheaper alternative is estimating depth from a single RGB image taken by an ordinary monocular camera, which can be placed ...

Added: January 26, 2024

2023 International Symposium ELMAR, 11-13 September 2023, Zadar, Croatia

Saleh H., IEEE, 2023.

Added: November 30, 2023

Contagion or interdependence? Comparing spillover indices

Islam R., Volkov V., Empirical Economics 2022 P. 1403–1455

We propose a novel risk measure that is built on comparing high-frequency time-varying volatility and low-frequency return spillover estimates. This measure permits to identify the markets that are epidemic in their complex interdependence. We conjecture that initially a highly volatile market experiences episodes of risk transmission, but only later absorbs risk and becomes an epidemic ...

Added: November 10, 2023

Using an Artificial Neural Network to Predict Coronary Microvascular Obstruction (No-Reflow Phenomenon) during Percutaneous Coronary Interventions in Patients with Myocardial Infarction

Frolov A. A., Pochinka I. G., Shakhov B. Е. et al., Sovremennye Tehnologii v Medicine 2021 Vol. 13 No. 6 P. 6–13

The aim of the study was to develop, evaluate, and validate an artificial neural network to predict coronary microvascular obstruction (CMVO) during percutaneous coronary interventions (PCI) in patients with myocardial infarctions (MI) based on the parameters, which are routinely available in an operating room when choosing a surgical approach. Materials and Methods 5621 patients with MI and emergency ...

Added: June 21, 2023

2022 International Joint Conference on Neural Networks (IJCNN)

Institute of Electrical and Electronics Engineers Inc., 2022.

Added: May 29, 2023

ПОИСК ПУТЕЙ СОВЕРШЕНСТВОВАНИЯ ЦИФРОВОГО ПРЕДСТАВЛЕНИЯ ТЕКСТИЛЬНЫХ МАТЕРИАЛОВ С ЦЕЛЬЮ ОБНАРУЖЕНИЯ ДЕФЕКТОВ

Карева Т. Ю., Мирошниченко Д. А., Голубеева Г. И. et al., Известия высших учебных заведений. Технология текстильной промышленности 2022 № 2 (398) С. 104–108

The article discusses the possibility of using neural network technologies for the automated search for defects in textile materials. A developed laboratory stand is presented, where samples are photographed to form a training sample. A number of studies have been conducted on the influence of various types of illumination of the material when obtaining an ...

Added: November 1, 2022

IOP CONFERENCE SERIES: MATERIALS SCIENCE AND ENGINEERING. 1st International Conference on Innovative Informational and Engineering Technologies (IIET-2020) 28-29 May 2020, Stavropol, Russian Federation.

Сахнюк П. А., Bristol: IOP Publishing, 2020.

The article suggests the integration of a neural network as a parallel element base in a telecommunication system. In this case, the ability to learn or adapt to external conditions is applied as the main advantage. For telecommunication systems in conditions when it is possible, this ability will improve noise immunity, reliability, operability, etc. The ...

Added: December 8, 2020

Presumptions of Semantic Representations Evolution

Kharlamov A. A., Открытые семантические технологии проектирования интеллектуальных систем, Белоруссия 2020 No. 4 P. 141–148

To date, traditional information technology and artiﬁcial intelligence technology have evolved independently of each other. Now is the time to fundamentally rethink the experience of using and evolution of traditional information technology and its integration with artiﬁcial intelligence technology. Currently, the key problem in the development of information technology in general and artiﬁcial intelligence technology ...

Added: October 29, 2020