• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • On the Influence of Layer Importance on LLM Fine-Tuning Acceleration and Quality
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
July 2, 2026
Researchers Discover How Spelling Errors Slow Down Reading in Russian
Psycholinguists from the Centre for Language and Brain at HSE University–St Petersburg have shown that words that are frequently misspelled are processed more slowly by readers, even when presented with the correct spelling. The researchers confirmed this effect for the first time using Russian-language materials and found that response speed is most strongly linked to how confidently individuals can distinguish the correct spelling of a word from an incorrect one. The study has been published in The Mental Lexicon.
July 2, 2026
HSE Develops App for Assessing Phonological Processing in Children
Researchers at the HSE Centre for Language and Brain have developed a new digital tool for assessing children's phonological processing skills—the ZARYA (Sound Analysis of the Russian Language) test battery. It is the first standardised application in Russia designed to provide a fast and reliable assessment of children's ability to distinguish speech sounds, retain them in working memory, and perform phonemic analysis. The app runs on Android tablets and smartphones and is available for download from RuStore. Details of the test validation have been published in the Journal of Speech, Language, and Hearing Research.
July 1, 2026
Scientists Discover Why Europium 'Misbehaves'
Europium is a rare-earth metal responsible for the pure red glow in displays and other luminescent materials. For a long time, however, it refused to emit light when surrounded by certain organic molecules known as acylpyrazolone ligands. Chemists have now uncovered the reason: in europium complexes with these ligands, a 'black window' appears—a charge-transfer state in which the energy absorbed by the ligand is dissipated as heat rather than emitted as light. Understanding this mechanism opens the way to designing more efficient red-emitting materials for displays, fluorescent thermometers, and chemical sensors. The results have been published in Dalton Transactions.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

On the Influence of Layer Importance on LLM Fine-Tuning Acceleration and Quality

P. 4233–4240.
Demidovskij A., Irina Novikova, Artyom Tugaryov, Vasilisa Blyudova, Olga Frolova, Ignatiev Y., Igor Salnikov, Aleksei Trutnev, Egor Zharikov

Large Language Models (LLMs) have become central advancements in artificial intelligence, particularly in machine learning, natural language processing, and computer vision. Their ability to understand and generate human-like text has made them crucial in applications ranging from automated translation to text generation. Despite the vast capabilities of pre-trained LLMs, their deployment in specialized domains often requires fine-tuning, an adaptation process constrained by high resource demands and extensive computational time. At present, the most prominent approach for fine-tuning acceleration is LoRA, which involves inserting into the model trainable low-rank adapters while freezing the rest of the parameters. However, this state-of-the-art approach is significantly limited by the requirement to manually attach adapters to each Transformer block, leading to computational overhead. Developing novel fine-tuning strategies that overcome this limitation presents significant opportunities for reducing fine-tuning time without degrading the quality. This paper addresses this challenge by introducing an innovative fine-tuning solution that dynamically assigns LoRA adapters to Transformer blocks of the model based on the importance and convergence status, significantly enhancing the efficiency of the process. Our method improves upon existing techniques, providing a considerable fine-tuning acceleration on average of 55% without quality drop compared to LoRA.

Language: English
DOI
Text on another site
Keywords: ARTIFICIAL NEURAL NETWORKSfine-tuning accelerationweight importance

In book

Frontiers in Artificial Intelligence and Applications: 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy
Vol. 413. , IOS Press Ebooks, 2025.
Similar publications
Hebb-Inspired Low Rank Adapters for Large Language Models Fine-Tuning
Alexander Demidovskij, Artyom Tugaryov, Igor Salnikov et al., , in: PRICAI 2025: Trends in Artificial Intelligence: 22nd Pacific Rim International Conference on Artificial Intelligence, PRICAI 2025, Wellington, New Zealand, November 17–21, 2025, Proceedings, Part IIIVol. 16453.: Springer, 2026. P. 603–612.
The backpropagation method is the predominant method for pre-training and fine-tuning of Large Language models. At the same time, it is considerably demanding in terms of memory and hardware. Therefore, it makes fine-tuning and pre-training very expensive, harmful for the environment due to the large carbon footprint, and raises the blocks for the development of ...
Added: April 21, 2026
Performance Study of Modern Zeroth-Order Optimization Methods for LLM Fine-Tuning
A. V. Demidovskij, A. I. Trutnev, Optical Memory and Neural Networks (Information Optics) 2025 Vol. 34 No. Suppl. 1 P. S16–S29
Large Language Models (LLMs) are widely employed across a broad range of applications due to their versatility and state-of-the-art performance. However, as usage scenarios grow, there is a pressing demand for task-specific adaptation of LLMs through fine-tuning. While full fine-tuning (FT) remains the most preferred in terms of quality, its high memory and computation requirements ...
Added: December 22, 2025
Going Beyond LoRA Fine-Tuning with Hebb Learning: Blazingly Fast and Accurate
Demidovskij A., Igor Salnikov, Olga Frolova et al., , in: Frontiers in Artificial Intelligence and Applications: 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, ItalyVol. 413.: IOS Press Ebooks, 2025. P. 2426–2433.
Modern Multimodal Large Language Models have increased demands on computational resources required for both pretraining and fine-tuning procedures. This challenge is primarily attributed to the backpropagation step because the computation of gradients is time-consuming and memory-intensive. This paper aims to alleviate the presented issues, and introduces novel fine-tuning strategy. Low-Rank Adaptation with Hebb Rapid Optimization ...
Added: October 23, 2025
Comprehensive Weight Decomposition Analysis of Modern Parameter-Efficient Methods
A.V. Demidovskij, I.G. Salnikov, A.M. Tugaryov et al., Optical Memory and Neural Networks (Information Optics) 2024 Vol. 33 No. 3 P. S513–S522
Large Language Models fine-tuning is an essential part of modern artificial intelligent systems that solve numerous tasks, such as natural language processing and computer vision. Among the various fine-tuning strategies, the most prominent approach for Large Language Model fine-tuning is Parameter-Efficient Fine-Tuning (PEFT), as it allows to achieve state-of-the-art performance on multiple tasks while minimizing ...
Added: March 12, 2025
ALOE: Boosting Large Language Model Fine-Tuning with Aggressive Loss-Based Elimination of Samples
Demidovskij A., Трутнев А. И., Тугарев А. М. et al., , in: Frontiers in Artificial Intelligence and Applications: 27th European Conference on Artificial Intelligence, 19–24 October 2024, Santiago de Compostela, SpainVol. 392.: IOS Press Ebooks, 2024. P. 3980–3986.
As modern neural network training and fine-tuning requires a lot of computational resources, there is a huge demand for novel, specialized algorithms for efficient and cost-effective training procedures. Aggressive Loss-based Elimination of Samples (ALOE) is an innovative method that operates with training samples based on losses obtained from a currently trained model or a pre-trained ...
Added: November 5, 2024
Efficient Monocular Depth Estimation for Edge Computing Platforms
Saleh S., Saleh H., Dmitry Goncharov et al., , in: 2023 International Symposium ELMAR, 11-13 September 2023, Zadar, Croatia.: IEEE, 2023. P. 23–27.
Estimating depth is necessary to understand and navigate the environment surrounding us. Over the years, many active sensors have been developed to measure depth, but they are expensive and require additional space for mounting. A cheaper alternative is estimating depth from a single RGB image taken by an ordinary monocular camera, which can be placed ...
Added: January 26, 2024
2023 International Symposium ELMAR, 11-13 September 2023, Zadar, Croatia
Saleh H., IEEE, 2023.
Estimating depth is necessary to understand and navigate the environment surrounding us. Over the years, many active sensors have been developed to measure depth, but they are expensive and require additional space for mounting. A cheaper alternative is estimating depth from a single RGB image taken by an ordinary monocular camera, which can be placed ...
Added: November 30, 2023
Contagion or interdependence? Comparing spillover indices
Islam R., Volkov V., Empirical Economics 2022 P. 1403–1455
We propose a novel risk measure that is built on comparing high-frequency time-varying volatility and low-frequency return spillover estimates. This measure permits to identify the markets that are epidemic in their complex interdependence. We conjecture that initially a highly volatile market experiences episodes of risk transmission, but only later absorbs risk and becomes an epidemic ...
Added: November 10, 2023
Using an Artificial Neural Network to Predict Coronary Microvascular Obstruction (No-Reflow Phenomenon) during Percutaneous Coronary Interventions in Patients with Myocardial Infarction
Frolov A. A., Pochinka I. G., Shakhov B. Е. et al., Sovremennye Tehnologii v Medicine 2021 Vol. 13 No. 6 P. 6–13
The aim of the study was to develop, evaluate, and validate an artificial neural network to predict coronary microvascular obstruction (CMVO) during percutaneous coronary interventions (PCI) in patients with myocardial infarctions (MI) based on the parameters, which are routinely available in an operating room when choosing a surgical approach. Materials and Methods 5621 patients with MI and emergency ...
Added: June 21, 2023
2022 International Joint Conference on Neural Networks (IJCNN)
Institute of Electrical and Electronics Engineers Inc., 2022.
Added: May 29, 2023
ПОИСК ПУТЕЙ СОВЕРШЕНСТВОВАНИЯ ЦИФРОВОГО ПРЕДСТАВЛЕНИЯ ТЕКСТИЛЬНЫХ МАТЕРИАЛОВ С ЦЕЛЬЮ ОБНАРУЖЕНИЯ ДЕФЕКТОВ
Карева Т. Ю., Мирошниченко Д. А., Голубеева Г. И. et al., Известия высших учебных заведений. Технология текстильной промышленности 2022 № 2 (398) С. 104–108
The article discusses the possibility of using neural network technologies for the automated search for defects in textile materials. A developed laboratory stand is presented, where samples are photographed to form a training sample. A number of studies have been conducted on the influence of various types of illumination of the material when obtaining an ...
Added: November 1, 2022
IOP CONFERENCE SERIES: MATERIALS SCIENCE AND ENGINEERING. 1st International Conference on Innovative Informational and Engineering Technologies (IIET-2020) 28-29 May 2020, Stavropol, Russian Federation.
Сахнюк П. А., Bristol: IOP Publishing, 2020.
The article suggests the integration of a neural network as a parallel element base in a telecommunication system. In this case, the ability to learn or adapt to external conditions is applied as the main advantage. For telecommunication systems in conditions when it is possible, this ability will improve noise immunity, reliability, operability, etc. The ...
Added: December 8, 2020
Presumptions of Semantic Representations Evolution
Kharlamov A. A., Открытые семантические технологии проектирования интеллектуальных систем, Белоруссия 2020 No. 4 P. 141–148
To date, traditional information technology and artificial intelligence technology have evolved independently of each other. Now is the time to fundamentally rethink the experience of using and evolution of traditional information technology and its integration with artificial intelligence technology. Currently, the key problem in the development of information technology in general and artificial intelligence technology ...
Added: October 29, 2020
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit