• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Применение методов машинного обучения для классификации контента коррупционной тематики в русскоязычных и англоязычных Интернет-СМИ
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
June 5, 2026
Neural Network Maps as a Method for Constructing Mathematical Models
Scientists from HSE University–Nizhny Novgorod and the Institute of Physics Belgrade, Serbia, are jointly exploring the application of machine learning techniques and neural networks to the study of nonlinear dynamics. Natalya Stankevich, Leading Research Fellow at the Laboratory of Topological Methods in Dynamics of the Faculty of Informatics, Mathematics, and Computer Science at HSE University–Nizhny Novgorod, spoke to the HSE News Service about this international project.
June 5, 2026
‘In the Age of Technology, It Is Interesting to Look into the Past and Think about What We Can Take from It
Polina Tabakova decided to apply for a Philology degree at HSE in Nizhny Novgorod because she grew up in Mari El and did not want to move far away from the Russian forests. In an interview for the Young Scientists of HSE University project, she spoke about the genre of the campus novel, the existential drama of Kolobok, and a blackout version of Eugene Onegin.
June 5, 2026
HSE Scientists Develop Method to Compress Large Language Models Without Losing Quality
Researchers from the AI and Digital Science Institute at the HSE Faculty of Computer Science have developed a new compression method for large language models such as GPT and LLaMA that reduces their size by 25–36% without additional training or significant loss of accuracy. This is the first approach to use mathematical transformations—specifically, rotations of model weights—to make models more amenable to compression with structured matrices. The study results have been published in ACL Findings 2025. The code is available on GitHub.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Применение методов машинного обучения для классификации контента коррупционной тематики в русскоязычных и англоязычных Интернет-СМИ

Социология: методология, методы, математическое моделирование. 2021. № 52. С. 131–157.
Artemova E., Maksimenko A., Охрименко Д. А.

The paper attempts to classify the corruption-related media content of Russianlanguage and English-language Internet media using machine learning methods. The methodological approach proposed in the article is very relevant and promising, since, according to our earlier data, corruption monitoring mechanisms used in foreign publications based on the use of advanced information technologies have rather limited potential effectiveness and are not always adequately interpreted. The study shows the principles and grounds for identifying identification parameters, and also describes in detail the layout scheme of the collected news array. In the course of automatic text processing, which took place in 2 stages (vectorization of the text and the use of a learning model), it was possible to solve the main 4 tasks: highlighting a significant quote from a news article to identify a text on corruption topics, predicting the type of news message, predicting a relevant article of the Criminal Code of the Russian Federation, which is used to determine responsibility for the described corruption offense, as well as predicting the type of relationship in corruption offenses. The results obtained showed that modern methods of automatic text processing successfully cope with the tasks of identification and classification of corruption-related content in both Russian and English

Research target: Psychology Media and Communications Computer Science
Language: Russian
Full text
DOI
Text on another site
Keywords: машинное обучениеInternet mediaкоррупционные правонарушенияRussian-language media space.artificial intelligenceавтоматическая обработка текстовclustering algorithmscorruption-related contentcorruption offensesEnglish-language mediaкоррупционный контентинтернет-СМИрусскоязычные медиаанглоязычные медиа
Similar publications
Психологические издержки традиционной модели государственного управления (на материале органов прокуратуры)
Ахмедова М. С., Orlov A. B., Вопросы психологии 2025 Т. 71 № 4 С. 41–51
The article reflects problems of functioning of procuratorial bodies in the traditional paradigm of public administration, leading to professional burnout and professional deformation of employees. Modern Russian and foreign science are at the stage of awareness and identification of systemic problems in the modern culture of public administration, but the search for ways to solve them ...
Added: June 5, 2026
Proceedings of the 43rd International Conference on Machine Learning (ICML 2026)
Seul: PMLR, 2026.
Added: June 4, 2026
Значимые изменения в улусе Улюнхан, произошедшие за 20 лет: транзитивность традиционной культуры бурят и эвенков
Obukhov A., Маерле М. А., Минаева Е. И., Исследователь/Researcher 2025 № 3-4 С. 282–299
The article presents a preliminary synthesis of materials from an expedition conducted in the Ulyunkhan ulus, an Evenk settlement in the Kurumkansky District of the Republic of Buryatia, in 2025, comparing them with materials from a 2005 expedition. The study focuses on the changes over the past twenty years as perceived by the local residents themselves (Buryats and Evenks). It compares ...
Added: June 2, 2026
Феномен устойчивых во времени неформальных разновозрастных сообществ, осуществляющих воспитательную работу с подростками
Obukhov A., Кириллова К. Б., Исследователь/Researcher 2025 № 3-4 С. 26–48
This article investigates the phenomenon of enduring informal multi-age communities contributing to adolescent upbringing. The conditions for the establishment and the factors ensuring the endurance of such communities are identified through the example of three organizations: the “Nadezhda” unit, the Children and Youth Organization “Ostrov Sokrovishch” (Treasure Island), and the “Karavella” (Caravel) unit. As for ...
Added: June 2, 2026
OpenAtom Foundation. Консорциум, развивающий Open Source в Китае.
Silakov D., Системный администратор 2026 № 3 С. 28–33
В статье про платформы для разработки открытого ПО в Китае мы рассказали про GitCode – молодой проект, позиционируемый как площадка для разработчиков со всего мира. Сейчас на GitCode размещаются проекты, созданные в КНР, но некоторые из них уже известны и на международной арене. Помочь открытым проектам в становлении, развитии и расширению аудитории призван фонд OpenAtom ...
Added: June 2, 2026
Система синтаксических инвариантов текстовой деятельности: статистические дескрипторы, семантическая структура и диагностические профили
Kudriavtseva E., / РЦИС. Серия № 0148-756-286. 2026.
The content of the work is the system is a system for identifying four types of written speech structures. A set of 11 calculated parameters, statistical standards, and semantic characteristics allows for the identification of a text's structure as the result of a specific cognitive schema (scene, event, story, evaluation). The method has been verified ...
Added: June 2, 2026
XII Международный Форум Ассоциации Когнитивно-Поведенческой Психотерапии CBTFORUM: сборник научных статей.
СПб.: Международный институт развития когнитивно-поведенческой терапии, 2026.
This collection of scientific articles addresses current issues and modern developments in cognitive-behavioral therapy in clinical psychology, psychotherapy, coaching, and related fields. ...
Added: May 31, 2026
Почему растущие доходы не делают людей счастливее: эмоциональное объяснение парадокса Истерлина (Why Growing Incomes Do Not Make People Happier: an Emotional Explanation of the Easterlin Paradox)
Vorchik A., / SSRN. Серия Social Science Research Network "Social Science Research Network". 2026.
This work is devoted to a theoretical explanation of the Easterlin paradox, according to which long-term economic growth does not make average level of people's happiness increasing. By happiness, we mean the intensity of emotions people experience while comparing their new income with its expected value, or the target income with its original value. In the first case, ...
Added: May 31, 2026
The recognition-by-components method
Slivnitsin P., Mylnikov L., Engineering Applications of Artificial Intelligence 2026 Vol. 179 Article 115185
The paper describes a applied artificial intelligence task of recognition-by-components method of real objects based on the recognition of a limited set of primitives or components. The recognition-by-components makes it possible to determine the components, that compose an object, and increase the number of recognizable objects without degrading the recognition quality. Training is performed on ...
Added: May 29, 2026
Сборник студенческих работ «Восточная перспектива»
М.: ООО «Адвансед солюшнз», 2026.
Данный выпуск сборника студенческих статей .Восточная перспектива. включает в себя статьи победителей и призеров XI Международной научной студенческой конференции "Восточная перспектива", состоявшейся 18 мая 2024 года. В 2024 году на конференцию было подано 115 заявок, офлайн и онлайн в конференции приняли участие докладчики и слушатели из различных вузов России и ближнего и дальнего Зарубежья. ...
Added: May 29, 2026
Сборник студенческих работ «Восточная перспектива»
М.: ООО «Адвансед солюшнз», 2026.
Данный выпуск сборника студенческих статей «Восточная перспектива» включает в себя статьи победителей и призеров X Международной научной студенческой конференции «Восточная перспектива», состоявшейся 15 апреля 2023 года. Юбилейная конференция стала знаковым событием для студентов различных подразделений НИУ ВШЭ и других вузов России, занимающихся подготовкой востоковедческих кадров. ...
Added: May 29, 2026
Методическая концепция развития навыков саморегуляции у студентов-музыкантов в классе вокала
Ван Г., Торопова А. В., Музыкальное искусство и образование 2025 Т. 13 № 4 С. 73–91
Contemporary vocal pedagogy faces a complex set of methodological challenges that encompass not only the search for ways to improve technical vocal skills for the realization of artistic vision, but also the development of self-control and self-regulation in future singers, as well as the entire creative process. A successful artist cannot function without the ability ...
Added: May 29, 2026
ИССЛЕДОВАНИЕ АССОЦИАЦИИ ГЕНЕТИЧЕСКИХ ВАРИАНТОВ С РАЗВИТИЕМ МУЗЫКАЛЬНЫХ СПОСОБНОСТЕЙ ЧЕЛОВЕКА
Kazantseva A. V., A.V. Toropova, Khusnutdinova E. K. et al., ВАВИЛОВСКИЙ ЖУРНАЛ ГЕНЕТИКИ И СЕЛЕКЦИИ, Федеральный исследовательский центр Институт цитологии и генетики Сибирского отделения Российской академии наук» (ИЦиГ СО РАН) (Новосибирск) 2025 Vol. 30 No. 3 P. 470–481
The development of musical abilities, including absolute pitch, musical memory, rhythm sense, and musicality, at a high degree is determined by a hereditary component (up to 68 %). The studies implementing a genome-wide linkage and association approach to musical aptitude have revealed more than 100 genetic loci. This spectrum is comprised of the genes encoding ...
Added: May 29, 2026
Brain-Computer Interfaces for Gait Rehabilitation After Stroke A Scoping Review
Mokienko O., Zisman M. A., Bobrov P. et al., American Journal of Physical Medicine and Rehabilitation 2026 Vol. 105 No. 6 P. 555–563
Brain-computer interfaces (BCIs) represent a promising technology for restoring lower limb motor functions and gait after stroke. The application of BCIs in this field is supported by a limited number of studies. The objective of the review was to systematically and critically evaluate the current evidence on the use of BCIs for lower limb function ...
Added: May 28, 2026
Социально-психологические факторы доверия искусственному интеллекту: состояние исследований
Samoilov O., Tatarko A., Вопросы теоретической экономики 2026 № 2 С. 209–228
The article presents a theoretical review of literature from the last ten years devoted to the analysis of socio-psychological factors of trust in artificial intelligence. The widespread adoption of automated artificial intelligence systems, which is associated with expected economic growth, reduced resource costs, and the optimization of various work processes, often faces user distrust in ...
Added: May 21, 2026
От неизвестности к прозрачности: обзор технологий объяснимого ИИ (XAI)
Avdoshin S. M., Pesotskaya E. Y., Информационные технологии 2026 Т. 32 № 4 С. 185–194
With the rapid advancement of artificial intelligence, and deep learning in particular, models have emerged that are capable of delivering highly accurate predictions. However, the internal logic of such models remains difficult to interpret—an issue of critical importance, especially in domains where the correctness of an algorithm directly affects high-stakes decision-making. One promising avenue for ...
Added: May 8, 2026
Современные методы анализа временных рядов в мониторинге и прогнозировании состояния оборудования для механизированной добычи
Neznanov A., Glushko A., Овчинников С. et al., В кн.: Интеллектуальный анализ данных в нефтегазовой отрасли.: М.: ООО «Геомодель Развитие», 2024. С. 140–143.
With the development of monitoring systems, now we have the opportunity to collect key performance indicators of devices in the process of artificial lift. Every day a huge amount of telemetry is generated by our devices, which can be used to forecast the working mode and health state of the equipment after the process of ...
Added: April 29, 2026
Правовой режим объектов, созданных искусственным интеллектом: обзор зарубежной практики
Kirsanova E., Pakshin P., Право и экономика 2026 № 3 (456) С. 26–34
The article examines the legal regime of intellectual property created by artificial intelligence. Changes to the existing legal framework towards recognizing artificial intelligence as a legal entity would violate the rationale and fundamental principles of the intellectual property rights system. This article provides an overview of different views on the rationale for granting copyright to ...
Added: April 28, 2026
Intelligent Interfaces and Systems for Human-Computer Interaction
Karpov A., Dvoynikova A., Ryumina E., , in: Lecture Notes in Networks and SystemsVol. 776.: Springer, 2023. P. 3–13.
Abstract. The paper presents a brief review of intelligent interfaces and systems of human-machine interaction. To date, few intelligent interfaces and systems are used in various areas of industry. All the systems of human-machine interaction can be divided into intelligent synthesis and analysis systems. Intelligent synthesis implies the presentation of information from the system to ...
Added: April 25, 2026
Machine Learning Approach to Anticancer Activity Prediction of Transition-Metal Complexes Based on a Large-Scale Experimental Database
Krasnov L., Malikov D., Kiseleva M. et al., Journal of Medicinal Chemistry 2026 Vol. 69 No. 8 P. 8838–8851
In this work, we developed a straightforward data-driven approach to predict the cytotoxicity of metal complexes based entirely on their (metal + ligands) composition. To this end, we have manually curated MetalCytoToxDB─a comprehensive experimental database comprising 26,500 IC50 values for 7050 metal complexes against 754 cell lines from 1921 articles. Based on these, machine learning ...
Added: April 23, 2026
LSTM-модель потребления тепловой энергии в многоэтажном жилом здании
Ершов И. А., Системная инженерия и инфокоммуникации 2025 № 4 С. 11–14
The heat consumption of residential buildings is a stochastic series. It is necessary for the design of thermal energy regulators the creation of a neural network model. In the paper, the model is carried out based on Long Short-Term Memory (LSTM). The high accuracy of reproducing the series was achieved by training the model on ...
Added: April 22, 2026
Алгоритм анализа новостной информации для принятия экономических решений
Чудинова О. С., Первицкая Л. А., Ramenskaya A., Индустриальная экономика 2026 № 1 С. 65–78
This article is devoted to the development of an algorithm for analyzing news information using machine learning methods implemented in Python libraries. The choice of tools used at each stage of the algorithm is justified by calculating metrics for the quality of the solution to the corresponding machine learning problems. The algorithm’s results are presented ...
Added: April 20, 2026
Modeling cosolvent effects on solubility in supercritical CO2 using data-driven approaches
Makarov D. M., Kalikin N., Gurikov P. et al., Journal of Supercritical Fluids 2026 Vol. 235 Article 106979
Supercritical CO2 (scCO2 ) is an environmentally friendly solvent, but its low polarity limits the solubility of polar compounds. Cosolvents are commonly used to enhance solvation capability, yet comprehensive datadriven studies are scarce. We compiled the largest dataset to date — 4401 experimental solubility records with 22 cosolvents for 93 nonionic solutes, plus 4855 records ...
Added: April 19, 2026
Эффективность применения прогнозов волатильности в активных торговых стратегиях институциональных инвесторов на российском рынке акций
Lysenok N., Фундаментальная и прикладная математика 2026 Т. 26 № 3 С. 33–42
This study examines the impact of realized volatility forecasts on the performance of active trading strategies in the Russian equity market. Using a sample of 17 liquid stocks over the period 2014–2026, a hybrid forecasting model is developed that combines HAR-J with gradient boosting; its superiority over the baseline HAR-J specification is confirmed by the ...
Added: April 17, 2026
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit