Linguistic Modeling as a Basis for Creating Authorship Attribution Software

P. 1063–1074.
Khomenko A., Baranova Y., Romanov A., Zadvornov K.

This paper discusses the validation of an integrative attribution method for Russian-language texts. The methodology follows Koppel and Schler (2003): a computer program tries to imitate the work of a human expert. It is therefore based on interpretative language study, objectified through mathematical statistics. The choice of parameters describing an author’s individual style is rooted in treating a text as the product of an authentic language personality. Language personality is described using psycholinguistic (Yu. N. Karaulov) and sociolinguistic (M. Coulthard, R. W. Shuy) methods and the methodology of forensic linguistics (S. M. Vul, D. Wright). On the basis of these principles, attribution software was created: http://khorom-attribution.ru/#/. As output, the resource displays mathematical models of individuals’ styles and the metrics for evaluating the null hypothesis: the Pearson correlation coefficient, linear regression, and Student’s t-test. The resource is designed to solve the identification problem of text attribution for a “closed class” (Juola 2008) with pairwise comparison, but it can also be used for personality diagnostics in forensic, philological, and cultural research.
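The three metrics named in the abstract can be illustrated with a minimal sketch for one pairwise comparison. This is not the authors' implementation: the feature vectors, the function names, and the choice of a paired t-test are all assumptions made for illustration, since the abstract does not say which style parameters are measured or which variant of Student's t-test the software applies.

```python
import math
import statistics


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length vectors."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def linear_fit(x, y):
    """Least-squares slope and intercept of y regressed on x."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    return slope, my - slope * mx


def paired_t_statistic(x, y):
    """Student's t statistic for paired samples (H0: mean difference is 0).

    Assumption: a paired test is used here only as an example variant.
    Raises ZeroDivisionError if all differences are identical.
    """
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    return statistics.fmean(diffs) / (statistics.stdev(diffs) / math.sqrt(n))


# Hypothetical relative frequencies of five style features in two texts.
text_a = [0.12, 0.30, 0.05, 0.22, 0.31]
text_b = [0.10, 0.28, 0.07, 0.25, 0.30]

r = pearson_r(text_a, text_b)
slope, intercept = linear_fit(text_a, text_b)
t = paired_t_statistic(text_a, text_b)
print(f"r={r:.3f} slope={slope:.3f} intercept={intercept:.3f} t={t:.3f}")
```

In a sketch like this, a high correlation and a slope near 1 would suggest similar feature profiles, while the t statistic would be compared against a critical value for n − 1 degrees of freedom to decide whether to reject the null hypothesis of no difference.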

Language: English
Keywords: mathematical model; authorship attribution; linguistic model; language personality

In book

Компьютерная лингвистика и интеллектуальные технологии (Computational Linguistics and Intellectual Technologies)
Issue 20 (27): Supplementary volume. RSUH Publishing House, 2021.