• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • Spot the Bot: Distinguishing Human-Written and Bot-Generated Texts Using Clustering and Information Theory Techniques
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 20, 2026
HSE University Opens First Representative Office of Satellite Laboratory in Brazil
HSE University-St Petersburg opened a representative office of the Satellite Laboratory on Social Entrepreneurship at the University of Campinas in Brazil. The platform is going to unite research and educational projects in the spheres of sustainable development, communications and social innovations.
May 18, 2026
The 'Second Shift' Is Not Why Women Avoid News
Women are more likely than men to avoid political and economic news, but the reasons for this behaviour are linked less to structural inequality or family-related stress than to personal attitudes and the emotional perception of news content. This conclusion was reached by HSE researchers after analysing data from a large-scale survey of more than 10,000 residents across 61 regions of Russia. The study findings have been published in Woman in Russian Society.
May 15, 2026
Preserving Rationality in a Period of Turbulence
The HSE International Laboratory for Logic, Linguistics and Formal Philosophy studies logic and rationality in a transformed world characterised by a diversity of logical systems and rational agents. The laboratory supports and develops academic ties with Russian and international partners. The HSE News Service spoke with the head of the laboratory, Prof. Elena Dragalina-Chernaya, about its work.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Spot the Bot: Distinguishing Human-Written and Bot-Generated Texts Using Clustering and Information Theory Techniques

Ch. 3. P. 20–27.
Gromov V., Dang Q. N.
Language: English
DOI
Text on another site
Keywords: clusteringinformation theorysemantic analysis

In book

10th International Conference, PReMI 2023, Kolkata, India, December 12–15, 2023, Proceedings. Pattern Recognition and Machine Intelligence. LNCS, volume 14301
Cham: Springer, 2023.
Similar publications
Картирование медицинской науки: результаты интеллектуального анализа больших данных
Grebenyuk A. Y., Lobanova P., Саввин Н. В. et al., Медицинские технологии. Оценка и выбор 2026 № 1(47) С. 36–47
Objective: Analysis of the current global agenda in medical science. Material and methods: The article proposes an approach to building a medical research landscape based on semantic analysis and mapping of medical topics. For this purpose, 2252 topics from English-language articles published in 2024 related to the field of medicine were vectorized, the embeddings were obtained ...
Added: February 20, 2026
Flexible Stock Market Algorithm
Rubchinskiy A., Chubarova D., Technology and Investment 2025 Vol. 16 No. 4 P. 211–240
The article considers one of the most famous examples of socio-economic systems characterized by significant uncertainty—the S&P-500 stock market, where shares of 500 largest US companies are traded. The flexible algorithm for daily trading has been developed. It is based on known fixed data about cost of shares in previous days as well as on ...
Added: December 19, 2025
Tunnel Clustering Method
F. T. Aleskerov, A. L. Myachin, V. I. Yakuba, Doklady Mathematics 2024 Vol. 110 No. 3 P. 474–479
We propose a novel method for rapid pattern analysis of high-dimensional numerical data, termed tunnel clustering. The main advantages of the method are its relatively low computational complexity, endogenous determination of cluster composition and number, and a high degree of interpretability of final results. We present descriptions of three different variations: one with fixed hyperparameters, ...
Added: March 3, 2025
Использование Z-чисел для описания набора данных
Гусейнов О., Degtyarev K. Y., IRETC MTÜ PAHTEI - Proceedings of Azerbaijan High Technical Educational Institutions 2025 Т. 48 № 1 С. 360–370
The concept of Z-number was proposed by Prof. Lotfi Zadeh to describe partial reliability of information, and it is a kind of fusion of fuzziness and probabilistic uncertainty. Z-number can be presented as a pair of fuzzy numbers Z(A,B) used to describe a value of a random variable X. The first component (A) is a ...
Added: February 20, 2025
Using Big Data for Foresight: Scientometric and Semantic Analysis for South Africa
Saritas O., Kotsemir M., , in: 21st Century Foresight: Shaping the Future for Sustainable Social, Economic and Environmental Development in South Africa.: Cham: Springer, 2024. P. 115–208.
The South Africa 2030 Foresight study, utilizing big data analytics, aimed at advancing the national innovation system and enhancing knowledge generation capacity. This chapter presents outcomes from bibliometric and semantic analyses conducted by ISSEK HSE. Bibliometric analysis explored South Africa’s research competences, scientific capacity and global collaborators, revealing mature and emerging research areas. Semantic analysis ...
Added: February 14, 2025
Gradient descent clustering with regularization to recover communities in transformed attributed networks
Shalileh S., Social Network Analysis and Mining 2025 Vol. 15212 P. 137–148
Community detection in attributed networks aims to recover clusters in which the within-community nodes are as interconnected and as homogeneous as possible, while the between-communities nodes are as disconnected and as heterogeneous as possible. The current research proposes a straightforward data-driven model with an integrated regularization term to recover communities. For further improvement of the ...
Added: November 30, 2024
An empirical scrutinization of four crisp clustering methods with four distance metrics and one straightforward interpretation rule
T. A. Alvandyan, S. Shalileh, Doklady Mathematics 2024 Vol. 110 No. S1 P. S236–S250
Clustering has always been in great demand by scientific and industrial communities.  However, due to the lack of ground truth, interpreting its obtained results can be debatable. The current research provides an empirical benchmark on the efficiency of three popular and one recently proposed crisp clustering methods. To this end, we extensively analyzed these (four) ...
Added: November 30, 2024
Моделирование оплаты труда учителей в условиях неоднородности социально-экономического состояния регионов
Богданова Т. К., Жукова Л. В., В кн.: XI-я международная конференция «Многомерный статистический анализ, эконометрика и моделирование реальных процессов» имени С.А. Айвазяна.: М.: ЦЭМИ РАН, 2024. С. 41–44.
The paper is devoted to the analysis and forecasting of the average salary of teachers. For 84 regions on the basis of their socio-demographic characteristics according to Rosstat data using Ward's method we obtained a two-cluster solution, which allowed us to identify quite strong differences in the level of wages, GRP per capita, level of ...
Added: October 4, 2024
Cross-country analysis of science, technology and innovation policies: non-covid-19 related and Covid-19 specific STI policies in OECD countries
Russo M., Pavone P., Meissner D. et al., Quality and Quantity 2025 Vol. 59 No. Suppl 1 P. S343–S367
In OECD countries, Science, Technology and Innovation (STI) policies were seen as key aspects of coping with the Covid-19 pandemic. Now that the pandemic is over, identifying which policy mix portfolios characterised countries in terms of their non-Covid-19 related and Covid-19 specific STI policies fills a knowledge gap on changes in STI policies induced by ...
Added: September 27, 2024
Clustering with empty clusters
Penikas H. I., Феста Ю. Ю., Известия Дальневосточного федерального университета. Экономика и управление 2024 Vol. 2 P. 75–94
Кластерный анализ широко используется в различных научных и практических областях, связанных с анализом данных. Это важный инструмент для решения задач в таких областях, как машинное обучение, обработка изображений, распознавание текста и т.д. Отсутствие наблюдений не всегда означает отсутствие информации, поэтому предполагается, что наличие пробелов в данных, наличие“пустых” кластеров, также несёт в себе информацию об объекте исследования, как и реальные наблюдения. В этом исследовании предполагается, ...
Added: August 10, 2024
Detecting linguistic variation with geographic sampling
Koile E., Moroz G., Journal of Linguistic Geography 2024 Vol. 12 No. 1 P. 24–31
Geolectal variation is often present in settings where one language is spoken across a vast geographic area. This can be found in phonological, morphosyntactic, and lexical features. For practical reasons, it is not always possible to conduct fieldwork in every single location of interest in order to obtain the full pattern of variation, and a ...
Added: May 6, 2024
О влиянии неровновероятности выходной последовательности на качество криптографических преобразований
Los A., Nesterenko A., Rogacheva O., В кн.: Алгебра, теория чисел, дискретная математика и многомасштабное моделирование: современные проблемы, приложения и проблемы истории. Материалы XXII Международной конференции, посвящённой 120-летию со дня рождения академика Андрея Николаевича Колмогорова и 60-летию со дня открытия школы-интерната № 18 при Московском университете.: [б.и.], 2023. С. 151–157.
One of the requirements for the quality of cryptographic algorithms is the equiprobable distribution of the characters of the sequence obtained after applying the cryptographic transformation. This requirement is due to the fact that in the presence of unequal probability of signs of the output sequence, it becomes possible to construct an effective method for ...
Added: April 24, 2024
Unsourced Random Access With the MIMO Receiver: Projection Decoding Analysis
Kirill Andreev, Ustinova D., Alexey Frolov, IEEE Wireless Communications Letters 2024 Vol. 13 No. 1 P. 69–73
We consider unsourced random access with MIMO receiver – a crucial communication scenario for future 5G/6G wireless networks. We perform a projection-based decoder analysis and derive energy efficiency achievability bounds when channel state information is unknown at transmitters and the receiver (no-CSI scenario). A comparison to the maximum-likelihood (ML) achievability bounds by Gao et al. ...
Added: January 22, 2024
Temperature-driven transition into vortex clusters in low-kappa intertype superconductors
Backs A., Al-Falou A., Vagov A. et al., Physical Review B: Condensed Matter and Materials Physics 2023 Vol. 107 No. 17 Article 174527
In the vicinity of the type-I/type-II crossover in conventional superconductors, vortices exhibit a nonmonotonic interaction, which leads to exotic vortex matter states. We perform molecular dynamics simulations on a model superconductor in the intertype regime. In a field cooled approach, we examine the transition of a homogeneous vortex lattice (VL) into a structure consisting of ...
Added: November 2, 2023
2023 Fifth International Conference Neurotechnologies and Neurointerfaces (CNN) 18-20 Sept. 2023
Alshanskaia E., Martynova O., IEEE, 2023.
Cognitive and emotional load in the course of increasing the complexity of tasks leads to the activation of various parts of the autonomic nervous system (ANS) and can be accompanied by an increase in the efficiency of problem solving. An increase in cognitive load under the condition of high motivation is a stress factor and ...
Added: September 24, 2023
Energy efficient coded random access for the wireless uplink
Kowshik S., Kirill Andreev, Frolov A. et al., IEEE Transactions on Communications 2020 Vol. 68 No. 8 P. 4694–4708
We discuss the problem of designing channel access architectures for enabling fast, low-latency, grant-free, and uncoordinated uplink for densely packed wireless nodes. Specifically, we study random-access codes, previously introduced for the AWGN MAC, in the practically more relevant case of Rayleigh fading, when channel gains are unknown to the decoder. We propose a random coding ...
Added: September 9, 2023
Energy Efficiency of Unsourced Random Access over the Binary-Input Gaussian Channel
Glebov A., Rybin P., Kirill Andreev et al., IEEE Communications Letters 2023 Vol. 27 No. 9 P. 2313–2317
We investigate the fundamental limits of the unsourced random access over the binary-input Gaussian channel. By fundamental limits, we mean the minimal energy per bit required to achieve the target per-user probability of error. The original method proposed by Y. Polyanskiy (2017) and based on Gallager’s trick does not work well for binary signaling. We ...
Added: September 9, 2023
Coded Compressed Sensing With List Recoverable Codes for the Unsourced Random Access
Kirill Andreev, Rybin P., Alexey Frolov, IEEE Transactions on Communications 2022 Vol. 70 No. 12 P. 7886–7898
We consider a coded compressed sensing approach for the unsourced random access and replace the outer tree code proposed by Amalladinne et al. (2020) with the list recoverable code capable of correcting t errors. A finite-length random coding bound for such codes is derived. The numerical experiments in the single-antenna quasi-static Rayleigh fading channel show that transition ...
Added: September 9, 2023
2023 Wave Electronics and its Application in Information and Telecommunication Systems (WECONF)
IEEE, 2023.
Processing and transmission of information and telecommunication systems It is supposed to consider the results of current and promising scientific research on: processing and transmission of information in infocommunication systems; solving problems of error-correcting coding, assessing the limiting characteristics of communication systems; methods of machine learning and decision making; solving urgent problems that are formed at the junction of information ...
Added: July 18, 2023
Новая программная платформа для моделирования транспортных потоков с участием беспилотных автомобилей
Beklaryan A., Вестник ЦЭМИ 2023 Т. 6 № 1 Статья 5
The article presents a new software platform for modelling traffic flows involving unmanned vehicles, using a number of advanced technological solutions, in particular, the FLAME GPU supercomputer agent modelling framework, intelligent software modules based on fuzzy and hierarchical clustering, genetic optimization algorithms, a subsystem for visualizing the state of agents-vehicles based on OpenGL, etc. As ...
Added: June 4, 2023
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit