• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • News headline as a form of news text compression
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
June 11, 2026
Doctoral Student at HSE University Reveals Hidden Layout of Ancient Parion
İdil Malgil, a researcher at HSE University, conducted a UAV-based LiDAR survey of the ancient Roman city of Parion in present-day Turkey. The high density of the scans allowed the team to detect subtle terrain features concealed beneath the ground and vegetation. The survey revealed traces of entire neighbourhoods, terraced structures, and walls that had remained invisible during routine excavations and could not be identified through aerial photography. The findings have been published in Ancient Civilizations from Scythia to Siberia.
June 11, 2026
Mathematicians from Nizhny Novgorod and Shanghai Study System Stability
Mathematicians at HSE University–Nizhny Novgorod, in collaboration with colleagues from Tongji University in Shanghai, are investigating the fundamental causes of structural stability in systems and the mechanisms underlying its disruption. In this interview with the HSE News Service, Prof. Olga Pochinka, Head of the International Laboratory of Dynamical Systems and Applications at HSE University–Nizhny Novgorod and leader of the project ‘Qualitative Theory of Systems of Ordinary and Partial Differential Equations,’ discusses the project, which is being implemented as part of HSE University's International Academic Cooperation programme.
June 11, 2026
Neurolinguists Assist in Awake Surgery on 11-Year-Old Patient with Epilepsy
Researchers at the HSE Centre for Language and Brain took part in a rare awake neurosurgical procedure performed on an 11-year-old patient with drug-resistant epilepsy. Working alongside surgeons at the Voyno-Yasenetsky Centre of Specialised Medical Care for Children in Solntsevo, they monitored the resection of a portion of the left temporal lobe, where the epileptic focus had been identified.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

News headline as a form of news text compression

P. 139–147.
Kochetkova N. A., Pronoza E., Yagunova E.

In this paper we analyze news text collections (clusters) via extracting their paraphrase headlines into a paraphrase graph and working with this graph. Our aim is to test whether news headline is an appropriate form of news text compression. Different types of news collections: dynamic, static and combined (both dynamic and static) clusters are analyzed and it is shown that their respective paraphrase graphs reflect the characteristics of the texts. We also automatically extract the most informationally important linked fragments of news texts, and these fragments characterize news texts as either informative, conveying some information, or publicistic ones, trying to affect the readers emotionally. It is shown that news headlines of the informative type do represent their respective compressed news reports

Language: English
DOI
Text on another site
Keywords: text analysislinked text segmentsnews clusterparaphrase extractionparaphrase graph

In book

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 10th International Conference on Social Informatics, SocInfo 2018; St.Petersburg
Springer, 2018.
Similar publications
Анализ культурных референций в творчестве А. Вознесенского: цифровое исследование имен персоналий
Tyuryakova-Matveeva D., Цифровые гуманитарные исследования 2026 № 1 С. 4–26
The article explores cultural references in the works of Andrei Voznesensky by analyzing the personalities he mentions. A total of 1,678 works were processed, including poetry, prose, and early unpublished poems. NER methods based on Natasha, spaCy, and LLM Grok tools made it possible to study the frequency of mentions of famous people and their ...
Added: May 31, 2026
Перспективы медиа-мониторинга в исследованиях общественного мнения (на примере доверия президенту)
Ankudinov I., Социология: методология, методы, математическое моделирование 2025 № 61 С. 165–203
The changing political mood of Russians is a constant subject of interest for sociological agencies. With the development of the Internet, conventional questionnaire research began to be supplemented by online surveys and, despite some skepticism, by social media mining. This article attempts to adjust an accidental web-sample so as to bring its estimates closer to ...
Added: April 22, 2026
Алгоритм анализа новостной информации для принятия экономических решений
Чудинова О. С., Первицкая Л. А., Ramenskaya A., Индустриальная экономика 2026 № 1 С. 65–78
This article is devoted to the development of an algorithm for analyzing news information using machine learning methods implemented in Python libraries. The choice of tools used at each stage of the algorithm is justified by calculating metrics for the quality of the solution to the corresponding machine learning problems. The algorithm’s results are presented ...
Added: April 20, 2026
Юсуф-Ходжа и его братья: О родстве Афанасия Никитина
Lifshits A., Slovĕne 2025 Т. 14 № 1 С. 300–312
The article considers those episodes from the notes of Afanasy Nikitin that allow us to doubt his merchant status. Based on the analysis of grammar, vocabulary and pragmatics of Afanasy’s messages, it is concluded that he traveled along the Volga and further as the head of a small community of people and that he differed ...
Added: September 3, 2025
Semantic Text Analysis Using Artificial Neural Networks Based on Neural-Like Elements with Temporal Signal Summation
Kharlamov Alexander, Eugeny S., Kuznetsov D. et al., Problems of Artificial Intelligence 2023 No. 3(30) P. 4–27
Text as an image is analyzed in the human visual analyzer. In this case, the image is scanned along the points of the greatest informativity, which are the inflections of the contours of the equitextural areas, into which the image is roughly divided. In the case of text analysis, individual characters of the alphabet are ...
Added: October 20, 2024
Use of Text Skeleton Structures for the Development of Semantic Search Methods
A. V. Mylnikova, V. A. Trusov, L. A. Mylnikov, Automatic Documentation and Mathematical Linguistics 2023 Vol. 57 No. 5 P. 301–307
This paper considers the problem of the generation of descriptors to reduce data volumes, text data resources, and search times through the use of the new factors of authorship, region, emotive meaning, and popularity, as well as a text category without special marks that can be used to generate descriptors. This approach allows the use ...
Added: February 29, 2024
Investor sentiment and the NFT hype index: to buy or not to buy?
Baklanova V., Kurkin A., Teplova T., China Finance Review International 2024 Vol. 14 No. 3 P. 522–548
Purpose – The primary objective of this research is to provide a precise interpretation of the constructed machine learning model and produce definitive summaries that can evaluate the influence of investor sentiment on the overall sales of non-fungible token (NFT) assets. To achieve this objective, the NFT hype index was constructed as well as several approaches of ...
Added: December 10, 2023
SmartTips: Online Products Recommendations System Based on Analyzing Customers Reviews
Ali N., Alshahrani A., Alghamdi A. et al., Applied Sciences (Switzerland) 2022 Vol. 12 No. 17 Article 8823
Online customers’ opinions represent a significant resource for both customers and enterprises to extract much information that helps them make the right decision. Finding relevant data while searching the internet is a big challenge for web users, known as the “Problem of Information Overload”. Recommender systems have been recognized as a promising way of solving ...
Added: October 4, 2022
A Semi-automated Pipeline for Mapping the Shifts and Continuities in Media Discourse
Shirokanova A., Silyutina O., , in: Digital Transformation and Global Society. 6th International Conference, DTGS 2021, St. Petersburg, Russia, June 23–25, 2021, Revised Selected Papers.: Springer, 2022. P. 19–35.
Added: January 27, 2022
The Lexico-Semantic Pattern Extraction Automation Based on the Analysis of Text Corpora
Borovin V., Lanin V., Lyadova L. N., , in: 2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT).: IEEE, 2021. P. 1–5.
The need for using English at Russian universities has increased. It makes the ability to write good quality academic texts a necessary skill. Despite the existence of various types of software which can check grammar and/or style of a text, there is no software focusing on linguistic characteristics of academic texts. The academic community accumulated ...
Added: November 2, 2021
ОЦЕНКА КАЧЕСТВА РАСКРЫТИЯ НЕФИНАНСОВОЙ ИНФОРМАЦИИ ПО СТАНДАРТАМ GRI РОССИЙСКИМИ КОМПАНИЯМИ
Fedorova E., Khrustova L., Демин И. С., AlterEconomics (ранее - Журнал экономической теории) 2020 Т. 17 № 2 С. 412–423
The non-financial information is defined as a significant determinant of the company’s activity in terms of many modern theories. The evolution of the company’s investment attractiveness evaluating theory has led to the conclusion that the determining factors include other non-financial characteristics of the company, such as management structure, degree of social and environmental responsibility and ...
Added: October 23, 2021
Методы классификации текстовых данных: можно ли потенциал количественного анализа использовать в качественном исследовании?
Aleksandrova M., ИНТЕРакция. ИНТЕРвью. ИНТЕРпретация 2021 Т. 13 № 2 С. 81–96
Text mining has developed rapidly in recent years. In this article, we compare classification methods that are suitable for solving problems of predicting item nonresponse. The author builds reasoning about how the analysis of textual data can be implemented in a wider research field based on this material. The author considers a number of metrics ...
Added: August 20, 2021
Construction of paraphrase graphs as a means of news clusters extraction
Yagunova E., Pronoza E., Kochetkova N. A., Computacion y Sistemas 2018 Vol. 22 No. 4 P. 1329–1336
In this paper, we construct paraphrase graphs for news text collections (clusters). Our aims are, first, to prove that paraphrase graph construction method can be used for news clusters identification and, second, to analyze and compare stylistically different news collections. Our news collections include dynamic, static and combined (dynamic and static) texts. Their respective paraphrase ...
Added: October 30, 2020
ТОНАЛЬНОСТЬ ОСВЕЩЕНИЯ ПОЗИЦИИ РОССИИ В АНГЛОЯЗЫЧНЫХ СМИ В ПЕРИОД САНКЦИЙ
Khrustova L., Федоров Ф. Ю., Fedorova E., Контуры глобальных трансформаций: политика, экономика, право 2020 Т. 13 № 4 С. 292–310
Обострение политической обстановки, которая свойственна текущей стадии развития международных отношений, сопровождается масштабной информационной войной. Проблема освещения положения России в международной прессе с негативной точки зрения обсуждается с начала 2000-х годов. Российско-украинский конфликт, который начался в конце 2013 - начале 2014 годов, заставил иностранные средства массовой информации вновь обратить внимание на Россию и спровоцировал увеличение количества ...
Added: October 29, 2020
Полнота раскрытия нефинансовой информации российскими компаниями: влияние на инвестиционную привлекательность
Khrustova L., Fedorova E., Демин И. С., Российский журнал менеджмента 2020 Т. 18 № 1 С. 51–72
In the context of the development of the digital economy, the role of a company’s information transparency has become increasingly important. Alongside purely financial information, investors are more likely to also take into account the disclosure of non-financial information in the annual accounts. The purpose of this study is to empirically examine the relationship between ...
Added: August 20, 2020
DISTRIBUTIONAL AND NETWORK SEMANTICS. TEXT ANALYSIS APPROACHES
Kharlamov A. A., Pantiukhin D., Gordeev D., , in: Neuroinformatics and Semantic Representations: Theory and Applications.: Cambridge Scholars Publishing, 2020. Ch. 4 P. 55–113.
Abstract. Over the past decade, a new wave of interest in dialogue agents has been observed. This is largely due to the introduction of machine learning in the tasks of automatic natural language processing. Using the tools of distributional and network semantics makes it possible to summarize data from huge corpora of texts. New language ...
Added: June 22, 2020
Application of NLP Algorithms: Automatic Text Classifier Tool
Romanov A., Ekaterina Kozlova, Lomotin Konstantin, , in: Digital Transformation and Global Society. Third International Conference, DTGS 2018, St. Petersburg, Russia, 2018, Revised Selected Papers. Part II. Communications in Computer and Information Science 859Issue 859.: Springer, 2018. P. 310–323.
This research is dedicated to the design of a decision support system for categorization of scientific literature. The purpose of this work is to research possible ways to apply the machine learning algorithms to the automation of manual text categorization. The following stages are considered: preprocessing of raw data, word embedding, model selection, classification model, ...
Added: August 26, 2019
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit