Style-transfer and Paraphrase: Looking for a Sensible Semantic Similarity Metric

P. 14213–14220.
Yamshchikov I. P., Shibaev V., Khlebnikov N., Tikhonov A.

The rapid development of natural language processing tasks such as style transfer, paraphrase, and machine translation often calls for the use of semantic similarity metrics. In recent years, many methods for measuring the semantic similarity of two short texts have been developed. This paper provides a comprehensive analysis of more than a dozen such methods. Using a new dataset of fourteen thousand sentence pairs, human-labeled according to their semantic similarity, we demonstrate that none of the metrics widely used in the literature is close enough to human judgment in these tasks. A number of recently proposed metrics provide comparable results, yet Word Mover Distance is shown to be the most reasonable solution currently available for measuring semantic similarity in reformulated texts.
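
To make "close to human judgment" concrete, the following is a minimal sketch of one common way to compare a similarity metric with human labels: Spearman rank correlation between the metric's scores and the human ratings for the same sentence pairs. This record does not state which agreement statistic the paper uses, so the choice of Spearman correlation is an assumption, and the function names (`average_ranks`, `pearson`, `spearman`) and the sample numbers are hypothetical illustrations rather than data from the paper.

```python
from typing import Sequence


def average_ranks(values: Sequence[float]) -> list[float]:
    """Return 1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the window over every item tied with the one at `start`.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


def spearman(metric_scores: Sequence[float], human_labels: Sequence[float]) -> float:
    """Spearman correlation: Pearson correlation computed on the ranks."""
    if len(metric_scores) != len(human_labels) or len(metric_scores) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    return pearson(average_ranks(metric_scores), average_ranks(human_labels))


if __name__ == "__main__":
    # Hypothetical scores for five sentence pairs (not data from the paper).
    metric = [0.91, 0.40, 0.75, 0.10, 0.66]
    human = [5, 2, 3, 1, 4]
    print(f"Spearman correlation: {spearman(metric, human):.3f}")
```

A value near 1 means the metric orders sentence pairs much as the annotators do; a value near 0 means the metric's ordering carries little information about human judgment.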

Language: English
Keywords: semantic similarity, semantic similarity measure

In book

The Thirty-Fifth AAAI Conference on Artificial Intelligence. Technical Tracks 16
Leyton-Brown K., Mausam M., Yang Q., Liu Y., Röger G. Vol. 35. Issue 16. AAAI Press, 2021.
Similar publications
Aschern at CheckThat! 2021: Lambda-Calculus of Fact-Checked Claims
Chernyavskiy A., Ilvovsky D., Nakov P., in: CLEF 2021 Working Notes. CEUR Workshop Proceedings, 2021. P. 484–493.
We describe our system for the CLEF 2021 CheckThat! Lab Task 2 Subtask A on detecting previously fact-checked claims. We developed a pipeline using TF.IDF, sentence-BERT fine-tuned on the training data, and reranking using LambdaMART and the predicted similarity scores and positions in the ranked list as features. We examined the quality of each model ...
Added: May 9, 2024
Semantic Recommendation System for Bilingual Corpus of Academic Papers
Safaryan A., Filchenkov P., Yan W. et al., in: Recent Trends in Analysis of Images, Social Networks and Texts. 9th International Conference, AIST 2020, Skolkovo, Moscow, Russia, October 15–16, 2020, Revised Supplementary Proceedings. Vol. 12602. Springer, 2021. Ch. 3. P. 22–36.
We tested four methods of making document representations cross-lingual for the task of semantic search for the similar papers based on the corpus of papers from three Russian conferences on NLP: Dialogue, AIST and AINL. The pipeline consisted of three stages: preprocessing, word-by-word vectorisation using models obtained with various methods to map vectors from two ...
Added: September 18, 2023
Moving Other Way: Exploring Word Mover Distance Extensions
Smirnov I., Yamshchikov I. P., in: COMPLEXIS 2022. Proceedings of the 7th International Conference on Complexity, Future Information Systems and Risk. April 23-24, 2022. Science and Technology Publications, Lda, 2022. P. 92–97.
Added: September 8, 2022
Rethinking Crowd Sourcing for Semantic Similarity
Solomon S., Cohn A., Rosenblum H. et al. Series Computer Science "arxiv.org". 2021.
Estimation of semantic similarity is crucial for a variety of natural language processing (NLP) tasks. In the absence of a general theory of semantic information, many papers rely on human annotators as the source of ground truth for semantic similarity estimation. This paper investigates the ambiguities inherent in crowd-sourced semantic labeling. It shows that annotators ...
Added: December 3, 2021
Lexicographic Study of Synonymy: Clarifying Semantic Similarity between Words
Solovyev V., Gimaletdinova G., Khalitova L. et al., Computacion y Sistemas 2021 Vol. 25 No. 3 P. 667–675
The problem of determining semantic similarity between words affects the understanding of synonymy and creates obstacles to the work of lexicographers. The study was carried out as a part of a larger research project on expert assessment of synonymic rows in RuWordNet thesaurus (a WordNet-like thesaurus for the Russian language). The aim ...
Added: December 1, 2021
Similarity Measures and Models for Movie Series Recommender System
Bliznuk D., Yagunova E., Pronoza E., in: Internet Science. INSCI 2018. Vol. 11193. Springer, 2018. P. 181–193.
In this paper we propose a method of movie series recommender system development. Our recommender system is content-based, and movie series are represented by their scripts. We experiment with several semantic similarity measures, lexico-morphological metrics, keywords and vector space models to extract similar movie series. Evaluation is conducted in the experiment with informants. The best ...
Added: October 30, 2020
Semantic Proximity Establishment in the Tasks of Knowledge Extraction and Named Entities Recognition
Kozerenko E. B., Kuznetsov K. I., Morozova Y. I. et al., in: Proceedings of the 2017 International Conference on Artificial Intelligence. American Council on Science & Education, 2017. P. 339–344.
The paper deals with the problem of establishing text segments containing the similar semantic units for the tasks of analytical text processing within the semantic technology platform. The methods and instruments presented in the paper provide the discovery of relevant content based on users' focused interests within a certain domain. The hybrid approach comprising linguistic ...
Added: February 23, 2018
Trend Monitoring for Linking Science and Strategy
Bakhtin P. D., Saritas O., Chulok A. et al., Scientometrics 2017 Vol. 111 No. 3 P. 2059–2075
Rapid changes in Science & Technology (S&T) along with breakthroughs in products and services concern a great deal of policy and strategy makers and lead to an ever increasing number of Foresight and other types of forward-looking work. At the outset, the purpose of these efforts is to investigate emerging S&T areas, set priorities and ...
Added: December 21, 2016