
 


Using Taxonomy Tree to Generalize a Fuzzy Thematic Cluster

P. 1–6.
Frolov D., Mirkin B., Nascimento S., Fenner T.

This paper presents an algorithm, ParGenFS, for generalizing, or “lifting”, a fuzzy set of topics to higher ranks of a hierarchical taxonomy of a research domain. The algorithm ParGenFS finds a globally optimal generalization of the topic set to minimize a penalty function, by balancing the number of introduced “head subjects” and related errors, the “gaps” and “offshoots”, differently weighted. This leads to a generalization of the topic set in the taxonomy. The usefulness of the method is illustrated on a set of 17685 abstracts of research papers on Data Science published in Springer journals for the past 20 years. We extracted a taxonomy of Data Science from the international Association for Computing Machinery Computing Classification System 2012 (ACM-CCS). We find fuzzy clusters of leaf topics over the text collection, lift them in the taxonomy, and interpret found head subjects to comment on the tendencies of current research.
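
The lifting idea described in the abstract can be illustrated with a small sketch. This is not the authors' ParGenFS implementation. It is a simplified, unweighted variant under assumptions made here for illustration: every head subject costs 1, every gap (a maximal subtree with no positive-membership leaf under a head subject) costs `lam`, and every offshoot (a positive leaf left uncovered by any head subject) costs `gamma`. Membership values are used only to tell positive leaves from empty ones. The function name `lift_sketch`, the default weights, and the toy taxonomy are hypothetical.

```python
from typing import Dict, List, Tuple

Tree = Dict[str, List[str]]  # parent -> children; leaves have no entry


def lift_sketch(tree: Tree, u: Dict[str, float], root: str,
                lam: float = 0.5, gamma: float = 0.9
                ) -> Tuple[float, List[str], List[str], List[str]]:
    """Return (penalty, head_subjects, gaps, offshoots) for the simplified penalty."""

    def visit(t: str):
        # Returns (penalty, heads, gaps, offshoots, potential_gaps, has_positive_leaf).
        kids = tree.get(t, [])
        if not kids:
            if u.get(t, 0.0) > 0:
                # A positive leaf left on its own counts as an offshoot.
                return gamma, [], [], [t], [], True
            # An empty leaf becomes a gap only if an ancestor is lifted.
            return 0.0, [], [], [], [t], False
        results = [visit(k) for k in kids]
        if not any(r[5] for r in results):
            # The whole subtree is empty: one maximal potential gap.
            return 0.0, [], [], [], [t], False
        potential = [g for r in results for g in r[4]]
        keep_cost = sum(r[0] for r in results)      # do not lift to t
        lift_cost = 1.0 + lam * len(potential)      # make t a head subject
        if lift_cost < keep_cost:
            return lift_cost, [t], potential, [], potential, True
        return (keep_cost,
                [h for r in results for h in r[1]],
                [g for r in results for g in r[2]],
                [o for r in results for o in r[3]],
                potential, True)

    penalty, heads, gaps, offshoots, _, _ = visit(root)
    return penalty, heads, gaps, offshoots


# Toy taxonomy (hypothetical): two branches with three leaf topics each.
taxonomy = {"root": ["A", "B"],
            "A": ["a1", "a2", "a3"],
            "B": ["b1", "b2", "b3"]}
membership = {"a1": 0.8, "a2": 0.6, "b1": 0.5}  # fuzzy cluster over leaves

penalty, heads, gaps, offshoots = lift_sketch(taxonomy, membership, "root")
print(penalty, heads, gaps, offshoots)
```

With these weights, branch A is lifted to a head subject at the price of one gap (`a3`), while `b1` stays an offshoot because lifting to B or to the root would introduce more gap penalty than it saves. Each node compares only two options that depend on its subtree alone, which is why a single bottom-up pass finds the minimum of this simplified penalty.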

Language: English
Keywords: fuzzy sets, generalization, annotated suffix tree, gap-offshoot penalty
Publication based on the results of:
Development of methods for structuring and conceptualizing text data based on a domain taxonomy (2019)

In book

Fuzzy Systems (FUZZ-IEEE), IEEE International Conference Proceedings
IEEE, 2019.
Similar publications
Application of Fuzzy Logic-Based Models to Financial Time Series
Shvedov A. S., Sviyazov V., in: System Modelling of Socio-Economic Processes: Proceedings of the 46th International Scientific School-Seminar, Ufa, October 9–15, 2023. Voronezh: Istoki, 2024. P. 526–531.
The generalized autoregressive conditional heteroscedasticity model is widely applied to financial time series. There are further generalizations of this model. One such generalization is a combination of Takagi–Sugeno type fuzzy systems and autoregressive conditional heteroscedasticity models. The advantage of Takagi–Sugeno fuzzy systems is that there is a standalone generalized autoregressive conditional heteroscedasticity model constructed for ...
Added: June 26, 2024
22nd International Conference, MMST 2022, Nizhny Novgorod, Russia, November 14–17, 2022, Revised Selected Papers
Springer, 2022.
This book constitutes selected and revised papers from the 22nd International Conference on Mathematical Modeling and Supercomputer Technologies, MMST 2022, held in Nizhny Novgorod, Russia, in November 2022. The 20 full papers and 5 short papers presented in the volume were thoroughly reviewed and selected from the 48 submissions. They are organized in topical sections on computational methods ...
Added: December 26, 2022
Admissible and Bayes decisions with fuzzy-valued losses
Shvedov A. S., Discrete Mathematics and Applications 2022 Vol. 32 No. 2 P. 139–145
Added: May 31, 2022
Fertility in Russian Municipalities in 2011–2019
Petrosian A., Demographic Review 2021 No. 3 P. 42–73
Until recently, insufficient data resulted in a lack of studies on population indicators for small areas in Russia. The article focuses on an analysis of fertility in 2,304 municipal-level areas of Russia’s regions. The study is based on the Rosstat municipal data on the number of women of reproductive age by five-year age groups and the number ...
Added: May 2, 2022
A Robust Credibility DEA Model with Fuzzy Perturbation Degree: An Application to Hospitals Performance
Omrani H., Alizadeh A., Emrouznejad A. et al., Expert Systems with Applications 2022 Vol. 189 Article 116021
Performance evaluation enables decision makers (DMs) to have a better view about the weaknesses and strengths of leading units to improve efficiencies as a crucial goal. Data envelopment analysis (DEA) is the most popular technique to measure performance efficiency of decision making units (DMUs). However, conventional DEA is unable to consider uncertainty of input and ...
Added: October 12, 2021
Formal Concept Analysis: 16th International Conference, ICFCA 2021, Strasbourg, France, June 29 – July 2, 2021, Proceedings
Springer, 2021.
This book constitutes the proceedings of the 16th International Conference on Formal Concept Analysis, ICFCA 2021, held in Strasbourg, France, in June/July 2021. The 14 full papers and 5 short papers presented in this volume were carefully reviewed and selected from 32 submissions. The book also contains four invited contributions in full paper length. The research part ...
Added: July 10, 2021
Abstraction and Generalization in the Logic of Science: Cases from Nineteenth-Century Scientific Practice
Pietarinen A., Cristalli C., HOPOS 2021 Vol. 11 No. 1 P. 93–121
Abstraction and generalization are two processes of reasoning that have a special role in the construction of scientific theories and models. They have been important parts of the scientific method ever since the nineteenth century. A philosophical and historical analysis of scientific practices shows how abstraction and generalization found their way into the theory of ...
Added: May 29, 2021
A Hybrid Approach to the Analysis of a Collection of Research Papers
Mirkin B., Frolov D., Vlasov A. et al., in: Intelligent Data Engineering and Automated Learning – IDEAL 2020: 21st International Conference, Guimaraes, Portugal, November 4–6, 2020, Proceedings, Part II. Vol. 12490: Lecture Notes in Computer Science. Cham: Springer, 2020. P. 423–433.
We define and find a most specific generalization of a fuzzy set of topics assigned to leaves of the rooted tree of a taxonomy. This generalization lifts the set to a “head subject” in the higher ranks of the taxonomy, that is supposed to “tightly” cover the query set, possibly bringing in some errors, both ...
Added: November 13, 2020
War of Attrition with Incomplete Information and Fuzzy Players' Types
Shvedov A. S., Automation and Remote Control 2020 Vol. 81 No. 7 P. 1279–1285
The result on existence of a pure-strategy symmetric Bayesian Nash equilibrium in the war of attrition is generalized for fuzzy players’ actions and types. ...
Added: September 26, 2020
War of Attrition with Incomplete Information and Fuzzy Players' Types (in Russian)
Shvedov A. S., Avtomatika i Telemekhanika 2020 No. 7 P. 139–147
The result on existence of a pure-strategy symmetric Bayesian Nash equilibrium in the war of attrition is generalized for fuzzy players' actions and types. ...
Added: September 26, 2020
A Hybrid Approach to Interpretable Analysis of Research Paper Collections
Mirkin B., Frolov D., Vlasov A. et al., in: WIMS 2020: Proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics. Association for Computing Machinery (ACM), 2020. P. 184–189.
We define and find a most specific generalization of a fuzzy set of topics assigned to leaves of the rooted tree of a taxonomy. This generalization lifts the set to a “head subject” in the higher ranks of the taxonomy, that is supposed to “tightly” cover the query set, possibly bringing in some errors, both ...
Added: August 28, 2020
Computational Generalization in Taxonomies Applied to: (1) Analyze Tendencies of Research and (2) Extend User Audiences
Frolov D., Mirkin B., Nascimento S. et al., in: Intelligent Data Engineering and Automated Learning – IDEAL 2019. Vol. 2. Springer, 2019. P. 3–11.
We define a most specific generalization of a fuzzy set of topics assigned to leaves of the rooted tree of a domain taxonomy. This generalization lifts the set to its “head subject” node in the higher ranks of the taxonomy tree. The head subject is supposed to “tightly” cover the query set, possibly bringing in some errors referred to ...
Added: December 7, 2019
Intelligent Data Engineering and Automated Learning – IDEAL 2019
Springer, 2019.
We define a most specific generalization of a fuzzy set of topics assigned to leaves of the rooted tree of a domain taxonomy. This generalization lifts the set to its “head subject” node in the higher ranks of the taxonomy tree. The head subject is supposed to “tightly” cover the query set, possibly bringing in some errors referred to ...
Added: December 7, 2019
Parsimonious Generalization of Fuzzy Thematic Sets in Taxonomies Applied to the Analysis of Tendencies of Research in Data Science
Frolov D., Nascimento S., Fenner T. et al., Information Sciences 2020 Vol. 512 P. 595–615
This paper proposes a novel method, referred to as ParGenFS, for finding a most specific generalization of a query set represented by a fuzzy set of topics assigned to leaves of the rooted tree of a taxonomy. The query set is generalized by “lifting” it to one or more “head subjects” in the higher ranks ...
Added: October 9, 2019
Proceedings of the 11th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT 2019)
Paris: Atlantis Press, 2019.
Added: September 25, 2019
Globally Optimal Parsimoniously Lifting a Fuzzy Query Set Over a Taxonomy Tree
Frolov D., Mirkin B., Nascimento S. et al., in: Optimization of Complex Systems: Theory, Models, Algorithms and Applications. Switzerland: Springer Publishing Company, 2020. P. 779–789.
This paper presents a relatively rare case of an optimization problem in data analysis to admit a globally optimal solution by a recursive algorithm. We are concerned with finding a most specific generalization of a fuzzy set of topics assigned to leaves of domain taxonomy represented by a rooted tree. The idea is to “lift” ...
Added: June 25, 2019
Fuzzy Phonetic Encoding of Speech Signals in Voice Processing Systems
Savchenko L.V., Savchenko A.V., Journal of Communications Technology and Electronics 2019 Vol. 64 No. 3 P. 238–244
In this paper, we studied the phonetic approach for voice processing. A method for automatic recognition of speech signals, in which each quasistationary segment is associated with a fuzzy set of phonemes, was developed. We proposed the operation of the probabilistic triangular norm for fuzzy sets corresponding to the input frame and the nearest reference phoneme. The developed ...
Added: June 7, 2019
Using Domain Taxonomy to Model Generalization of Thematic Fuzzy Clusters
Frolov D., Mirkin B., Nascimento S. et al., in: CONTENT 2019, The Eleventh International Conference on Creative Content Technologies. International Academy, Research, and Industry Association (IARIA), 2019. P. 20–25.
We define a most specific generalization of a fuzzy set of topics assigned to leaves of the rooted tree of a domain taxonomy. This generalization lifts the set to its 'head subject' in the higher ranks of the taxonomy tree. The head subject is supposed to 'tightly' cover the query set, possibly bringing in some ...
Added: June 4, 2019
CONTENT 2019, The Eleventh International Conference on Creative Content Technologies
International Academy, Research, and Industry Association (IARIA), 2019.
Added: June 4, 2019
Method for Generalization of Fuzzy Sets
Frolov D., Mirkin B., Nascimento S. et al., in: International Conference on Artificial Intelligence and Soft Computing. 18th International Conference, ICAISC 2019, Zakopane, Poland, June 16–20, 2019, Proceedings, 1, Issue 11508. Cham: Springer, 2019. P. 273–286.
We define and find a most specific generalization of a fuzzy set of topics assigned to leaves of the rooted tree of a taxonomy. This generalization lifts the set to a “head subject” in the higher ranks of the taxonomy, that is supposed to “tightly” cover the query set, possibly bringing in some errors, both ...
Added: June 3, 2019
International Conference on Artificial Intelligence and Soft Computing. 18th International Conference, ICAISC 2019, Zakopane, Poland, June 16–20, 2019, Proceedings
Cham: Springer, 2019.
The series Lecture Notes in Computer Science (LNCS), including its subseries Lecture Notes in Artificial Intelligence (LNAI) and Lecture Notes in Bioinformatics (LNBI), has established itself as a medium for the publication of new developments in computer science and information technology research and teaching - quickly, informally, and at a high level. The two-volume set LNCS ...
Added: June 3, 2019