• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • Object-Attribute Biclustering for Elimination of Missing Genotypes in Ischemic Stroke Genome-Wide Data
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
April 30, 2026
HSE Researchers Compile Scientific Database for Studying Childrens Eating Habits
The database created at HSE University can serve as a foundation for studying children’s eating habits. This is outlined in the study ‘The Influence of Age, Gender, and Social-Role Factors on Children’s Compliance with Age-Based Nutritional Norms: An Experimental Study Using the Dish-I-Wish Web Application.’ The work has been carried out as part of the HSE Basic Research Programme and was presented at the XXVI April International Academic Conference named after Evgeny Yasin.
April 30, 2026
New Foresight Centre Study Identifies the Most Destructive Global Trends for Humankind
A team of researchers from the HSE International Research and Educational Foresight Centre has examined how global trends affect the quality of human life—from life expectancy to professional fulfilment. The findings of the study titled ‘Human Capital Transformation under the Influence of Global Trends’ were published in Foresight.
April 28, 2026
Scientists Develop Algorithm for Accurate Financial Time Series Forecasting
Researchers at the HSE Faculty of Computer Science benchmarked more than 200,000 model configurations for predicting financial asset prices and realised volatility, showing that performance can be improved by filtering out noise at specific frequencies in advance. This technique increased accuracy in 65% of cases. The authors also developed their own algorithm, which achieves accuracy comparable to that of the best models while requiring less computational power. The study has been published in Applied Soft Computing.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Object-Attribute Biclustering for Elimination of Missing Genotypes in Ischemic Stroke Genome-Wide Data

P. 185–204.
Ignatov D. I., Khvorykh G., Khrunin A., Nikolic S., Shaban, M., Petrova E., Petrova E., Koltsova E., Takelait F., Egurnov D.

© 2021, Springer Nature Switzerland AG.Missing genotypes can affect the efficacy of machine learning approaches to identify the risk genetic variants of common diseases and traits. The problem occurs when genotypic data are collected from different experiments with different DNA microarrays, each being characterised by its pattern of uncalled (missing) genotypes. This can prevent the machine learning classifier from assigning the classes correctly. To tackle this issue, we used well-developed notions of object-attribute biclusters and formal concepts that correspond to dense subrelations in the binary relation patients× SNPs. The paper contains experimental results on applying a biclustering algorithm to a large real-world dataset collected for studying the genetic bases of ischemic stroke. The algorithm could identify large dense biclusters in the genotypic matrix for further processing, which in return significantly improved the quality of machine learning classifiers. The proposed algorithm was also able to generate biclusters for the whole dataset without size constraints in comparison to the In-Close4 algorithm for generation of formal concepts.

Language: English
DOI
Keywords: data miningFormal Concept Analysisbiclusteringischemic strokeSingle Nucleotide PolymorphismMissing Genotypes
Publication based on the results of:
Development of Mathematical Models and Methods for Recommender Systems and Natural Language Processing (2021)

In book

Recent Trends in Analysis of Images, Social Networks and Texts. 9th International Conference, AIST 2020, Skolkovo, Moscow, Russia, October 15–16, 2020 Revised Supplementary Proceedings
Vol. 12602. , Springer, 2021.
Similar publications
Is Canfield Right? On the Asymptotic Coefficients for the Maximum Antichain of Partitions and Related Counting Inequalities
Ignatov D. I., , in: 11th International Conference, AIST 2023, Yerevan, Armenia, September 28–30, 2023, Revised Selected Papers. Analysis of Images, Social Networks and Texts. Lecture Notes in Computer Science (LNCS, volume 14486).: Cham: Springer, 2024. P. 349 – 361.
This paper dates back to the asymptotic solutions of Rota’s problem on the size of maximum antichain in the set partition lattice by Canfield and Harper and others. The knowledge of asymptotic coefficients could pave the way to the asymptotic solutions of such problems as (maximal) antichain counting in partition lattices. In addition to our ...
Added: January 23, 2026
Предикторы годовой выживаемости после ишемического инсульта
Kulikova S., Polyakova I., Kuzmicheva E. et al., Неврология, нейропсихиатрия, психосоматика 2025 Т. 17 № 5 С. 48–54
Predicting the outcome of ischaemic stroke (IS) is a complex task, as mortality and disability depend on many factors, including age, gender, type and severity of stroke, and comorbidities. Survival rates also vary between countries depending on genetic characteristics and differences in the organisation of healthcare systems. Objective: to search for predictors of one-year survival after ...
Added: October 27, 2025
Роль очага инфаркта мозга в определении этиологии ишемического инсульта: обзор литературы
Кулеш А. А., Мехряков С. А., Демин Д. А. et al., Неврология и нейрохирургия Восточная Европа 2025 Т. 15 № 3 С. 436–445
The etiology of ischemic stroke is extremely diverse. According to the SSS-TOAST classification, ischemic stroke can be caused by atherosclerotic lesions of large arteries, cardiogenic thromboembolism, occlusion of small arteries (lacunar), other established (dissection, cerebral venous thrombosis, migraine, reversible cerebral vasoconstriction syndrome, antiphospholipid syndrome, etc.), and unspecified causes. Strokes of unknown etiology may be caused ...
Added: October 20, 2025
Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. European Conference, ECML PKDD 2024, Vilnius, Lithuania, September 9–13, 2024, Proceedings, Part X. LNCS, volume 14950
Cham: Springer, 2024.
This multi-volume set, LNAI 14941 to LNAI 14950, constitutes the refereed proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2024, held in Vilnius, Lithuania, in September 2024. ...
Added: November 22, 2024
2023 IEEE International Conference on Data Mining Workshops (ICDMW) 1–4 December 2023, Shanghai, China
Shanghai: IEEE Computer Society, 2023.
The IEEE International Conference on Data Mining (ICDM) has established itself as the world’s premier research conference in data mining. It provides an international forum for presentation of original research results, as well as exchange and dissemination of innovative and practical development experiences. The conference covers all aspects of data mining, including algorithms, software, systems, ...
Added: March 20, 2024
Поиск закономерностей и важности признаков в данных виктимизационного опроса
D'yakonov A., Головина А. М., Прикладная математика и информатика 2023 Т. 61 № 74 С. 91–108
A methodology for finding patterns by solving machine learning problems with a teacher is described and applied to the analysis of national victimization survey data. Important features for machine learning models, interesting patterns and inconsistencies in the data are found. Experiments on estimating feature importance using different methods are described. ...
Added: March 18, 2024
Сентимент-анализ как метод исследования информационной повестки и общественного мнения (на примере СМИ и социальных сетей КНР)
Анташева М. С., Lobanova P., Isaeva J. K. et al., Социология: методология, методы, математическое моделирование 2023 № 57 С. 7–41
The information agenda broadcast by Chinese media resources is a   source of up-to-date data on public opinion on key issues of social welfare. Due to the technical peculiarities of the organization of Chinese websites and the need to attract additional resources for automatic processing  (parsing)  of texts in Chinese, this topic is not widely represented in domestic and foreign studies. The ...
Added: November 9, 2023
Data Analysis and Optimization. In Honor of Boris Mirkin's 80th Birthday
Springer, 2023.
This book presents the state-of-the-art in the emerging field of data science and includes models for layered security with applications in the protection of sites—such as large gathering places—through high-stake decision-making tasks. Such tasks include cancer diagnostics, self-driving cars, and others where wrong decisions can possibly have catastrophic consequences. Additionally, this book provides readers with ...
Added: August 31, 2023
Knowledge Discovery, Knowledge Engineering and Knowledge Management: 13th International Joint Conference, IC3K 2021, Virtual Event, October 25–27, 2021, Revised Selected Papers
Springer, 2023.
This book constitutes the extended and revised versions of a set of selected papers from the 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2021, on October 25–27, 2021. The conference was held virtually due to the COVID-19 crisis. The 9 full papers included in this book were carefully reviewed and ...
Added: July 8, 2023
Исследование и определение признаков скрытых атак на предприятии для алгоритмов машинного обучения
Золотухина М. А., Zykov S. V., Вестник Российского нового университета 2023 № 1 С. 20–28
Зачастую именно человеческий фактор ведет к распространению угроз на предприятиях. Если техническое устройство представляет собой четко работающий и слаженный механизм с возможностью при помощи диагностического оборудования проводить замеры параметров неисправностей и устранять их, то для исследования скрытых атак необходим новый компонент системы. Предприятия и промышленность в целом нуждаются в интеллектуальной системе защиты и обнаружения скрытых ...
Added: April 11, 2023
Information Systems and Design. Third International Conference, ICID 2022, Tashkent, Uzbekistan, September 12–13, 2022, Revised Selected Papers
Springer, 2023.
This book constitutes the proceedings of Third International Conference on Information Systems and Design, ICID 2022, which took place in Tashkent, Uzbekistan, in September 2022.  The 12 papers presented in this volume were carefully reviewed and selected from 35 submissions. They were organized in topical sections as follows: methodological support of analysis and management tools: theoretical-focused ...
Added: March 31, 2023
Triclustering in Big Data Setting
Egurnov D., Точилкин Д. С., Ignatov D. I., , in: Complex Data Analytics with Formal Concept Analysis.: Springer, 2022. P. 239–258.
In this paper, we describe versions of triclustering algorithms adapted for efficient calculations in distributed environments with MapReduce model or parallelisation mechanism provided by modern programming languages. OAC-family of triclustering algorithms shows good parallelisation capabilities due to the independent processing of triples of a triadic formal context. We provide time and space complexity of the ...
Added: November 1, 2022
Machine learning methods for demographic data analysis
Muratova A., Ignatov D. I., Mitrofanova E., , in: Recent Trends in Analysis of Images, Social Networks and Texts. 9th International Conference, AIST 2020, Skolkovo, Moscow, Russia, October 15–16, 2020 Revised Supplementary ProceedingsVol. 12602.: Springer, 2021. P. 297–299.
This is the extended abstract of a case study on demographic sequences analysis by machine learning and data mining methods. ...
Added: November 1, 2022
Triclusters of Close Values for the Analysis of 3D Data
Egurnov D., Ignatov D. I., Automation and Remote Control 2022 Vol. 83 No. 6 P. 894–902
Abstract: The paper deals with the problem of triclustering in multivalued triadic contexts in termsof one multidimensional extension of formal concept analysis; triclustering can be viewed as asearch for dense subtensors in three-dimensional tensors over the field of real numbers. Twomethods are proposed for solving this problem, namely, NOAC—a version of the OACtriclustering method for ...
Added: November 1, 2022
Recent Trends in Analysis of Images, Social Networks and Texts. 10th International Conference, AIST 2021, Tbilisi, Georgia, December 16–18, 2021, Revised Selected Papers
Springer, 2022.
This book constitutes revised selected papers of the 10th International Conference on Analysis of Images, Social Networks and Texts, AIST 2021, held in Tbilisi, Georgia, in December 2021. Due to the COVID-19 pandemic the conference was held in hybrid mode.  The 17 full papers were carefully reviewed and selected from 118 submissions, out of which 92 ...
Added: October 31, 2022
Использование метода интеллектуального анализа данных для прогнозирования академически рискованных студентов в зависимости от их темперамента (на примере факультета ИМиКН в НИУ ВШЭ-Нижний Новгород)
Shadrina E. V., Вестник Нижегородского университета им. Н.И. Лобачевского. Серия: Социальные науки 2022 № 3(67) С. 229–236
The article discusses the influence of temperament on the academic performance of the first-year students at HSENizhny Novgorod on the example of the Faculty of Informatics, Mathematics and Computer Science. Analysis was held with the help of statistics methods and methods of data mining. The baseline data for the study is information about students, collected ...
Added: October 18, 2022
Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners’ and Doctoral Consortium -23rd International Conference, AIED 2022, Durham, UK, July 27–31, 2022, Proceedings, Part II
Springer, 2022.
This two-volume set LNAI 13355 and 13356 constitutes the refereed proceedings of the 23rd International Conference on Artificial Intelligence in Education, AIED 2022, held in Durham, UK, in July 2022. The 40 full papers and  40 short papers presented together with 2 keynotes, 6 industry papers, 12 DC papers, 6 Workshop papers, 10 Practitioner papers, 97 ...
Added: July 28, 2022
Credit scoring methods: latest trends and points to consider
Anton Markov, Zinaida Seleznyova, Victor Lapshin, Journal of Finance and Data Science 2022 Vol. 8 P. 180–201
Credit risk is the most significant risk by impact for any bank and financial institution. Accurate credit risk assessment affects an organisation’s balance sheet and income statement, since credit risk strategy determines pricing, and might even influence seemingly unrelated domains, e.g. marketing, and decision-making. This article aims at providing a systemic review of the most recent (2016–2021) articles, identifying ...
Added: July 28, 2022
Research of Correlation Dependencies in Russian Household Data Using Data Mining Methods
Usachev V., Brus V., Voronova L. et al., , in: Digitalization of Society, Economics and Management: A Digital Strategy Based on Post-pandemic DevelopmentsIssue 53.: Springer, 2022. P. 151–161.
Added: June 24, 2022
A Practical Study of Process Mining from Event Logs Using Machine Learning and Petry Net Models.
Nikitina V., Panfilov Peter, , in: Digitalization of Society, Economics and Management: A Digital Strategy Based on Post-pandemic DevelopmentsIssue 53.: Springer, 2022. Ch. 13 P. 173–185.
Added: April 22, 2022
21st IEEE International Conference on Data Mining Workshops, ICDMW 2021
IEEE Computer Society, 2021.
The 21th IEEE International Conference on Data Mining (IEEE ICDM 2021) is a premier and truly international conference for researchers and practitioners in the broad area of data mining. The ICDM Workshops program (IEEE ICDMW) aims to provide a platform for multiple workshops with a range of more focused topics to be discussed and explored, where attendees can present ...
Added: February 4, 2022
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit