• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Randomness in Cancer Breakpoint Prediction
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 25, 2026
HSE Scientists Train Neural Network to 'Hear' Faults in Electric Motors
Researchers at the AI and Digital Science Institute of the HSE Faculty of Computer Science have developed a new method—the Signature-Guided Data Augmentation (SGDA) framework—that achieves 99% accuracy in motor fault detection and 86% accuracy in fault classification. The application of this approach can reduce industrial equipment repair costs, minimise downtime, and improve production safety. The study results have been published in Engineering Applications of Artificial Intelligence.
May 25, 2026
'The Humanities Serve as a Conscience'
Maria Mizernaia studies Soviet literature and the history of book publishing. In this interview for the HSE Young Scientists project, she discusses plans to publish a novel about besieged Leningrad, AI-provoked reflections on what it means to be human, and how novels can help satisfy our dopamine hunger.
May 25, 2026
Is It Possible to Predict a Citys Life Based on the Shape of Its Neighbourhoods?
Is it possible to predict, based on the configuration of streets and buildings, where a café will open or where traffic congestion will occur? Participants in the Spatial Analysis and Modelling of Urban Processes research and study group use open data and machine learning to identify universal patterns. Alexander Sheludkov and Eduard Somov discuss the purpose of comparing cities, the need for new forms of urban statistics, and how open data is transforming approaches to urban studies.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Randomness in Cancer Breakpoint Prediction

Journal of Computational Biology. 2021. Vol. 28. No. 7. P. 716–731.
Cheloshkina K., Bzhikhatlov I., Poptsova M.

Cancer genomes are susceptible to multiple rearrangements by deleting, inserting, and translocating genomic regions. Recently, the problem of finding determinants of breakpoint formations was approached with machine learning methods; however, unlike cancer point mutations, breakpoint prediction appeared to be a more difficult task, and various machine learning models did not achieve high prediction power often slightly exceeding the threshold of random guessing. This raised the question of whether the breakpoints are random noise in cancer mutagenesis or there exist determinants in structural mutagenesis. In the present study, we investigated randomness in cancer breakpoint genome distributions through the power of machine learning models to predict breakpoint hot spots. We divided all cancer types into three groups by degree of randomness in their breakpoint formation. We tested different density thresholds and explored the bias in hot spot definition. We also compared prediction of hot spots versus individual breakpoints. We found that hot spots are considerably better predicted than individual breakpoints; however, some individual breakpoints can also be predicted with a satisfactory power, and thus, it is not proper to filter them from analyses. We demonstrated that positive-unlabeled learning can provide insights into insufficiency of cancer data sets, which are not always reflected by data set sizes. Overall, the present results support the view that cancer breakpoint landscape can be represented by predictable dense breakpoint regions and scattered individual breakpoints, which are not all random noise, but some are generated by detectable mechanism.

Research target: Biology Basic Medicine
Priority areas: IT and mathematics
Language: English
DOI
Keywords: машинное обучениеmachine learningракгеномикаслучайный лесcancerМашинное обучение в Биоинформатикеcancer breakpointsCancer genome rearrangementsCancer breakpoint hotspotsrandom forestгеномыне перестановкиразрывы в раковых геномах
Publication based on the results of:
Аnalysis of regulatory alternative DNA structures (2021)
Similar publications
Novelty, Category and Orientation Tuning for Printed Characters: A Magnetoencephalography Study with Fast Periodic Visual Stimulation
Kochetkova Ekaterina, Kostanian D., Martynova O. et al., Brain Topography 2026 Vol. 39 No. 4 Article 51
Letter recognition is assumed to involve several levels of analysis, including coarse tuning for category and novelty and more fine tuning for specific features, related to letter orientation. We employed an oddball fast periodic visual stimulation (FPVS) paradigm with magnetoencephalography (Elekta VectorView, 306 sensors) to study neural discrimination responses in the source space. Using contrasts ...
Added: May 24, 2026
Molecular dynamics simulations refine the pathogenicity of ACVRL1 kinase domain variants by quantifying impacts on ATP binding in pulmonary arterial hypertension
Borovikova I., Okhrimenko G., Zamyatin V. et al., Journal of Structural Biology 2026 Vol. 218 No. 2 Article 108315
Single amino acid substitutions in the ATP-binding domain of ACVRL1, a key receptor in the bone morphogenetic protein (BMP) signaling pathway, are frequently classified as variants of uncertain significance (VUS), complicating molecular diagnosis for pulmonary arterial hypertension (PAH) and Hereditary Hemorrhagic Telangiectasia (HHT). Since aberrant ATP binding disrupts downstream SMAD1/5/8 phosphorylation, we employed molecular dynamics ...
Added: May 20, 2026
ML-based Fast Simulation of FARICH Responses
Shipilov F., Barnyakov A., Ivanov A. et al., / Series Physics "arxiv.org". 2026.
A fast simulation of the detector response is a vital task in high-energy physics (HEP). Traditional Monte-Carlo methods form the backbone of modern particle physics simulation software but are computationally expensive. We present a machine-learning-based approach to fast simulation of the Focusing Aerogel Ring Imaging Cherenkov (FARICH) detector response. Given a particle track and momentum, ...
Added: May 19, 2026
uPAR deficiency triggers TGFβ1-mediated fibrotic remodeling in a cardiac perivascular-like microenvironment
Goltseva Y., Tsokolaeva Z., Beloglazova I. et al., Stem Cell Research and Therapy 2026 Vol. 17 No. 1
Background Cardiac fibrosis represents a significant health burden, with endothelial dysfunction and damaged perivascular microenvironment increasingly recognized as key contributors to fibrotic remodeling. The urokinase plasminogen activator receptor (uPAR), a critical component of the urokinase system, plays a pivotal role in vascular remodeling and fibrosis. While prior evidence indicates that uPAR deficiency leads to microvascular dysfunction and perivascular fibrosis, ...
Added: May 19, 2026
От неизвестности к прозрачности: обзор технологий объяснимого ИИ (XAI)
Avdoshin S. M., Pesotskaya E. Y., Информационные технологии 2026 Т. 32 № 4 С. 185–194
With the rapid advancement of artificial intelligence, and deep learning in particular, models have emerged that are capable of delivering highly accurate predictions. However, the internal logic of such models remains difficult to interpret—an issue of critical importance, especially in domains where the correctness of an algorithm directly affects high-stakes decision-making. One promising avenue for ...
Added: May 8, 2026
Motor imagery perspective shapes corticospinal excitability with effector-specific effects
Perevoznyuk G., Batov A., Pleskovskaya A. et al., Scientific Reports 2026 Vol. 16 Article 13098
Motor imagery (MI) allows individuals to mentally simulate movements without execution, engaging neural pathways that overlap with those used during real actions. However, how imagery perspective influences corticospinal excitability across different effectors remains unclear. Using neuronavigated transcranial magnetic stimulation (TMS), we compared kinesthetic (KMI), first-person visual (VMI-1PP), and third-person visual (VMI-3PP) imagery of elbow flexion-extension ...
Added: May 5, 2026
Мультимодальные модели в медицинской диагностике как универсальный инструмент
Назаренко А. Г., Федоров М. В., Moshkin A. et al., Вестник Росздравнадзора 2026 № 1 С. 14–29
Multimodal foundation models and medical multimodal large language models are establishing a new class of diagnostic clinical decision support systems capable of operating on heterogeneous data sources, including medical imaging (X-ray, CT, MRI, ultrasound, histopathology), physiological signals (ECG, EEG), clinical text (electronic health records, reports, discharge summaries), laboratory measurements, molecular profiling data, and related modalities. ...
Added: May 4, 2026
Современные методы анализа временных рядов в мониторинге и прогнозировании состояния оборудования для механизированной добычи
Neznanov A., Glushko A., Овчинников С. et al., В кн.: Интеллектуальный анализ данных в нефтегазовой отрасли.: М.: ООО «Геомодель Развитие», 2024. С. 140–143.
With the development of monitoring systems, now we have the opportunity to collect key performance indicators of devices in the process of artificial lift. Every day a huge amount of telemetry is generated by our devices, which can be used to forecast the working mode and health state of the equipment after the process of ...
Added: April 29, 2026
Natural hazard database from Internet publications: text mining with a large language model
Derkacheva A., Sakirkina M., Kraev G. et al., /. 2026.
Comprehensive data on natural hazards and their consequences are crucial for effective for risk assessment, adaptation planning, and emergency response. However, many countries face challenges with fragmented, inconsistent, and inaccessible data, particularly regarding local-scale events. To address this data gap in Russia, we developed an end-to-end processing pipeline that scrapes news from various online sources, ...
Added: April 28, 2026
Multimodal EEG, ECG, and video dataset of yoga practitioners during concentration and mind-wandering tasks
Kashevnik A., Glekler E., Artemenko E. et al., Scientific data 2026 Article 7209
We present a multimodal dataset containing electroencephalography (EEG), electrocardiography (ECG), and video recordings from 49 participants. Each participant completed a single session lasting approximately 45 minutes. During each session they performed five tasks: resting state, inward concentration (focusing on the center of the forehead), outward concentration (visual search), and mind-wandering. EEG data were recorded from ...
Added: April 23, 2026
Machine Learning Approach to Anticancer Activity Prediction of Transition-Metal Complexes Based on a Large-Scale Experimental Database
Krasnov L., Malikov D., Kiseleva M. et al., Journal of Medicinal Chemistry 2026 Vol. 69 No. 8 P. 8838–8851
In this work, we developed a straightforward data-driven approach to predict the cytotoxicity of metal complexes based entirely on their (metal + ligands) composition. To this end, we have manually curated MetalCytoToxDB─a comprehensive experimental database comprising 26,500 IC50 values for 7050 metal complexes against 754 cell lines from 1921 articles. Based on these, machine learning ...
Added: April 23, 2026
LSTM-модель потребления тепловой энергии в многоэтажном жилом здании
Ершов И. А., Системная инженерия и инфокоммуникации 2025 № 4 С. 11–14
The heat consumption of residential buildings is a stochastic series. It is necessary for the design of thermal energy regulators the creation of a neural network model. In the paper, the model is carried out based on Long Short-Term Memory (LSTM). The high accuracy of reproducing the series was achieved by training the model on ...
Added: April 22, 2026
Rapid synthesis of redox-responsive trithiocyanuric acid-based nanocarriers: in vitro multidrug resistance reversal and In vivo safety assessment
Kopoleva E., Sergey A. Tsymbal, Kuchur O. et al., Drug Delivery and Translational Research 2026 P. 1–16
Multidrug resistance (MDR) remains one of the principal challenges in cancer chemotherapy, necessitating new strategies to restore drug efficacy. In this study we developed and characterize a redox-responsive nanoplatform based on trithiocyanuric acid and polyethylene glycol (TTCA-PEG NPs) as a potential tool to overcome MDR and provide a biocompatible delivery system for conventional antitumor drugs. ...
Added: April 20, 2026
Алгоритм анализа новостной информации для принятия экономических решений
Чудинова О. С., Первицкая Л. А., Ramenskaya A., Индустриальная экономика 2026 № 1 С. 65–78
This article is devoted to the development of an algorithm for analyzing news information using machine learning methods implemented in Python libraries. The choice of tools used at each stage of the algorithm is justified by calculating metrics for the quality of the solution to the corresponding machine learning problems. The algorithm’s results are presented ...
Added: April 20, 2026
Algorithmic overlaps as thermodynamic variables: from local to cluster Monte Carlo dynamics in critical phenomena
Pilé I., Deng Y., Shchur L., / Series arXiv "math". 2026. No. 2604.10254.
We investigate the spatial overlap of successive spin configurations in Markov chain Monte Carlo simulations using the local Metropolis algorithm and the Svendsen-Wang and Wolff cluster algorithms. We examine the dynamics of these algorithms for two models in different universality classes: the Ising model and the Potts model with three components. The overlap of two ...
Added: April 20, 2026
Modeling cosolvent effects on solubility in supercritical CO2 using data-driven approaches
Makarov D. M., Kalikin N., Gurikov P. et al., Journal of Supercritical Fluids 2026 Vol. 235 Article 106979
Supercritical CO2 (scCO2 ) is an environmentally friendly solvent, but its low polarity limits the solubility of polar compounds. Cosolvents are commonly used to enhance solvation capability, yet comprehensive datadriven studies are scarce. We compiled the largest dataset to date — 4401 experimental solubility records with 22 cosolvents for 93 nonionic solutes, plus 4855 records ...
Added: April 19, 2026
Функциональные признаки листьев и экологические стратегии важны для формирования растительных сообществ субальпийских болот и высокотравья
Гулов Д. М., Елумеева Т. Г., Федоров Н. И. et al., Журнал общей биологии 2024 Т. 85 № 2 С. 83–94
Plant functional traits are important for the formation of plant communities and for the ability of plants to dominate there. The comparison of mean trait values of organisms within community with that for the random samples of the local biota allows estimating the importance of the trait for the formation of the community composition. The ...
Added: April 19, 2026
The partitioning of mobile fractions of chemical elements in the mineral loamy soils during natural regeneration after plowing in Western Russia
Enchilik P. R., G.V.Klink, Semenkov I. N., Science of the Total Environment 2025 Vol. 998 Article 180219
Natural regeneration of forest soils affected by agricultural practices is a common phenomenon in many regions worldwide, particularly in boreal ecosystems. Affecting soil quality, the partitioning of extractable fractions of heavy metals and metalloids (HMMs) is not studied during natural regeneration. The following stages of natural regeneration after plowing have been identified in the Smolenskoye ...
Added: April 19, 2026
Self-face viewing attenuates cardiac modulation of corticospinal excitability
Makarova M., Fedosov N., Mikhailova I. et al., FRONTIERS IN SIGNAL PROCESSING 2026 Vol. 6 Article 1776807
Introduction:  While self-referential attention is thought to enhance interoceptive sensitivity, its effect on cardiac modulation of corticospinal excitability remains unexplored. This pilot study investigated how viewing one’s own face (self-face processing) modulates the cardiac-phase coupling of motor output and whether this heart-brain coupling depends on interoceptive accuracy (heartbeat perception). Methods:  In 15 healthy adults, motor-evoked potentials (MEPs) were ...
Added: April 17, 2026
Эффективность применения прогнозов волатильности в активных торговых стратегиях институциональных инвесторов на российском рынке акций
Lysenok N., Фундаментальная и прикладная математика 2026 Т. 26 № 3 С. 33–42
This study examines the impact of realized volatility forecasts on the performance of active trading strategies in the Russian equity market. Using a sample of 17 liquid stocks over the period 2014–2026, a hybrid forecasting model is developed that combines HAR-J with gradient boosting; its superiority over the baseline HAR-J specification is confirmed by the ...
Added: April 17, 2026
Inactivation of NONO by Auranofin or RNA Interference Triggers Lethal Oxidative Stress in Neuroblastoma Cells
Pogodaeva S. S., Miletina O. O., Mammadova L. V. et al., Frontiers in Bioscience - Landmark 2026 Vol. 31 No. 4 Article 48544
Background: Neuroblastoma (NB), a transcriptionally driven pediatric malignancy, exhibits a remarkable clinical and biological heterogeneity. Two major subtypes, adrenergic and mesenchymal, are differentially governed by distinct subsets of transcription factors that constitute the core regulatory circuitry (CRC). The adrenergic subtype is often associated with MYCN amplification and is particularly aggressive and therapy-resistant, underscoring the need ...
Added: April 15, 2026
Discovery of Novel 1,3,5-Triazine Derivative as Potent Anticancer Agent: DNA and HSA Binding, MTT Assay
Chernov I., Protas A., Popova E. et al., Biochemistry. Biokhimiia 2025 Vol. 19 No. 3 P. 279–287
This article reports the synthesis, characterization, cytotoxic activity, DNA binding study and antioxidant activity of N-(2-(2-(2-azidoethoxy)ethoxy)ethyl)-4,6-di(aziridin-1-yl)-1,3,5-triazin-2-amine. The synthesized compound was found to possess cytotoxic effects against six various cancer cell lines including HeLa, HCT116, T96G, A549, MCF7, and PANC-1 and non-cancer ECV cell line. Additionally, the capacity for DNA binding was assessed. The addition of 1,3,5-triazine ...
Added: April 15, 2026
Saying “Yes” to NONO: A Therapeutic Target for Neuroblastoma and Beyond
Pogodaeva S., Miletina O., Antipova N. et al., Cancers 2025 Vol. 17 No. 19 Article 3228
Pediatric tumors such as neuroblastoma are characterized by a genome-wide ‘transcriptional burden’, surmising the involvement of multiple alterations of gene expression. Search for master regulators of transcription whose inactivation is lethal for tumor cells identified the non-POU domain-containing octamer-binding protein (NONO), a member of the Drosophila Behavior/Human Splicing family known for the ability to form ...
Added: April 15, 2026
Metal-Based Therapeutic Approaches for Overcoming Cancer Drug Resistance: Mechanisms, Drug Delivery Strategies, and Clinical Perspectives
Chernov K., Savin A., Otvodnikova D. et al., Oncology Research 2026 Vol. - No. -
The formation of drug resistance poses the ultimate threat in modern oncology. Targeted therapy lacks versatility, while conventional therapy is famous for its side effects. However, for the new therapeutics to address the challenge of drug resistance, such compounds should combine properties of both modalities. In this review, we argue that metal-based therapeutics are paramount ...
Added: April 15, 2026
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit