Understanding cancer breakpoint determinants with omics data

Cheloshkina K.; M. Poptsova

doi:10.15761/ICST.1000333

Publications

Article

Understanding cancer breakpoint determinants with omics data

Integrative Cancer Science and Therapeutics. 2020. Vol. 7. P. 1–5.

Cheloshkina K., Poptsova M.

Over the last 20 years whole-genome sequencing of cancer genomes supported the phenomenon of cancer mutation heterogeneity both for point and structural variants. Alongside with the whole-genome sequencing projects many next-generation sequencing experiments including ChIP-seq for histone modifications and transcription factors, DNase-seq, MeDIP-Seq, Hi-C, and others were collected for thousands of cancer genomes. Machine learning approach became an efficient method of predictive modeling because machine learning algorithms are able to consider multiple factors and their interactions and range them in an order of importance. Machine learning models, predicting cancer point mutations at 1Mb scale and using as predictors state of the chromatin, epigenetic factors and non-B DNA structures, achieved a good predictive power. However, predicting cancer breakpoints appeared to be a more difficult task than predicting point mutations. Machine learning models, that were successfully used to predict cancer point mutations, using the same features, could not achieve high performance in predicting cancer breakpoints. Nevertheless, the available models demonstrate that aggregating information from omics experiments increases the model prediction power. Here we review state-of-the art machine learning approaches to predict cancer breakpoints and discuss current understanding of the determinants of cancer breakpoint formation.

Keywords: machine learning epigenetics non-B DNA cancer breakpoints breakpoint hotspots genome rearrangements

Exploring Semantic Concreteness and Abstractness for Metaphor Identification and Beyond

Badryzlova Y., , in: Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 17 июня — 20 июня 2020 г.)Вып. 19(26).: М.: Изд-во РГГУ, 2020. P. 33–47..

The paper presents a method for computing indexes of semantic concreteness and abstractness in two languages (Russian and English). These indexes are used in metaphor identification experiments in both languages; the results are either comparable to or surpass pervious work and the baselines. We analyze the obtained indexes of concreteness and abstractness to see how ...

Added: August 24, 2020

Artificial Intelligence. RCAI 2021. Lecture Notes in Computer Science

Springer, 2021..

This book constitutes the proceedings of the 19th Russian Conference on Artificial Intelligence, RCAI 2021, held in Moscow, Russia, in October 2021. The 19 full papers and 7 short papers presented in this volume were carefully reviewed and selected from 80 submissions. The conference deals with a wide range of topics, categorized into the following topical ...

Added: October 28, 2021

WIMS 2020: Proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics

Association for Computing Machinery (ACM), 2020..

On behalf of the conference chairs, we welcome you to the 10th International Conference on Web Intelligence, Mining and Semantics (WIMS'20) hosted by LIUPPA Lab of the University de Pau & Pays de l'Adour, Biarritz-France. The 1st WIMS conference was organized in Sogndal Norway. Since then, it has always been published by ACM. After 10 ...

Added: August 28, 2020

Proceedings of the Fifth International Workshop on Experimental Economics and Machine Learning (EEML 2019),Perm, Russia, September 26, 2019

CEUR Workshop Proceedings, 2019..

Proceedings of the Fifth Workshop on Experimental Economics and Machine Learning at the National Research Univeristy Higher School of Economics co-located with the Seventh International Conference on Applied Research in Economics (iCare7) ...

Added: October 23, 2019

Predictive Analytics Approach for Steel Billets Quality Control System

Belov A. V., Ekaterina A. Melekhova, Vorontsova T., , in: 2022 International Conference on Quality Management, Transport and Information Security, Information Technologies (IT&QM&IS).: St. Petersburg: IEEE, 2022. P. 219–223..

The paper deals with the problem of improving the quality of metal products. Nowadays destructive methods of quality control of the steel billets prevail at metallurgical enterprises. This approach to assessing the quality of the steel billets is wasteful, which increases its cost. One of the ways to reduce the cost of production of metal ...

Added: January 28, 2023

Application of Artificial Intelligence Methods for Improvement of Strategic Decision-Making in Logistics

Kitzmann H., Strimovskaya A., Serova E., , in: Transfer, Diffusion and Adoption of Next-Generation Digital Technologies. IFIP WG 8.6 International Working Conference on Transfer and Diffusion of IT, TDIT 2023 Nagpur, India, December 15–16, 2023 Proceedings, Part IIVol. 698.: Springer, 2024. P. 132–143..

Highly evolving economic environment requires from logistics companies fast response and agile solutions. Recently development of digital technologies gives significant advantages to logistics business. Hence many optimized processes belong to operational management level. At the same time the importance of digital technologies adoption to strategic management level should not be underestimated, as it allows gaining competitive advantages alongside the supply chain. ...

Added: January 12, 2024

Proceedings of the International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI at ECAI 2014)

Prague: CEUR Workshop Proceedings, 2014..

The first and the second edition of the FCA4AI Workshop showed that many researchers working in Artificial Intelligence are indeed interested by a well-founded method for classi- fication and mining such as Formal Concept Analysis (see http://www.fca4ai.hse.ru/). The first edition of FCA4AI was co-located with ECAI 2012 in Montpellier and published as http://ceur-ws.org/Vol-939/ while the ...

Added: September 12, 2014

Human knowledge models: Learning applied knowledge from the data

Dudyrev E., Semenkov Ilia, Kuznetsov S. et al., Plos One 2022 Vol. 17 No. 10 Article e0275814.

Artificial intelligence and machine learning have demonstrated remarkable results in science and applied work. However, present AI models, developed to be run on computers but used in human-driven applications, create a visible disconnect between AI forms of processing and human ways of discovering and using knowledge. In this work, we introduce a new concept of ...

Added: October 29, 2022

SEQUENCE-BASED AND STRUCTURE-BASED MACHINE-LEARNING MODELS FOR RECOGNITION OF 3’-END L1 AND ALU STEM-LOOPS IN HUMAN GENOME

Poptsova M., Шеин А. В., Zaikin A., , in: The proceedings of International congress «Biotechnology: state of the art and perspectives» FEBRUARY 25 - 27, 2019.: LLC “RED GROUP”, 2019. P. 356–356..

We built and evaluated two types of models: sequence-based and structure-based for recognition of 3’-end stem- loops of human L1s and Alus and found most important parameters contributing to recognition: Shift, Tilt and Rise, and aslo hydrophilicity. ...

Added: November 12, 2019

Faster variational inducing input Gaussian process classification

Izmailov P., Kropotov D., Journal of machine learning and data analysis 2017 Vol. 3 No. 1 P. 20–35.

Background: Gaussian processes (GP) provide an elegant and effective approach to learning in kernel machines. This approach leads to a highly interpretable model and allows using the Bayesian framework for model adaptation and incorporating the prior knowledge about the problem. The GP framework is successfully applied to regression, classification, and dimensionality reduction problems. Unfortunately, the ...

Added: December 6, 2018

Epileptogenic high-frequency oscillations present larger amplitude both in mesial temporal and neocortical regions

Karpychev V., Balatskaya A., Utyashev N. et al., Frontiers in Human Neuroscience 2022 No. 16 Article 984306.

High-frequency oscillations (HFO) are a promising biomarker for the identification of epileptogenic tissue. While HFO rates have been shown to predict seizure outcome, it is not yet clear whether their morphological features might improve this prediction. We validated HFO rates against seizure outcome and delineated the distribution of HFO morphological features. We collected stereo-EEG recordings ...

Added: October 1, 2022

Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics

Кузнецов А. С., Shvechikov P., Grishin A. et al., , in: International Conference on Machine Learning (ICML 2020)Vol. 119.: PMLR, 2020. P. 5556–5566..

Added: October 17, 2020

Machine-learning models for cancer breakpoints prediction based on DNA structure distributions

Cheloshkina K., Poptsova M., , in: Сборник трудов 42-й междисциплинарной школы-конференции ИППИ РАН "Информационные технологии и системы 2018".: Институт проблем передачи информации им. А.А. Харкевича РАН, 2018. P. 1–5..

With the advances in the sequencing technology the International Cancer Genome Consortium (ICGC) [1] and The Cancer Genome Atlas (TCGA) [2] collected data on more than 16 000 genome-wide pairs tumor-normal tissue providing a valuable resource to study cancer mutations. In this research we focus on pre- evaluation of the relationship between cancer breakpoint hotspots ...

Added: January 15, 2019

Deep learning approach for predicting functional Z-DNA regions using omics data

Beknazarov N., Jin S., Poptsova M., Scientific Reports 2020 Vol. 10 P. 19134.

Computational methods to predict Z-DNA regions are in high demand to understand the functional role of Z-DNA. The previous state-of-the-art method Z-Hunt is based on statistical mechanical and energy considerations about B- to Z-DNA transition using sequence information. Z-DNA CHiP-seq experiment results showed little overlap with Z-Hunt predictions implying that sequence information only is not ...

Added: December 11, 2020

Randomness in Cancer Breakpoint Prediction

Cheloshkina K., Bzhikhatlov I., Poptsova M., Journal of Computational Biology 2021 Vol. 28 No. 7 P. 716–731.

Cancer genomes are susceptible to multiple rearrangements by deleting, inserting, and translocating genomic regions. Recently, the problem of finding determinants of breakpoint formations was approached with machine learning methods; however, unlike cancer point mutations, breakpoint prediction appeared to be a more difficult task, and various machine learning models did not achieve high prediction power often ...

Added: September 10, 2021

Conserved microRNAs and Flipons Shape Gene Expression during Development by Altering Promoter Conformations

Herbert A., Pavlov F., Dmitrii Konovalov et al., International Journal of Molecular Sciences 2023 Vol. 24 No. 5 Article 4884.

The classical view of gene regulation draws from prokaryotic models, where responses to environmental changes involve operons regulated by sequence-specific protein interactions with DNA, although it is now known that operons are also modulated by small RNAs. In eukaryotes, pathways based on microRNAs (miR) regulate the readout of genomic information from transcripts, while alternative nucleic ...

Added: March 17, 2023

Pupillometry and autonomic nervous system responses to cognitive load and false feedback: an unsupervised machine learning approach

Evgeniia I. Alshanskaia, Portnova G., Liaukovich K. et al., Frontiers in Neuroscience 2024 Vol. 18 Article 1445697.

Objectives: Pupil dilation is controlled both by sympathetic and parasympathetic nervous system branches. We hypothesized that the dynamic of pupil size changes under cognitive load with additional false feedback can predict individual behavior along with heart rate variability (HRV) patterns and eye movements reflecting specific adaptability to cognitive stress. To test this, we employed an ...

Added: September 2, 2024

Models, Algorithms, and Technologies for Network Analysis. Springer Proceedings in Mathematics & Statistics

Springer, 2017..

This valuable source for graduate students and researchers provides a comprehensive introduction to current theories and applications in optimization methods and network models. Contributions to this book are focused on new efficient algorithms and rigorous mathematical theories, which can be used to optimize and analyze mathematical graph structures with massive size and high density induced ...

Added: June 26, 2017

Классификация коннектомов на основе локальных метрик на стохастических матрицах

Ivanov A., Petrov D., В кн.: Сборник статей конференции "Информационные технологии и системы" (ИТиС'16).: М.: ИППИ РАН, 2016. С. 509–516..

Многие графовые метрики основаны на предположении, что веса графа представляют расстояния между вершинами, которые мы можем складывать. Если считать эти метрики для стохастических матриц случайного блуждания на графе, то физический смысл вероятностей перехода между вершинами теряется (поскольку вероятности переходов перемножаются, а не складываются). Мы предлагаем решать эту проблему использованием отрицательных логарифмов весов ребер. Используя этот ...

Added: December 15, 2016

Hidden Feedback Loops in Machine Learning Systems: A Simulation Model and Preliminary Results

Anton Khritankov, , in: Software Quality: Future Perspectives on Software Engineering Quality: 13th International Conference, SWQD 2021, Vienna, Austria, January 19–21, 2021, Proceedings.: Springer, 2021. P. 54–65..

In this concept paper, we explore some of the aspects of quality of continuous learning artificial intelligence systems as they interact with and influence their environment. We study an important problem of implicit feedback loops that occurs in recommendation systems, web bulletins and price estimation systems. We demonstrate how feedback loops intervene with user behavior ...

Added: September 23, 2021

Что в профиле тебе моем: Данные «ВКонтакте» как инструмент изучения интересов современных подростков

Polivanova K. N., Smirnov I., Вопросы образования 2017 № 2 С. 134–152.

Children’s interests play a key role in their psychological development. However, research in this field is associated with serious methodological problems, as it has traditionally used questionnaire surveys that cannot adequately describe the diverse and dynamic world of interests of a developing person. The article suggests using the information on VKontakte communities followed by teenagers, ...

Added: July 21, 2017

Referential Choice: Predictability and Its Limits

Kibrik A. A., Khudyakova M., Dobrov G. B. et al., Frontiers in Psychology 2016 Vol. 7 No. 1429 P. 1–21.

We report a study of referential choice in discourse production, understood as the choice between various types of referential devices, such as pronouns and full noun phrases. Our goal is to predict referential choice, and to explore to what extent such prediction is possible. Our approach to referential choice includes a cognitively informed theoretical component, ...

Added: September 28, 2016

Meta-Learning with Memory-Augmented Neural Networks

Santoro A., Bartunov S., Botvinick M. et al., Journal of Machine Learning Research 2016 Vol. 48.

Despite recent breakthroughs in the applications of deep neural networks, one setting that presents a persistent challenge is that of "one-shot learning." Traditional gradient-based networks require a lot of data to learn, often through extensive iterative training. When new data is encountered, the models must inefficiently relearn their parameters to adequately incorporate the new information ...

Added: October 19, 2016

Proceedings of the International Conference «Wave Electronics and its Application in Information and Telecommunication Systems (WECONF)».IEEE # 47647. Saint Petersburg State University of Aerospace Instrumentation. June 03-07, 2019

Nazarov A., Сычев А. К., Voronkov I. M., IEEE, 2019..

The article describes the shortcomings of the modern datasets used in the development of next-generation intrusion detection systems and proposed new requirements for datasets. Based on the requirements, new software architecture has been proposed, which allows to model modern computer attacks and at the same time “mark up” logs generated on hosts and by network ...

Added: May 8, 2020