Reliability of maximum spanning tree identification in correlation-based market networks

V. A. Kalyagin; A. P. Koldanov; P. A. Koldanov

doi:10.1016/j.physa.2022.127482


Physica A: Statistical Mechanics and its Applications. 2022. Vol. 599. Article 127482.

The maximum spanning tree is a popular tool in market network analysis. A large number of publications are devoted to calculating the maximum spanning tree and interpreting it for particular stock markets. Usually, one uses market data to calculate Pearson correlations between stock returns and constructs a complete weighted graph whose edge weights are the calculated correlations. The maximum spanning tree of the resulting network is then calculated and its market interpretation is discussed.
However, Pearson correlation is not the only similarity measure that can be used for market network analysis. Different measures of similarity generate different market networks and, as a consequence, different maximum spanning trees. The main goal of the present paper is to analyze the key points of this difference. We show that it is related to the uncertainty (reliability) of maximum spanning tree identification in different networks. We study uncertainty within the framework of the random variable network (RVN) concept. We consider different correlation-based networks in the large class of elliptical distributions. We show that the true maximum spanning tree is the same in three correlation networks: the Pearson correlation network, the Fechner correlation network, and the Kendall correlation network. This means that, from a theoretical point of view, there is no difference between the maximum spanning trees in these networks. The observed difference between maximum spanning trees in different networks can therefore be explained by the uncertainty of identifying the maximum spanning tree from observations. We argue that, among different measures of uncertainty, the FDR (False Discovery Rate) is the most appropriate for measuring the uncertainty (reliability) of maximum spanning tree identification. We investigate the FDR of the Kruskal algorithm for maximum spanning tree identification and show that the reliability of maximum spanning tree identification differs across these three networks. In particular, for the Pearson correlation network the FDR depends essentially on the distribution of stock returns. We prove that for the market network with Fechner correlation the FDR is not sensitive to the assumption on the distribution of stock returns. Some interesting phenomena are discovered for the Kendall correlation network. Our experiments show that the FDR of the Kruskal algorithm for maximum spanning tree identification in the Kendall correlation network depends only weakly on the distribution, and at the same time the value of the FDR is almost the best in comparison with maximum spanning tree identification in the other networks.
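
The pipeline described in the abstract (correlation matrix, complete weighted graph, Kruskal algorithm, error rate of the identified tree) can be sketched as follows. This is a minimal illustration, not the authors' code: the function names, the five-stock correlation matrix `true_corr`, the Gaussian returns, and the sample sizes are all assumptions chosen for the example. The FDR is approximated here by averaging, over simulated samples, the fraction of edges in the identified tree that are absent from the true tree.

```python
import numpy as np

def pearson_matrix(x):
    """Pearson correlations; x has shape (n_observations, n_stocks)."""
    return np.corrcoef(x, rowvar=False)

def fechner_matrix(x):
    """Fechner (sign) correlations: mean product of signs of deviations from the sample mean."""
    s = np.sign(x - x.mean(axis=0))
    return s.T @ s / x.shape[0]

def kendall_matrix(x):
    """Kendall correlations: mean sign concordance over all pairs of observations."""
    i, j = np.triu_indices(x.shape[0], k=1)
    d = np.sign(x[i] - x[j])  # one row per pair of observations
    return d.T @ d / d.shape[0]

def kruskal_max_spanning_tree(w):
    """Kruskal algorithm on the complete graph with weight matrix w.

    Returns the tree as a set of edges (a, b) with a < b.
    """
    n = w.shape[0]
    parent = list(range(n))  # union-find forest

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]  # path halving
            a = parent[a]
        return a

    # Scan edges from the largest weight to the smallest.
    edges = sorted(
        ((float(w[a, b]), a, b) for a in range(n) for b in range(a + 1, n)),
        reverse=True,
    )
    tree = set()
    for _, a, b in edges:
        ra, rb = find(a), find(b)
        if ra != rb:  # the edge joins two components, so it creates no cycle
            parent[ra] = rb
            tree.add((a, b))
    return tree

def false_discovery_proportion(estimated, true):
    """Fraction of edges in the estimated tree that are not in the true tree."""
    return len(estimated - true) / len(estimated)

def estimate_fdr(corr_fn, true_corr, n_obs, n_rep, rng):
    """Monte Carlo estimate of the FDR under Gaussian returns (an assumption of this sketch)."""
    true_tree = kruskal_max_spanning_tree(true_corr)
    zero = np.zeros(true_corr.shape[0])
    total = 0.0
    for _ in range(n_rep):
        x = rng.multivariate_normal(zero, true_corr, size=n_obs)
        total += false_discovery_proportion(kruskal_max_spanning_tree(corr_fn(x)), true_tree)
    return total / n_rep

# Hypothetical true network: correlation 0.8 ** |i - j|, whose maximum spanning tree is a path.
idx = np.arange(5)
true_corr = 0.8 ** np.abs(idx[:, None] - idx[None, :])
rng = np.random.default_rng(0)
for name, fn in [("Pearson", pearson_matrix), ("Fechner", fechner_matrix), ("Kendall", kendall_matrix)]:
    print(name, estimate_fdr(fn, true_corr, n_obs=50, n_rep=200, rng=rng))
```

The true tree is computed from the Pearson matrix only, which relies on the abstract's statement that the true maximum spanning tree coincides in the three networks for elliptical distributions. Replacing the Gaussian sampler with a heavier-tailed elliptical distribution would be the natural way to probe the distribution sensitivity discussed in the abstract; no numerical results are claimed here.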

Research target: Computer Science Mathematics

Language: English

Keywords: Statistical uncertainty; Market network model; False discovery rate; Random variable networks; Maximum spanning tree; Correlation-based network; Distribution-free statistical procedures

Publication based on the results of:

Modern approaches to analysis of network structures (2022)

RISK FUNCTION AND OPTIMALITY OF STATISTICAL PROCEDURES FOR IDENTIFICATION OF NETWORK STRUCTURES

Koldanov P., Ученые записки Казанского университета. Серия: Физико-математические науки 2018 Vol. 160 No. 2 P. 317–326

Identification of network structures using the finite-size sample has been considered. The concepts of random variables network and network model, which is a complete weighted graph, have been introduced. Two types of network structures have been investigated: network structures with an arbitrary number of elements and network structures with a fixed number of elements of the network model. The ...

Added: February 13, 2019

Formal Concept Analysis: 16th International Conference, ICFCA 2021, Strasbourg, France, June 29 – July 2, 2021, Proceedings

Springer, 2021.

This book constitutes the proceedings of the 16th International Conference on Formal Concept Analysis, ICFCA 2021, held in Strasbourg, France, in June/July 2021. The 14 full papers and 5 short papers presented in this volume were carefully reviewed and selected from 32 submissions. The book also contains four invited contributions in full paper length. The research part ...

Added: July 10, 2021

A Parallel Algorithm to Detect Structural Breaks in Time Series

Furmanov K. K., Nikol'skii I. M., Computational Mathematics and Modeling 2016 Vol. 27 No. 2 P. 247–253

Added: December 22, 2016

A Note on a Single Machine Scheduling Problem with Generalized Total Tardiness Objective Function

Gafarov E., Lazarev A. A., Information Processing Letters 2012 Vol. 112 No. 3 P. 72–76

In this note, we consider a single machine scheduling problem with generalized total tardiness objective function. A pseudo-polynomial time solution algorithm is proposed for a special case of this problem. Moreover, we present a new graphical algorithm for another special case, which corresponds to the classical problem of minimizing the weighted number of tardy jobs on a single ...

Added: November 24, 2012

Mathematics and Interdisciplinary Research – 2020

Perm: Perm State National Research University, 2020.

The collection presents papers by participants of the All-Russian scientific and practical conference of young scientists with international participation "Mathematics and Interdisciplinary Research – 2020". More than one hundred papers were submitted to the conference from various regions of Russia, as well as from countries near and far abroad. Following the work of the expert committee, sixty-two papers were selected for publication. Each paper was evaluated by a group of experts in the field addressed by the author. The presented ...

Added: December 10, 2020

Construction of a confidence set of connected stocks of the stock market

Koldanov A. P., Koldanov P., Semenov D., Журнал Новой экономической ассоциации 2021 Vol. 2 No. 50 P. 12–34

The problem of analyzing pairwise connections between stocks of a financial market from observations of stock returns is considered. Such a problem arises in stock market network analysis. It is assumed that the joint distribution of stock returns belongs to the wide class of elliptical distributions. Classical Pearson correlation, Fechner correlation and Kendall correlation are used ...

Added: June 17, 2021

On some slowly terminating term rewriting systems

Beklemishev L. D., Оноприенко А. А., Математический сборник 2015 Vol. 206 No. 9 P. 3–20

We formulate some term rewriting systems in which the number of computation steps is finite for each output, but this number cannot be bounded by a provably total computable function in Peano arithmetic PA. Thus, the termination of such systems is unprovable in PA. These systems are derived from an independent combinatorial result known as the Worm ...

Added: March 13, 2016

Fifth International Conference "System Analysis and Information Technologies" SAIT-2013 (September 19–25, 2013, Krasnoyarsk, Russia): Conference Proceedings. In 2 volumes.

Krasnoyarsk: Institute of Computational Modelling SB RAS, 2013.

Proceedings of the Fifth International Conference "System Analysis and Information Technologies" SAIT-2013 (September 19–25, 2013, Krasnoyarsk, Russia): ...

Added: November 18, 2013

The complexity of the 3-colorability problem in the absence of a pair of small forbidden induced subgraphs

Malyshev D., Discrete Mathematics 2015 Vol. 338 No. 11 P. 1860–1865

We completely determine the complexity status of the 3-colorability problem for hereditary graph classes defined by two forbidden induced subgraphs with at most five vertices. ...

Added: April 7, 2014

Agent-based modelling of interactions between air pollutants and greenery using a case study of Yerevan, Armenia

Akopov A. S., Beklaryan L. A., Saghatelyan A. K., Environmental Modelling and Software 2019 Vol. 116 P. 7–25

Urban greenery such as trees can effectively reduce air pollution in a natural and eco-friendly way. However, how to spatially locate and arrange greenery in an optimal way remains a challenging task. We developed an agent-based model of air pollution dynamics to support the optimal allocation and configuration of tree clusters in a city. The Pareto ...

Added: February 24, 2019

Logic in Central and Eastern Europe: History, Science, and Discourse

Lanham: University Press of America, 2012.

The history of logic and analytic philosophy in Central and Eastern Europe is still known to very few people. As an exception to the rule, only two scientific schools became internationally popular: the Vienna Circle and the Lvov-Warsaw School. Nevertheless, the countries included in this region have not only joint history, but also joint cultural ...

Added: February 13, 2013

Measures of uncertainty in market network analysis

Kalyagin V.A., Koldanov A.P., Koldanov P.A. et al., Physica A: Statistical Mechanics and its Applications 2014 Vol. 413 No. 1 P. 59–70

A general approach to measure statistical uncertainty of different filtration techniques for market network analysis is proposed. Two measures of statistical uncertainty are introduced and discussed. One is based on conditional risk for multiple decision statistical procedures and another one is based on average fraction of errors. It is shown that for some important cases ...

Added: July 19, 2014

Scenario simulation of autonomous vehicle motion in an artificial road network using FLAME GPU

Akopov A. S., Beklaryan A., Искусственные общества 2021 Vol. 16 No. 1 P. 1–23

This article presents a model of the ground autonomous vehicles (AVs) motion in the Artificial Road Network (ARN) belonging to the "Manhattan Lattice" type with the implementation of the large-scale agent-based modeling framework FLAME GPU. The most important scenarios of the traffic situation development are investigated, in particular, which are associated with reducing visibility on ...

Added: April 1, 2021

Using artificial intelligence methods in the study of the personality of serial killers

Yasnitsky L., Ваулева С. В., Сафонова Д. Н. et al., Всероссийский криминологический журнал 2015 Vol. 9 No. 3 P. 423–430

Modern criminalists do not share a common opinion regarding the choice of parameters which could be used to work out a system of characteristics to differentiate a maniac killer from an ordinary person. This hinders the development of efficient software for investigation purposes. The paper describes the experience of developing a neural network that can ...

Added: October 1, 2015

Particle Simulation for Predicting Effective Properties of Short Fiber Reinforced Composites

Skoptsov K. A., Sheshenin S., Galatenko V. V. et al., International Journal of Applied Mechanics 2016 Vol. 8 No. 2 P. 1650016-01–1650016-18

We present a method for evaluating elastic properties of a composite material produced by molding a resin filled with short elastic fibers. A flow of the filled resin is simulated numerically using a mesh-free method. After that, assuming that spatial distribution and orientation of fibers are not significantly changed during polymerization, effective elastic moduli of ...

Added: May 22, 2016

Assessment of the employment of firefighting crews and the risks of their untimely arrival at the protected object

Litvin Y. V., Абрамов И. В., Технологии техносферной безопасности 2016 No. 66

An advanced approach is presented to the assessment of the random arrival time of firefighting crews at the protected object, the time of their employment, and the free combustion time. Some quantitative assessments are given, with a review of analytical methods and simulation ...

Added: August 27, 2016

Parallelization of matrix Algorithms for Gröbner basis computation

Alexandrov D. E., Galkin V. V., Zobnin A.I. et al., Journal of Mathematical Sciences 2009 Vol. 163 No. 5 P. 469–486

Sequential and parallel implementations of the F4 algorithm for computing Gröbner bases of polynomial ideals are discussed. ...

Added: October 1, 2014

Hardness of Approximation for H-free Edge Modification Problems

Bliznets Ivan, Cygan M., Komosa P. et al., ACM Transactions on Computation Theory 2018 Vol. 10 No. 2 P. 1–32

The H-free Edge Deletion problem asks, for a given graph G and integer k, whether it is possible to delete at most k edges from G to make it H-free—that is, not containing H as an induced subgraph. The H-free Edge Completion problem is defined similarly, but we add edges instead of deleting them. The study of these two problem families has recently been the subject of intensive studies from the point of ...

Added: October 30, 2018

On one-dimensional projections of polytopes of discrete optimization problems

Vyalyi M., Дискретная математика 1991 Vol. 3 No. 3 P. 35–45

Added: October 17, 2014

Algorithms and methods for solving scheduling problems and other extremum problems on large-scale graphs

Chernyshev S. V., Cherepanov E. A., Pankratiev E. V. et al., Journal of Mathematical Sciences 2005 Vol. 128 No. 6 P. 3487–3495

Added: January 27, 2014

Numerical simulation of alloy solidification under intense conjugate heat transfer

Marshirov V. V., Marshirova L. E., Сибирский журнал индустриальной математики 2013 Vol. XVI No. 4 P. 111–120

The paper considers the problem of determining the cooling rate of a metal during solidification as it crosses the liquidus temperature under intense heat removal from the surface. Solving this problem is necessary to determine the process conditions and the boundary and initial conditions under which it is possible to get new ...

Added: November 17, 2013

Proceedings 10th International Conference on Terminology and Artificial Intelligence TIA 2013

P.: Université Paris 13 - Paris Sorbonne Cité, 2013.

In this workshop we will bring together participants who have solutions for one or more of the following problems: How can mutual understanding be optimized with the help of technology in hospitals where both patients and professionals have varying language skills, cultural backgrounds and cognitive capacities? Can domain ontologies, natural language processing tools, multilingual knowledge-based ...

Added: December 18, 2014

Complete complexity dichotomy for 7-edge forbidden subgraphs in the edge coloring problem

Malyshev D., Journal of Applied and Industrial Mathematics (translation of the journals "Сибирский журнал индустриальной математики" and "Дискретный анализ и исследование операций") 2020 Vol. 14 No. 4 P. 706–721

The edge coloring problem for a graph is to minimize the number of colors that are sufficient to color all edges of the graph so that all adjacent edges receive distinct colors. The computational complexity of the problem is known for all graph classes defined by forbidden subgraphs with at most 6 edges. We improve ...

Added: January 30, 2021

Belief Functions: Theory and Applications

Dordrecht, L., Heidelberg, NY: Springer, 2014.

This book constitutes the thoroughly refereed proceedings of the Third International Conference on Belief Functions, BELIEF 2014, held in Oxford, UK, in September 2014. The 47 revised full papers presented in this book were carefully selected and reviewed from 56 submissions. The papers are organized in topical sections on belief combination; machine learning; applications; theory; ...

Added: October 1, 2014