### Book chapter

## Statistical uncertainty of minimum spanning tree in market network

The paper deal with uncertainty in market network analysis. The main problem addressed is to investigate statistical uncertainty of Kruskal algorithm for the minimum spanning tree in market network. Uncertainty of Kruskal algorithm is measured by the probability of q incorrectly included edges. Numerical experiments are conducted with the returns of a set of 100 financial instruments traded in the US stock market over a period of 250 days in 2014. Obtained results help to estimate the reliability of minimum spanning tree in market network analysis.

Research into the market graph is attracting increasing attention in stock market analysis. One of the important problems connected with the market graph is its identification from observations. The standard way of identifying the market graph is to use a simple procedure based on statistical estimations of Pearson correlations between pairs of stocks. Recently a new class of statistical procedures for market graph identification was introduced and the optimality of these procedures in the Pearson correlation Gaussian network was proved. However, the procedures obtained have a high reliability only for Gaussian multivariate distributions of stock attributes. One of the ways to correct this problem is to consider different networks generated by different measures of pairwise similarity of stocks. A new and promising model in this context is the sign similarity network. In this paper the market graph identification problem in the sign similarity network is reviewed. A new class of statistical procedures for the market graph identification is introduced and the optimality of these procedures is proved. Numerical experiments reveal an essential difference in the quality between optimal procedures in sign similarity and Pearson correlation networks. In particular, it is observed that the quality of the optimal identification procedure in the sign similarity network is not sensitive to the assumptions on the distribution of stock attributes.

Full texts of third international conference on data analytics are presented.

Problem of construction of the market graph as a multiple decision statistical problem is considered. Detailed description of a optimal unbiased multiple decision statistical procedure is given. This procedure is constructed using the Lehmann’s theory of multiple decision statistical procedures and the conditional tests of the Neyman structures. The equations for thresholds calculation for the tests of the Neyman structure are presented and analyzed.

The main goal of the present paper is the development of general approach to network analysis of statistical data sets. First a general method of market network construction is proposed on the base of idea of measures of association. It is noted that many existing network models can be obtained as a particular case of this method. Next it is shown that statistical multiple decision theory is an appropriate theoretical basis for market network analysis of statistical data sets. Finally conditional risk for multiple decision statistical procedures is introduced as a natural measure of quality in market network analysis. Some illustrative examples are given.

A class of distribution free multiple decision statistical procedures is proposed for threshold graph identification in correlation networks. The decision procedures are based on simultaneous application of sign statistics. It is proved that single step, step down Holm and step up Hochberg statistical procedures for threshold graph identification are distribution free in sign similarity network in the class of elliptically contoured distributions. Moreover it is shown that these procedures can be adapted for distribution free threshold graph identification in Pearson correlation network.

Market graph is known to be a useful tool for market network analysis. Cliques and independent sets of the market graph give an information about con- centrated dependent sets of stocks and distributed independent sets of stocks on the market. In the present paper the connections between market graph and classical Markowitz portfolio theory are studied. In particular, efﬁcient frontiers of cliques and independent sets of the market graph are compared with the efﬁcient frontier of the market. The main result is: efﬁcient frontier of the market can be well ap- proximated by the efﬁcient frontier of the maximum independent set of the market graph constructed on the sets of stocks with the highest Sharp ratio. This allows to reduce the number of stocks for portfolio optimization without the loss of quality of obtained portfolios. In addition it is shown that cliques of the market graphs are not suitable for portfolio optimization.

Random matrix theory (RMT) is applied to investigate the cross-correlation matrix of a financial time series in four different stock markets: Russian, American, German, and Chinese. The deviations of distribution of eigenvalues of market correlation matrix from RMT global regime are investigated. Specific properties of each market are observed and discussed.

Using network models to investigate the interconnectivity in modern economic systems allows researchers to better understand and explain some economic phenomena. This volume presents contributions by known experts and active researchers in economic and financial network modeling. Readers are provided with an understanding of the latest advances in network analysis as applied to economics, finance, corporate governance, and investments. Moreover, recent advances in market network analysis that focus on influential techniques for market graph analysis are also examined. Young researchers will find this volume particularly useful in facilitating their introduction to this new and fascinating field. Professionals in economics, financial management, various technologies, and network analysis, will find the network models presented in this book beneficial in analyzing the interconnectivity in modern economic systems.