Non-Asymptotic Analysis of Approximations for Multivariate Statistics
This book presents recent non-asymptotic results for approximations in multivariate statistical analysis. The book is unique in its focus on results with the correct error structure for all the parameters involved. Firstly, it discusses the computable error bounds on correlation coefficients, MANOVA tests and discriminant functions studied in recent papers. It then introduces new areas of research in high-dimensional approximations for bootstrap procedures, Cornish–Fisher expansions, power-divergence statistics and approximations of statistics based on observations with random sample size. Lastly, it proposes a general approach for the construction of non-asymptotic bounds, providing relevant examples for several complicated statistics. It is a valuable resource for researchers with a basic understanding of multivariate statistics.
The insurance market is currently one of the most rapidly developing sectors of the economy. Its purpose is to protect the property interests of individuals and legal entities when specific events (insured accidents) occur, at the expense of monetary funds formed from the insurance premiums they pay. The probabilistic nature of insured accidents, together with the uncertainty about when they occur and how severe the losses are, makes it necessary to form loss reserves. Reserves for incurred but not reported claims (hereinafter referred to as IBNR reserves) seem to be the most challenging in terms of actuarial calculations. This article describes various actuarial techniques of loss reserving and gives examples of their application to a real insurance portfolio. Point estimation methods such as Chain Ladder, Bornhuetter-Ferguson and multiplicative techniques are compared with the stochastic Bootstrap method, and the most accurate estimate is determined using run-off analysis.
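As an illustration of the Chain Ladder technique named above, the following is a minimal sketch of the textbook volume-weighted version, not the implementation used in the article. The function name `chain_ladder`, the triangle layout (one list of cumulative claims per accident year, each row one entry shorter than the previous) and the toy figures are assumptions made for this example only.

```python
from typing import List, Tuple


def chain_ladder(triangle: List[List[float]]) -> Tuple[List[float], List[float], List[float]]:
    """Textbook chain ladder on a cumulative run-off triangle.

    triangle[i][j] is the cumulative claim amount of accident year i after
    j development periods; row i is assumed to hold n - i observed entries.
    Returns (development factors, projected ultimates, reserves).
    """
    n = len(triangle)

    # Volume-weighted development factor for each step j -> j + 1,
    # using only the rows where both columns are observed.
    factors = []
    for j in range(n - 1):
        rows = [row for row in triangle if len(row) > j + 1]
        factors.append(sum(row[j + 1] for row in rows) / sum(row[j] for row in rows))

    # Roll the latest observed value of each row forward to ultimate.
    ultimates = []
    for row in triangle:
        value = row[-1]
        for j in range(len(row) - 1, n - 1):
            value *= factors[j]
        ultimates.append(value)

    # Reserve = projected ultimate minus what is already observed.
    reserves = [u - row[-1] for u, row in zip(ultimates, triangle)]
    return factors, ultimates, reserves


# Toy data (hypothetical, not taken from the article's portfolio).
toy_triangle = [
    [100.0, 150.0, 165.0],
    [110.0, 176.0],
    [120.0],
]
factors, ultimates, reserves = chain_ladder(toy_triangle)
```

The sum of `reserves` is the chain ladder point estimate of the outstanding amount for the toy triangle. A bootstrap variant would resample residuals of such a fit to obtain a distribution rather than a single number.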
Prediction of the duration of a repair and maintenance project of a gas transport system is an important part of planning activities. There exist numerous sources of uncertainty that may result in time overruns, possibly leading to multiple negative consequences. Our experience in planning this work suggests that accepting the stochastic nature of the project duration is a constructive step towards preparedness for contingencies and towards defining penalties for repair companies. To support this approach, one needs to construct probability distributions of the durations of the projects. To address the scarcity of observed data, we suggest using a bootstrap resampling procedure. Gram-Charlier functions and order statistics are employed to approximate the distributions. It is demonstrated how to derive them for a separate repair project and for a larger project consisting of a number of concurrently running subprojects. Following this, guidance is provided on how to choose the duration that should define the deadline for completion of the whole work. A simple example is provided.
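The resampling idea described above can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the paper's procedure: it swaps the Gram-Charlier approximation for a plain empirical quantile, it assumes the subprojects are independent, and the function name `bootstrap_deadline`, its parameters and the sample durations are hypothetical.

```python
import random
from typing import List, Sequence


def bootstrap_deadline(
    subproject_durations: Sequence[Sequence[float]],
    level: float = 0.9,
    n_boot: int = 2000,
    seed: int = 0,
) -> float:
    """Empirical `level`-quantile of the duration of the whole project.

    Each inner sequence holds the observed durations of one subproject.
    Subprojects run concurrently, so the whole project lasts as long as
    its slowest subproject (the maximum order statistic).
    """
    rng = random.Random(seed)
    simulated: List[float] = []
    for _ in range(n_boot):
        # Draw one duration per subproject from its own observed history
        # (sampling with replacement across replications).
        simulated.append(max(rng.choice(list(d)) for d in subproject_durations))
    simulated.sort()
    index = min(n_boot - 1, int(level * n_boot))
    return simulated[index]


# Hypothetical observed durations in days for two concurrent subprojects.
history = [[10.0, 12.0, 15.0], [8.0, 20.0, 11.0]]
deadline = bootstrap_deadline(history, level=0.9)
```

Choosing `level` is the planning decision mentioned in the abstract: a higher level gives a later deadline that is less likely to be overrun.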
Motivated by a problem arising when analysing data from quarantine searches, we explore properties of distributions of sums of independent means of independent lattice-valued random variables. The aim is to determine the extent to which approximations to those sums require continuity corrections. We show that, in cases where there are only two different means, the main effects of distribution smoothness can be understood in terms of the ratio rho_12=(e_2 n_1)/(e_1 n_2), where e_1 and e_2 are the respective maximal lattice edge widths of the two populations, and n_1 and n_2 are the respective sample sizes used to compute the means. If rho_12 converges to an irrational number, or converges sufficiently slowly to a rational number; and in a number of other cases too, for example those where rho_12 does not converge; the effects of the discontinuity of lattice distributions are of smaller order than the effects of skewness. However, in other instances, for example where rho_12 converges relatively quickly to a rational number, the effects of discontinuity and skewness are of the same size. We also treat higher-order properties, arguing that cases where rho_12 converges to an algebraic irrational number can be less prone to suffer the effects of discontinuity than cases where the limiting irrational is transcendental. These results are extended to the case of three or more different means, and also to problems where distributions are estimated using the bootstrap. The results have practical interpretation in terms of the accuracy of inference for, among other quantities, the sum or difference of binomial proportions.
We introduce a new method for improving the coverage accuracy of confidence intervals for means of lattice distributions. The technique can be applied very generally to enhance existing approaches, although we consider it in greatest detail in the context of estimating a binomial proportion or a Poisson mean, where it is particularly effective. The method is motivated by a simple theoretical result, which shows that, by splitting the original sample of size n into two parts, of sizes n_1 and n_2=n-n_1, and basing the confidence procedure on the average of the means of these two subsamples, the highly oscillatory behaviour of coverage error, as a function of n, is largely removed. Perhaps surprisingly, this approach does not increase confidence interval width; usually the width is slightly reduced. Contrary to what might be expected, our new method performs well when it is used to modify confidence intervals based on existing techniques that already perform very well; it typically improves their coverage accuracy significantly. Each application of the split sample method to an existing confidence interval procedure results in a new technique.
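The splitting step described above can be illustrated with a minimal sketch for a binomial proportion. It applies the idea to a plain Wald-type interval, which is an assumption of this example: the abstract does not say which base procedure or which split sizes the authors recommend, and the function name `split_sample_interval` and the data are hypothetical.

```python
from statistics import NormalDist
from typing import Sequence, Tuple


def split_sample_interval(
    x: Sequence[int], n1: int, level: float = 0.95
) -> Tuple[float, float]:
    """Wald-type interval centred on the average of two subsample means.

    x holds 0/1 observations; the first n1 form subsample 1 and the
    remaining n2 = len(x) - n1 form subsample 2.
    """
    n = len(x)
    n2 = n - n1
    if not 0 < n1 < n:
        raise ValueError("n1 must satisfy 0 < n1 < len(x)")

    p1 = sum(x[:n1]) / n1
    p2 = sum(x[n1:]) / n2

    # Point estimate: unweighted average of the two subsample means.
    centre = 0.5 * (p1 + p2)
    # Variance of that average for independent subsamples.
    variance = 0.25 * (p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)

    z = NormalDist().inv_cdf(0.5 + level / 2)
    half_width = z * variance ** 0.5
    return max(0.0, centre - half_width), min(1.0, centre + half_width)


# Hypothetical 0/1 sample, split into parts of sizes 4 and 6.
sample = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1]
lower, upper = split_sample_interval(sample, n1=4)
```

Because n_1 differs from n_2, the averaged estimator takes values on a finer lattice than the ordinary sample mean, which is the mechanism the abstract credits for damping the oscillation in coverage.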
In Christoph, Prokhorov and Ulyanov (Theory Probab Appl 40(2):250–260, 1996) we studied properties of high-dimensional Gaussian random vectors. Yuri Vasil’evich Prokhorov initiated these investigations. In the present paper we continue these investigations. Computable error bounds of different orders with respect to n for the approximations of sample correlation coefficients and the angle between high-dimensional Gaussian vectors by the standard normal law are obtained. We give some numerical results as well. Moreover, different types of Bartlett corrections are suggested.
A model for organizing cargo transportation between two node stations connected by a railway line which contains a certain number of intermediate stations is considered. The movement of cargo is in one direction. Such a situation may occur, for example, if one of the node stations is located in a region which produces raw material for manufacturing industry located in another region, where the other node station is situated. The organization of freight traffic is performed by means of a number of technologies. These technologies determine the rules for taking on cargo at the initial node station, the rules of interaction between neighboring stations, as well as the rule of distribution of cargo to the final node stations. The process of cargo transportation is governed by a set rule of control. For such a model, one must determine possible modes of cargo transportation and describe their properties. This model is described by a finite-dimensional system of differential equations with nonlocal linear restrictions. The class of solutions satisfying nonlocal linear restrictions is extremely narrow. This creates the need for a “correct” extension of solutions of the system of differential equations to a class of quasi-solutions whose distinctive feature is gaps at a countable number of points. Using the fourth-order Runge–Kutta method, it was possible to construct these quasi-solutions numerically and determine their rate of growth. Technically, the main difficulty was obtaining quasi-solutions that satisfy the nonlocal linear restrictions. Furthermore, we investigated the dependence of quasi-solutions and, in particular, of the sizes of gaps (jumps) of solutions on a number of parameters of the model characterizing the rule of control, the technologies for transportation of cargo and the intensity of cargo supply at a node station.
Let k be a field of characteristic zero, let G be a connected reductive algebraic group over k and let g be its Lie algebra. Let k(G), respectively, k(g), be the field of k-rational functions on G, respectively, g. The conjugation action of G on itself induces the adjoint action of G on g. We investigate the question whether or not the field extensions k(G)/k(G)^G and k(g)/k(g)^G are purely transcendental. We show that the answer is the same for k(G)/k(G)^G and k(g)/k(g)^G, and reduce the problem to the case where G is simple. For simple groups we show that the answer is positive if G is split of type A_n or C_n, and negative for groups of other types, except possibly G_2. A key ingredient in the proof of the negative result is a recent formula for the unramified Brauer group of a homogeneous space with connected stabilizers. As a byproduct of our investigation we give an affirmative answer to a question of Grothendieck about the existence of a rational section of the categorical quotient morphism for the conjugation action of G on itself.
Let G be a connected semisimple algebraic group over an algebraically closed field k. In 1965 Steinberg proved that if G is simply connected, then in G there exists a closed irreducible cross-section of the set of closures of regular conjugacy classes. We prove that in arbitrary G such a cross-section exists if and only if the universal covering isogeny Ĝ → G is bijective; this answers Grothendieck's question cited in the epigraph. In particular, for char k = 0, the converse to Steinberg's theorem holds. The existence of a cross-section in G implies, at least for char k = 0, that the algebra k[G]^G of class functions on G is generated by rk G elements. We describe, for arbitrary G, a minimal generating set of k[G]^G and that of the representation ring of G and answer two of Grothendieck's questions on constructing generating sets of k[G]^G. We prove the existence of a rational (i.e., local) section of the quotient morphism for arbitrary G and the existence of a rational cross-section in G (for char k = 0, this has been proved earlier); this answers the other question cited in the epigraph. We also prove that the existence of a rational section is equivalent to the existence of a rational W-equivariant map T ⇢ G/T where T is a maximal torus of G and W the Weyl group.