Algorithmic Statistics: Forty Years Later.

?

Algorithmic Statistics: Forty Years Later.

Lecture Notes in Computer Science. 2017. Vol. 10010. P. 669–737.

Algorithmic statistics has two different (and almost orthogonal) motivations. From the philosophical point of view, it tries to formalize how the statistics works and why some statistical models are better than others. After this notion of a "good model" is introduced, a natural question arises: it is possible that for some piece of data there is no good model? If yes, how often these bad ("non-stochastic") data appear "in real life"? Another, more technical motivation comes from algorithmic information theory. In this theory a notion of complexity of a finite object (=amount of information in this object) is introduced; it assigns to every object some number, called its algorithmic complexity (or Kolmogorov complexity). Algorithmic statistic provides a more fine-grained classification: for each finite object some curve is defined that characterizes its behavior. It turns out that several different definitions give (approximately) the same curve. In this survey we try to provide an exposition of the main results in the field (including full proofs for the most important ones), as well as some historical comments. We assume that the reader is familiar with the main notions of algorithmic information (Kolmogorov complexity) theory.

Priority areas: IT and mathematics

Language: English

Full text

DOI

On Algorithmic Statistics for Space-bounded Algorithms

Milovanov A., Theory of Computing Systems 2019 Vol. 63 No. 4 P. 833–848

Algorithmic statistics looks for models of observed data that are good in the following sense: a model is simple (i.e., has small Kolmogorov complexity) and captures all the algorithmically discoverable regularities in the data. However, this idea can not be used in practice as is because Kolmogorov complexity is not computable. In this paper we ...

Added: October 17, 2018

Some Properties of Antistochastic Strings

Milovanov A., Theory of Computing Systems 2017 Vol. 61 No. 2 P. 521–535

Algorithmic statistics is a part of algorithmic information theory (Kolmogorov complexity theory) that studies the following task: given a finite object x (say, a binary string), find an `explanation' for it, i.e., a simple finite set that contains x and where x is a `typical element'. Both notions (`simple' and `typical') are defined in terms ...

Added: June 27, 2016

Algorithmic Statistics: Forty Years Later.

Shen A., Vereshchagin N., , in: Computability and Complexity.: Berlin: Springer, 2017. P. 669–737.

Added: October 26, 2018

An additivity theorem for plain Kolmogorov complexity

Bauwens B. F., Shen A., Theory of Computing Systems 2013 Vol. 52 No. 2 P. 297–302

Added: October 2, 2015

Algorithmic Minimal Sufficient Statistics: a New Approach

Vereshchagin N., Theory of Computing Systems 2016 Vol. 58 No. 3 P. 463–481

We introduce the notion of a strong sufficient statistic for a given data string. We show that strong sufficient statistics have better properties than just sufficient statistics. We prove that there are “strange” data strings, whose minimal strong sufficient statistic have much larger complexity than the minimal sufficient statistic. ...

Added: February 7, 2017

On Algorithmic Statistics for Space-Bounded Algorithms

Milovanov A., , in: Computer Science – Theory and Applications: 12th International Computer Science Symposium in Russia (CSR 2017)Vol. 10304.: Luxemburg: Springer, 2017. P. 232–244.

Algorithmic statistics studies explanations of observed data that are good in the algorithmic sense: an explanation should be simple i.e. should have small Kolmogorov complexity and capture all the algorithmically discoverable regularities in the data. However this idea can not be used in practice because Kolmogorov complexity is not computable. In this paper we develop algorithmic ...

Added: October 15, 2017

Descriptive complexity of computable sequences revisited

Vereshchagin N., Theoretical Computer Science 2020 Vol. 809 P. 531–537

The purpose of this paper is to answer two questions left open in [B. Durand, A. Shen, and N. Vereshchagin, Descriptive Complexity of Computable Sequences, Theoretical Computer Science 171 (2001), pp. 47--58]. Namely, we consider the following two complexities of an infinite computable 0-1-sequence $\alpha$: $C^{0'}(\alpha )$, defined as ...

Added: January 17, 2020

Complexity of complexity and maximal plain versus prefix-free Kolmogorov complexity

Bauwens B. F., Shen A., Journal of Symbolic Logic 2013 Vol. 79 No. 2 P. 620–632

Added: October 2, 2015

Kolmogorov’s Last Discovery? (Kolmogorov and Algorithmic Statistics)

Semenov A., Shen A., Vereshchagin N., Theory of Probability and its Applications, USA 2024 Vol. 68 No. 4 P. 582–606

The definition of descriptional complexity of finite objects suggested by Kolmogorov and other authors in the mid-1960s is now well known. In addition, Kolmogorov pointed out some approaches to a more fine-grained classification of finite objects, such as the resource-bounded complexity (1965), structure function (1974), and the notion of $(\alpha,\beta)$-stochasticity (1981). Later it turned out ...

Added: January 16, 2025

Algorithmic Statistics and Prediction for Polynomial Time-Bounded Algorithms

Milovanov A., , in: Sailing Routes in the World of Computation.: Springer, 2018. P. 287–296.

Algorithmic statistics studies explanations of observed data that are good in the algorithmic sense: an explanation should be simple i.e. should have small Kolmogorov complexity and capture all the algorithmically discoverable regularities in the data. However this idea can not be used in practice as is because Kolmogorov complexity is not computable. In recent years resource-bounded ...

Added: September 4, 2018

Generic algorithms for halting problem and optimal machines revisited

Bienvenu L., Desfontaines D., Shen A., Logical Methods in Computer Science 2016 Vol. 12 No. 2

The halting problem is undecidable --- but can it be solved for "most" inputs? This natural question was considered in a number of papers, in different settings. We revisit their results and show that most of them can be easily proven in a natural framework of optimal machines (considered in algorithmic information theory) using the ...

Added: February 7, 2017

Relating and contrasting plain and prefix Kolmogorov complexity

Bauwens B. F., Theory of Computing Systems 2015 Vol. 58 No. 3 P. 482–501

Added: October 2, 2015

Some Properties of Antistochastic Strings

Milovanov A., , in: Computer Science -- Theory and Applications 10th International Computer Science Symposium in Russia, CSR 2015Vol. 9139.: Springer, 2015. P. 339–349.

Antistochastic strings are those strings that do not have any reasonable statistical explanation. We establish the follow property of such strings: every antistochastic string x is “holographic” in the sense that it can be restored by a short program from any of its part whose length equals the Kolmogorov complexity of x. Further we will ...

Added: June 27, 2016

Priority Queueing for Packets with Two Characteristics

Chuprikov P., Nikolenko S. I., Davydow A. et al., IEEE Transactions on Networking 2018 Vol. 26 No. 1 P. 342–355

Modern network elements are increasingly required to deal with heterogeneous traffic. Recent works consider processing policies for buffers that hold packets with different processing requirements (number of processing cycles needed before a packet can be transmitted out) but uniform value, aiming to maximize the throughput, i.e., the number of transmitted packets. Other developments deal with ...

Added: March 14, 2018

Влияние проницаемости поясков Каспари для воды и растворенных веществ на величину корневого давления: математическое моделирование

Logvenkov S. A., Штейн А. А., Российский журнал биомеханики 2013 Т. 17 № 4 С. 47–57

The mathematical modelling is performed to study the effect of the permeability of the Casparian bands to water and solutes on the formation of the root pressure. It is shown that the pressure in the xylem vessels which stops the flow across a root cut (root pressure) decreases with increase in the permeability of the ...

Added: January 30, 2014

Particle Simulation for Predicting Effective Properties of Short Fiber Reinforced Composites

Skoptsov K. A., Sheshenin S., Galatenko V. V. et al., International Journal of Applied Mechanics 2016 Vol. 8 No. 2 P. 1650016-01–1650016-18

We present a method for evaluating elastic properties of a composite material produced by molding a resin filled with short elastic fibers. A flow of the filled resin is simulated numerically using a mesh-free method. After that, assuming that spatial distribution and orientation of fibers are not significantly changed during polymerization, effective elastic moduli of ...

Added: May 22, 2016

Три подхода к проблеме квантовой реальности и вторая квантовая революция.

Terekhovich V., Эпистемология и философия науки 2019 Т. 56 № 1 С. 169–184

The framework of simple opposition realism - anti-realism is not enough to analyze the views on the reality of unobservable objects of quantum theory. First, it is necessary to distinguish between realism in relation to the theory and realism in relation to the theory’s objects. Secondly, realism in relation to classical objects can be combined, ...

Added: February 12, 2020

Количественные методы разработки и принятия решений в менеджменте. Компьютерное моделирование в Microsoft Excel. Практикум Изд. 2-е, испр. и доп.

Madera A. G., М.: ЛЕНАНД, 2018.

В учебном пособии рассмотрено применение компьютерных технологий в математическом моделировании и количественных методах принятия решений при изучении дисциплин «Качественные и количественные методы разработки и принятия управленческих решений», «Математические модели в управлении», и аналогичных им. В качестве инструментального средства используется программный пакет Microsoft Excel. Книга является практикумом по курсу, охватывает большинство тем указанных дисциплин и включает ...

Added: October 29, 2018

Численные методы поиска равновесного распределения потоков в модели Бэкмана и модели стабильной динамики

Gasnikov A., Maximov Y., Дорн Ю. В., Математическое моделирование 2016 Т. 28

В работе рассматриваются две потенциальные игры загрузки: модель Бэкмана (1955) и ее вырожденный вариант – модель стабильной динамики (Нестеров–де Пальма, 1998). В статье мы опишем эффективные численные процедуры поиска равновесий в этих играх. Для модели (игры) Бэкмана будет использован метод Франк–Вульфа, а для модели ста- бильной динамики используется переход к двойственной задаче. Эта задача решается ...

Added: October 23, 2015

Fast reconfiguration of high frequency brain networks in response to surprising changes in auditory input

Nicol R., Chapman S., Vertes P. et al., Journal of Neurophysiology (США) 2012 Vol. 107 No. 5 P. 1421–1430

How do human brain networks react to dynamic changes in the sensory environment? We measured rapid changes in brain network organization in response to brief, discrete, salient auditory stimuli. We estimated network topology and distance parameters in the immediate central response period, <1 s following auditory presentation of standard tones interspersed with occasional deviant tones ...

Added: October 23, 2014

Multiple substance use among heroin users: findings from an Internet-based illegal marketplace "Hydra"

Meylakhs A., Harm Reduction Journal 2020

. ...

Added: January 28, 2021

О выборе программных средств когнитивной компьютерной визуализации

Baibikova T., Domoratsky E., Вестник Московского финансово-юридического университета 2017 № 1 С. 200–206

Some questions of scientific visualization are under consideration in this paper. This article also discusses the peculiarities of application of cognitive computer graphics, singles out a range of tasks of scientific visualization. The paper gives a brief overview of modern support tools for program visualization, tendencies of their development and their main characteristics. A module ...

Added: June 10, 2017

Hardness of Approximation for H-free Edge Modification Problems

Bliznets Ivan, Cygan M., Komosa P. et al., ACM Transactions on Computation Theory 2018 Vol. 10 No. 2 P. 1–32

The H-free Edge Deletion problem asks, for a given graph G and integer k, whether it is possible to delete at most k edges from G to make it H-free—that is, not containing H as an induced subgraph. The H-free Edge Completion problem is defined similarly, but we add edges instead of deleting them. The study of these two problem families has recently been the subject of intensive studies from the point of ...

Added: October 30, 2018

Probably approximately correct learning of Horn envelopes from queries

Borchmann D., Hanika T., Obiedkov S., Discrete Applied Mathematics 2020 Vol. 273 P. 30–42

We propose an algorithm for learning the Horn envelope of an arbitrary domain using an expert, or an oracle, capable of answering certain types of queries about this domain. Attribute exploration from formal concept analysis is a procedure that solves this problem, but the number of queries it may ask is exponential in the size ...

Added: October 29, 2019