Clustering cities based on their development dynamics and Variable neigborhood search
This state-of-the-art survey is dedicated to the memory of Emmanuil Markovich Braverman (1931-1977), a pioneer in developing the machine learning theory. The 12 revised full papers and 4 short papers included in this volume were presented at the conference "Braverman Readings in Machine Learning: Key Ideas from Inception to Current State" held in Boston, MA, USA, in April 2017, commemorating the 40th anniversary of Emmanuil Braverman's decease. The papers present an overview of some of Braverman's ideas and approaches. The collection is divided in three parts. The first part bridges the past and the present. Its main contents relate to the concept of kernel function and its application to signal and image analysis as well as clustering. The second part presents a set of extensions of Braverman's work to issues of current interest both in theory and applications of machine learning. The third part includes short essays by a friend, a student, and a colleague.
The mass application of mobile cardiographs already leads to both explosive quantitative growth of the number of patients available for ECG study, registered daily outside the hospital (Big DATA in cardiology), and to the emergence of new qualitative opportunities for the study of long-term oscillatory processes (weeks, months, years) of the dynamics of the individual state of the Cardiovascular system of any patient.
The article demonstrates that new opportunities of long - term continuous monitoring of the Cardiov ascular system state of patients ' mass allow to reveal the regularities (DATA MINING) of Cardiovascular system dynamics, leading to the hypothesis of the existence of an adequate Cardiovascular system model as a distributed nonlinearself - oscillating system of the FPU recurrence model class . The presence of a meaningful mathematical model of Cardiovascular system within the framework of the FPU auto – recurrence , as a refinement of the traditional model of studying black box, further allows us to offer new computational methods for ECG analysis and prediction of Cardiovascular system dynamics for a refined diagnosis and evaluation of the effectiveness of the treatment.
The paper describes the results of an experimental study of topic models applied to the task of single-word term extraction. The experiments encompass several probabilistic and non-probabilistic topic models and demonstrate that topic information improves the quality of term extraction, as well as NMF with KL-divergence minimization is the best among the models under study.
Technology mining (TM) helps to acquire intelligence about the evolution of research and development (R&D), technologies, products, and markets for various STI areas and what is likely to emerge in the future by identifying trends. The present chapter introduces a methodology for the identification of trends through a combination of “thematic clustering” based on the co-occurrence of terms, and “dynamic term clustering” based on the correlation of their dynamics across time. In this way, it is possible to identify and distinguish four patterns in the evolution of terms, which eventually lead to (i) weak signals of future trends, as well as (ii) emerging, (iii) maturing, and (iv) declining trends. Key trends identified are then further analyzed by looking at the semantic connections between terms identified through TM. This helps to understand the context and further features of the trend. The proposed approach is demonstrated in the field photonics as an emerging technology with a number of potential application areas.
This article represents a new technique for collaborative filtering based on pre-clustering of website usage data. The key idea involves using clustering methods to define groups of different users.
This is a textbook in data analysis. Its contents are heavily influenced by the idea that data analysis should help in enhancing and augmenting knowledge of the domain as represented by the concepts and statements of relation between them. According to this view, two main pathways for data analysis are summarization, for developing and augmenting concepts, and correlation, for enhancing and establishing relations. Visualization, in this context, is a way of presenting results in a cognitively comfortable way. The term summarization is understood quite broadly here to embrace not only simple summaries like totals and means, but also more complex summaries such as the principal components of a set of features or cluster structures in a set of entities.
The material presented in this perspective makes a unique mix of subjects from the fields of statistical data analysis, data mining, and computational intelligence, which follow different systems of presentation.
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use an approach based on computing (closed) sets of attributes having large support (large extent) as clusters of similar documents. The method is tested in a series of computer experiments on large public collections of web documents and compared to other established methods and software, such as biclustering, on same datasets. Practical efficiency of different algorithms for computing frequent closed sets of attributes is compared.
Abstract. The paper describes the results of an experimental study of topic models applied to the task of single-word term extraction. The experiments encompass several probabilistic and non-probabilistic topic models and demonstrate that topic information improves the quality of term extraction, as well as NMF with KL-divergence minimization is the best among the models under study.
This compilation presents papers on the techniques of building global international university rankings, participation in them of Russian higher education institutions including the winners of 5/100/2020 Federal Project, the positions of national higher education in the international market of educational services. The authors analyze the methodology of ranking higher education institutions, the experience of developing rankings of leading universities in the Post-Soviet area and the BRICS countries. The authors are workers of Russian and foreign universities and research institutions as well as other organizations dealing with the development and analysis of rankings. The book is addressed to instructors, scientific workers, and to everybody interested in international education; global, regional, and national university rankings; the possibilities of enhancing the competitiveness of national higher education institutions. This edition is prepared in the framework of implementation of the Government resolution «On measures of state support of leading universities of the Russian Federation in order to enhance their competitiveness among the leading world scientific and educational centers» (as of 16 March 2013 № 211).
We consider certain spaces of functions on the circle, which naturally appear in harmonic analysis, and superposition operators on these spaces. We study the following question: which functions have the property that each their superposition with a homeomorphism of the circle belongs to a given space? We also study the multidimensional case.
We consider the spaces of functions on the m-dimensional torus, whose Fourier transform is p -summable. We obtain estimates for the norms of the exponential functions deformed by a C1 -smooth phase. The results generalize to the multidimensional case the one-dimensional results obtained by the author earlier in “Quantitative estimates in the Beurling—Helson theorem”, Sbornik: Mathematics, 201:12 (2010), 1811 – 1836.
We consider the spaces of function on the circle whose Fourier transform is p-summable. We obtain estimates for the norms of exponential functions deformed by a C1 -smooth phase.
The article is devoted to the study of the authoritarianism prevalent in the mass consciousness of Russians. The article describes a new approach to the consideration of the authoritarian syndrome as the effects of the cultural trauma as a result of political and socio-cultural transformation of society. The article shows the dynamics of the symptoms of the authoritarianism, which appear in the mass consciousness of Russians from 1993 to 2011. This paper proposes a package of measures aimed at reducing the level of the authoritarianism in Russian society.
This work looks at a model of spatial election competition with two candidates who can spend effort in order to increase their popularity through advertisement. It is shown that under certain condition the political programs of the candidates will be different. The work derives the comparative statics of equilibrium policy platform and campaign spending with respect the distribution of voter policy preferences and the proportionality of the electoral system. In particular, it is whown that the equilibrium does not exist if the policy preferences are distributed over too narrow an interval.
The article examines "regulatory requirements" as a subject of state control over business in Russia. The author deliberately does not use the term "the rule of law". The article states that a set of requirements for business is wider than the legislative regulation.
First, the article analyzes the regulatory nature of the requirements, especially in the technical field. The requirements are considered in relation to the rule of law. The article explores approaches to the definition of regulatory requirements in Russian legal science. The author analyzes legislation definitions for a set of requirements for business. The author concludes that regulatory requirements are not always identical to the rule of law. Regulatory requirements are a set of obligatory requirements for entrepreneurs’ economic activity. Validation failure leads to negative consequences.
Second, the article analyzes the problems of the regulatory requirements in practice. Lack of information about the requirements, their irrelevance and inconsistency are problems of the regulatory requirements in Russia.
Many requirements regulating economic activity are not compatible with the current development level of science and technology. The problems are analyzed on the basis of the Russian judicial practice and annual monitoring reports by Higher School of Economics.
Finally, the author provides an approach to the possible solution of the regulatory requirements’ problem. The author proposes to create a nationwide Internet portal about regulatory requirements. The portal should contain full information about all regulatory requirements. The author recommends extending moratorium on the use of the requirements adopted by the bodies and organizations of the former USSR government.