Методы бинарной классификации объектов с номинальными показателями
In this work a problem is studied of classification of respondents into classes accepting and not participation in a charity actions. An optimal (in Bayes sense) decisive discriminant rule of division of objects on two classes is constructed for the case when all indicators of observable objects are measured in a nominal scale, and there are signs of dependence between them . Using ROC-analysis methods, comparison of the developed rule with a rule implemented in the software package SPSS (Fisher’s discriminant rule), «naive» Bayesian classifier, a rule based on support vector machines (SVM) method and implemented in SPSS package binary logistic regression classifier is made. Results of the ROC-analysis have shown that the proposed rule has higher quality than all other mentioned rules of classification of respondents.
In article possible approaches to clustering of large city schools of the Russian Federation by results of their educational activity are studied. The extent to which a school has entered a particular cluster is determined by a number of objective conditions in which schools operate. Significant indicators of conditions affecting the EGE-results in schools were identified. It is shown that the studied indexes of the working conditions of schools are not sufficient for the correct clustering of schools according to the aggregated EGE-indicators.
Different methods of feature selection are used to improve the performance of remote sensing images classification. In this work two methods of feature selection are examined. The first one is based on the discriminant analysis, and the second one rests on building the regression model. Histogram and textural features are considered as characteristics of an image. The experiments on the remote sensing dataset UC Merced Land Use show the effectiveness of these methods. As the result, the largest fraction of correctly classified images accounts for the 95%. Dimension of the initial feature space consisting of 18 features has been reduced to 3 features.
The article gives an overview of the main approaches to verification of the internal rating methods of analysis of the creditworthiness. The article gives practical advice on the application of these techniques by an example of real internal rating technique.
The article studies educational trajectories of schoolchildren in Yaroslavl Oblast. Conclusions point out that schoolchild’s educational achievements, educational plans of his/her friends and the level of education of his/her father are key predictors of a decision about continuing education. Thanks to this information it is possible to know which schoolchildren are at risk of not continuing their studies. In the course of the research comparative advantages of the logistic regression and the discriminant analysis in the case of binary dependent variables were examined. With the necessary prerequisites for the use of methods fulfilled, both strategies work well classifying schoolchildren.
The current study provides new insights into explaining individual differences in the attitude toward childlessness in Russia. The main research aim is defining who are childless people in modern Russia and to what extent this status is compulsory (i.e. reproductive norms are lowered due to unfavorable life circumstances) or truly voluntary. Moscow is examined separately within the research as the most advanced in respect to the second demographic transition region of the country.
This is a short abstract in the conference book. Long abstract is available at the web site of the European Population Conference 2014.
A model for organizing cargo transportation between two node stations connected by a railway line which contains a certain number of intermediate stations is considered. The movement of cargo is in one direction. Such a situation may occur, for example, if one of the node stations is located in a region which produce raw material for manufacturing industry located in another region, and there is another node station. The organization of freight traﬃc is performed by means of a number of technologies. These technologies determine the rules for taking on cargo at the initial node station, the rules of interaction between neighboring stations, as well as the rule of distribution of cargo to the ﬁnal node stations. The process of cargo transportation is followed by the set rule of control. For such a model, one must determine possible modes of cargo transportation and describe their properties. This model is described by a ﬁnite-dimensional system of diﬀerential equations with nonlocal linear restrictions. The class of the solution satisfying nonlocal linear restrictions is extremely narrow. It results in the need for the “correct” extension of solutions of a system of diﬀerential equations to a class of quasi-solutions having the distinctive feature of gaps in a countable number of points. It was possible numerically using the Runge–Kutta method of the fourth order to build these quasi-solutions and determine their rate of growth. Let us note that in the technical plan the main complexity consisted in obtaining quasi-solutions satisfying the nonlocal linear restrictions. Furthermore, we investigated the dependence of quasi-solutions and, in particular, sizes of gaps (jumps) of solutions on a number of parameters of the model characterizing a rule of control, technologies for transportation of cargo and intensity of giving of cargo on a node station.