Задачи нечетко-вероятностной оптимизации: регрессия с нечеткими данными
The paper analyzes the key determinants of real estate prices in Perm, with special attention to transport accessibility indicators. The issue of transport accessibility modeling is discussed. The valuation of price hedonic model revealed that housing prices in Perm are affected mostly by the area of the apartment, the fact of its location on the first floor, number of public transport routes in the district, and time to the city centre
The problem of recognition of a sequence of objects (e.g., video-based image recognition, phoneme recognition) is explored. The generalization of the fuzzy phonetic decoding method is proposed by assuming the distribution of the classified object to be of exponential type. Its preliminary phase includes association of each model object with the fuzzy set of model classes with grades of membership defined as the confusion probabilities estimated with the Kullback-Leibler divergence between model distributions. At first, each object (e.g., frame) in a classified sequence is put in correspondence with the fuzzy set which grades are defined as the posterior probabilities. Next, this fuzzy set is intersected with the fuzzy set corresponding to the nearest neighbor. Finally, the arithmetic mean of these fuzzy intersections is assigned to the decision for the whole sequence. In this paper we propose not to limit the method's usage with the Kullback-Leibler discrimination and to estimate the grades of membership of models and query objects based on an arbitrary distance with appropriate scale factor. The experimental results in the problem of isolated Russian vowel phonemes and words recognition for state-of-the-art measures of similarity are presented. It is shown that the correct choice of the scale parameter can significantly increase the recognition accuracy.
The article is devoted to questions of accumulated data usage to find out regularities by means of pattern recognition methods that allow predicting formation of not synthesized substances and estimating its properties. The formal task of computer-aided inorganic compounds design is stated. An approach to reproduce of missing data in learning samples for computer-aided inorganic compounds design is proposed. It is based on combination of linear regression and interpolation taking into consideration the problem domain - inorganic chemistry. The approach is more powerful than methods of missed data reproduction used currently in information-analytical system for inorganic compounds design running at IMET RAS.
A complex model of forecasting the cost of residential real estate in the secondary market, including three submodels – a model of forecasting the level of population needs for housing based on regional data, a model of forecasting the comfort of housing based on local data, and a model of forecasting the cost of a unit of residential real estate based on the factors of the object and input variables that are the results of the forecast of previous models.