Methods for Speeding Up Recommender Systems for High-Load Websites

О. В. Новиков

Прикладная информатика. 2013. No. 5(47). P. 29–34.

In press

This article presents various techniques for building fast recommender systems based on dimensionality reduction and classification of website usage data. It also describes the types of websites that use recommender systems.

Research target: Computer Science

Priority areas: business informatics

Language: Russian

Keywords: classification, recommender systems, collaborative filtering

A Survey of Collaborative Filtering Methods for Use on High-Load Websites

Новиков О. В., Образование. Наука. Научные кадры 2013 No. 5 P. 211–212

This article presents various techniques for building fast recommender systems based on collaborative filtering. It also describes different ways to collect web usage data. ...

Added: October 28, 2013

Fairness of Machine Learning Algorithms in Demography

Emmanuel I. C., Mitrofanova E. / Series 4064475 "ArXiv Preprint". 2022.

The paper is devoted to the study of the model fairness and process fairness of the Russian demographic dataset by making predictions of divorce of the 1st marriage, religiosity, 1st employment and completion of education. Our goal was to make classifiers more equitable by reducing their reliance on sensitive features while increasing or at least ...

Added: May 31, 2022

Machine Learning: Predicting Public Procurement Risks

Елисеев Д. А., Romanov D. A., Открытые системы. СУБД 2018 No. 2 P. 42–44

Huge sums of money circulate in public procurement, and considerable effort is now being put into monitoring contract execution: timely risk management could save billions of rubles. An accurate model for automated risk assessment of government contracts, built on machine learning algorithms, can help improve the efficiency of public procurement. ...

Added: December 19, 2018

On Entropy-Based Feature Selection Criteria in Data Analysis Problems

Dubnov Y. A., Информационные технологии и вычислительные системы 2018 No. 2 P. 60–69

The paper considers the problem of reducing the dimension of the feature space used to describe objects in data analysis problems, using binary classification as an example. The article provides a detailed overview of existing approaches to this problem and proposes several modifications in which dimensionality reduction is treated as the problem of extracting the most relevant ...

Added: July 4, 2018

Mathematical and Software Support for Bases of Expert Knowledge Acquired During Incident Resolution in Information Systems

Karasev A. A., Starykh V., Вестник МГТУ МИРЭА 2014 Vol. 4 No. 5 P. 113–121

This article addresses the problem of collecting and classifying the expert knowledge acquired during the operation of information systems, with the aim of organizing expert knowledge bases. The use of ontology theory is proposed as a promising approach; it provides a basis for building a mathematical model of knowledge representation. Organization of the object hierarchy is ...

Added: February 10, 2015

The Problem of Interpreting, Differentiating and Classifying Digital Products

Shaidullin A., Бизнес-информатика 2023 Vol. 17 No. 2 P. 55–70

Digital innovative products often become a significant factor in the revision of companies’ business strategies and influence consumer preferences. A key component in the process of formulating such strategies is understanding the implications underlying the attributes of digital products. This requires a good understanding of their nature and characteristics. To date, there is no solid ...

Added: June 29, 2023

On a Method for Improving the Computational Efficiency of a Probabilistic Neural Network in Pattern Recognition Based on Projection Estimates

Savchenko A., Информационные системы и технологии 2015 No. 4(90) P. 28–38

The paper considers the insufficient computational efficiency of the probabilistic neural network (PNN) in pattern recognition problems when the database contains only a small number of reference samples for each class. A new modification of the PNN is synthesized on the basis of projection estimates of the probability density with the Fejér kernel and the naive assumption that the features of the classified object are independent. Experiments show that the proposed classifier is somewhat more accurate and much ...

Added: October 8, 2015

SmartTips: Online Products Recommendations System Based on Analyzing Customers Reviews

Ali N., Alshahrani A., Alghamdi A. et al., Applied Sciences (Switzerland) 2022 Vol. 12 No. 17 Article 8823

Online customers’ opinions represent a significant resource for both customers and enterprises to extract much information that helps them make the right decision. Finding relevant data while searching the internet is a big challenge for web users, known as the “Problem of Information Overload”. Recommender systems have been recognized as a promising way of solving ...

Added: October 4, 2022

A Feature Selection Method Based on a Probabilistic Approach and Cross-Entropy, Illustrated by an Image Recognition Problem

Dubnov Y. A., Искусственный интеллект и принятие решений 2020 No. 2 P. 78–85

The paper considers the problem of feature selection in the classification problem. A method for selecting informative features based on a probabilistic approach and cross-entropy metrics is proposed. Several variants of the information criterion for selecting features for a binary classification problem are considered, as well as its generalization to the case of a multiclass ...

Added: October 31, 2020

Classification of Brain Activity Using Synolitic Networks

Vlasenko D., Zaikin A., Zakharov D., Известия высших учебных заведений. Прикладная нелинейная динамика 2023 Vol. 31 No. 5 P. 661–669

Because the brain is an extremely complex hypernet of interacting macroscopic subnetworks, full-scale analysis of brain activity is a daunting task. Nevertheless, this task can be greatly simplified by analysing the correspondence between various patterns of macroscopic brain activity, for example, through functional magnetic resonance imaging (fMRI) scans, and the performance of particular cognitive tasks or pathological states. The purpose of ...

Added: October 4, 2023

Concept Learning from Triadic Data

Zhuk R., Ignatov D. I., Konstantinova N., Procedia Computer Science 2014 Vol. 31 P. 928–938

We propose extensions of the classical JSM-method and the Naïve Bayesian classifier for the case of triadic relational data. We performed a series of experiments on various types of data (both real and synthetic) to estimate the quality of classification techniques and compare them with other classification algorithms that generate hypotheses, e.g. ID3 and Random ...

Added: June 9, 2014

Cross-Entropy Reduction of a Data Matrix with Constraints on the Information Capacity of the Projector Matrices and Their Norms

Popkov Y., Popkov A., Dubnov Y. A., Математическое моделирование 2020 Vol. 32 No. 9 P. 35–52

We develop a new method of dimensionality reduction based on direct and inverse projection of the data matrix and on computing projectors that minimize a cross-entropy functional. We introduce the concept of the information capacity of a matrix, which is used as a constraint in the optimal reduction problem. We compare the proposed method with known ones based ...

Added: October 31, 2020

Bayesian Identification of the Parameters of a Mixture of Normal Distributions

Dubnov Y. A., Bulychev A., Информационные технологии и вычислительные системы 2017 No. 1 P. 101–111

We consider the problem of parameter estimation for Gaussian mixture models, which are widely used in data analysis and unsupervised machine learning. We propose a new model identification method based on the Bayesian approach and the maximum a posteriori principle. In the article we describe a method for locating the maximum of a multi-extremal density function using sampling by Metropolis-Hastings ...

Added: December 27, 2017

23rd International Symposium on Methodologies for Intelligent Systems - Proceedings

Birkhauser/Springer, 2017.

This book constitutes the proceedings of the 23rd International Symposium on Foundations of Intelligent Systems, ISMIS 2017, held in Warsaw, Poland, in June 2017. The 56 regular and 15 short papers presented in this volume were carefully reviewed and selected from 118 submissions. The papers include both theoretical and practical aspects of machine learning, data mining ...

Added: September 18, 2017

Classification and Cataloguing of Information Resources for Education

Starykh V., Башмаков А. И., Scientific and Technical Information Processing 2013 No. 4 P. 27–31

To manage the information resources (IR) of the Russian educational system, a uniform metadata specification for describing and classifying information should be developed and adopted for use in educational portals, registration systems, and data repositories alike. The present work refines the concept of an IR as an object of cataloguing ...

Added: October 16, 2013

Information Resources for Education: Cataloguing, Classification, Ontology

Башмаков А. И., Белоозеров В. Н., Starykh V., Информационные системы и технологии 2013 No. 6(80) P. 88–102

The article describes the construction of a formal ontology of the system of information resources for education. The ontology is intended to represent this domain in automated systems for creating, recording, organizing, storing, searching and using these resources in educational institutions at various levels. The system of information resources is set ...

Added: January 16, 2014

Automated real-time classification of psychological functional state based on discrete wavelet transform of EEG data

Galatenko V. V., Livshitz E., Podol’skii V. et al., International Journal of Applied Mathematics 2012 Vol. 25 No. 6 P. 871–882

A method for the automated real-time classification of psychological functional state is proposed. The classification is based on discrete wavelet transform of electroencephalographic data. The method consists of two preliminary stages — global feature selection and individual tuning, and the main stage — real-time classification. All stages are fully automated. The software implementation of this ...

Added: October 30, 2015

Distributed Clustering of Website User Behavior Data for Recommender Systems

Новиков О. В., Образование. Наука. Научные кадры 2013 No. 2-2013 P. 164–167

This article presents a new technique for collaborative filtering based on pre-clustering of website usage data. The key idea is to use clustering methods to identify groups of different users. ...

Added: April 6, 2013

Automatic Text Classification Using Semantic-Syntactic Relations Between Words

Lebedev I., Спивак А. И., Лапшин С. В., Вестник компьютерных и информационных технологий 2018 No. 12(174) P. 28–35

We present a method for improving the quality metrics of text classification. The result is achieved by using additional semantico-syntactic features for the text classifier. These features are calculated from a semantico-syntactic representation of the text. In our research, we used the Stanford CoreNLP parser and its “Universal++Dependencies” representation of the parse tree. It allowed us to handle some dependencies ...

Added: April 23, 2019

Advances in Information Retrieval

Kuznetsov S., Serdyukov P., Segalovich I. et al., L.: Springer, 2013.

Higher School of Economics (HSE) and supported by the Information Retrieval Specialist Group at the British Computer Society (BCS–IRSG). The conference was held during March 24–27, 2013, in Moscow, Russia – the easternmost location in the history of the ECIR series. ECIR 2013 received a total of 287 submissions in three categories: 191 full papers, ...

Added: April 15, 2013

Information Analysis of Industrial Enterprise Documentation

Chernikov B. V., Вестник машиностроения, СТИН 2013 No. 3 P. 74–78

The paper examines the components of industrial enterprise documentation and the basics of introducing lexicological synthesis of documents. It investigates the structure of enterprise documentation support and the composition, informativeness and content of documents ...

Added: April 15, 2013

An Ensemble Machine Learning Method Based on Classifier Recommendation

Kashnitsky Y., Ignatov D. I., Интеллектуальные системы. Теория и приложения 2015 Vol. 19 No. 4 P. 37–55

The paper gives a brief introduction to multiple classifier systems and describes a particular algorithm that improves classification accuracy by recommending an algorithm for each object. The recommendation rests on the hypothesis that a classifier is likely to predict the label of the object correctly if it has correctly classified its ...

Added: December 7, 2015

Classification of a Sequence of Objects with the Fuzzy Decoding Method

Savchenko A., Savchenko L. V., Lecture Notes in Artificial Intelligence 2014 Vol. 8536 P. 309–318

The problem of recognition of a sequence of objects (e.g., video-based image recognition, phoneme recognition) is explored. The generalization of the fuzzy phonetic decoding method is proposed by assuming the distribution of the classified object to be of exponential type. Its preliminary phase includes association of each model object with the fuzzy set of model ...

Added: July 25, 2014

Dimensionality Reduction in Simulation Modeling Problems

Агалаков Ю. Г., Bernstein A., Информационные технологии и вычислительные системы 2012 No. 3 P. 3–17

The paper considers data mining problems that have to be solved in predictive modeling technology. To reduce the complexity of these problems, predictive modeling relies on solutions to dimensionality reduction problems that must satisfy a number of additional conditions. The article discusses these additional requirements and formulates corresponding new, non-traditional statements of the dimensionality reduction problem. ...

Added: January 24, 2013