Recommender system for crowdsourcing platform Witology

D. I. Ignatov; A. Y. Kaminskaya; Konstantinova N.; A. V. Konstantinov

?

Recommender system for crowdsourcing platform Witology

P. 327-335.

Ignatov D. I., Kaminskaya A. Y., Konstantinova N., Konstantinov A. V.

This paper discusses the recommender models and methods for crowdsourcing platforms. These models are based on modern methods of data analysis of object-attribute data, such as Formal Concept Analysis and biclustering. In particular, the paper is focused on the solution of two tasks – idea and antagonists recommendation – on the example of crowdsourcing platform Witology.

Language: English

Full text

Keywords: бикластеризация crowdsourcing краудсорсинг анализ формальных понятий Formal Concept Analysis biclustering рекомендательные системы Recommender Systems

Publication based on the results of:

Mathematical models, algorithms and software for data mining in the text and the structural form (2014)

In book

Proceedings of The 2014 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2014, 11-14 August 2014 Warsaw, Poland

Los Alamitos, Washington, Tokyo : IEEE Computer Society, 2014

FCA-Based Recommender Models and Data Analysis for Crowdsourcing Platform Witology

Ignatov D. I., Kaminskaya A. Y., Malioukov A. et al., , in : Proceedings of International Conference on Conceptual Structures 2014. Vol. 8577: Graph-Based Representation and Reasoning.: Springer, 2014. P. 287-292.

This paper considers a recommender part of the data anal- ysis system for the collaborative platform Witology. It was developed by the joint research team of the National Research University Higher School of Economics and the Witology company. This recommender sys- tem is able to recommend ideas, like-minded users and antagonists at the respective phases ...

Added: June 9, 2014

CDUD'11 – Concept Discovery in Unstructured Data Workshop co-located with the 13th International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing (RSFDGrC-2011), June 2011, Moscow, Russia

M. : Higher School of Economics Publishing House, 2011

Concept discovery is a Knowledge Discovery in Databases (KDD) research field that uses human-centered techniques such as Formal Concept Analysis (FCA), Biclustering, Triclustering, Conceptual Graphs etc. for gaining insight into the underlying conceptual structure of the data. Traditional machine learning techniques are mainly focusing on structured data whereas most data available resides in unstructured, often ...

Added: December 3, 2012

Recommender System Based on Algorithm of Bicluster Analysis RecBi

Ignatov D. I., Poelmans J., Zaharchuk V. V., , in : CDUD'11 – Concept Discovery in Unstructured Data Workshop co-located with the 13th International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing (RSFDGrC-2011), June 2011, Moscow, Russia. Issue 757.: M. : Higher School of Economics Publishing House, 2011. P. 122-126.

In this paper we propose two new algorithms based on biclustering analysis, which can be used at the basis of a recommender system for educational orientation of Russian School graduates. The first algorithm was designed to help students make a choice between different university faculties when some of their preferences are known. The second algorithm ...

Added: December 3, 2012

RAPS: A Recommender Algorithm Based on Pattern Structures

Ignatov D. I., Корнилов Д. И., , in : Proceedings of the International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI at IJCAI 2015). : Buenos Aires : [б.и.], 2015. P. 87-98.

We propose a new algorithm for recommender systems with numeric ratings which is based on Pattern Structures (RAPS). As the input the algorithm takes rating matrix, e.g., such that it contains movies rated by users. For a target user, the algorithm returns a rated list of items (movies) based on its previous ratings and ratings ...

Added: October 23, 2015

Analysing Online Social Network Data with Biclustering and Triclustering

Gnatyshak D. V., Ignatov D. I., Semenov A. et al., , in : Concept Discovery in Unstructured Data. 2nd International Workshop, CDUD 2012, Leuven, Belgium, May 2012, Proceedings. Issue 871.: Leuven : Katholieke Universiteit Leuven, 2012. P. 30-39.

In this paper we propose two novel methods for analyzing data collected from online social networks. In particular we will do analyses on Vkontake data (Russian online social network). Using biclustering we extract groups of users with similar interests and find communities of users which belong to similar groups. With triclustering we reveal users’ interests ...

Added: November 20, 2012

Gaining Insight in Social Networks with Biclustering and Triclustering

Gnatyshak D. V., Ignatov D. I., Semenov A. et al., , in : Perspectives in Business Informatics Research. 11th International Conference, BIR 2012, Nizhny Novgorod, Russia, September 2012 Proceedings. Issue 128.: Berlin, Heidelberg : Springer, 2012. P. 162-171.

We combine bi- and triclustering to analyse data collected from the Russian online social network Vkontakte. Using biclustering we extract groups of users with similar interests and find communities of users which belong to similar groups. With triclustering we reveal users' interests as tags and use them to describe Vkontakte groups. After this social tagging ...

Added: December 3, 2012

Multimodal Clustering for Community Detection

Ignatov D. I., Semenov A., Комиссарова Д. В. et al., , in : Formal Concept Analysis of Social Networks. : Springer, 2017. Ch. 4. P. 59-96.

Multimodal clustering is an unsupervised technique for mining interesting patterns in n-ary relations or n-mode networks. Among different types of such generalised patterns one can find biclusters and formal concepts (maximal bicliques) for two-mode case, triclusters and triconcepts for three-mode case, closed n-sets for n-mode case, etc. Object-attribute biclustering (OA-biclustering) for mining large binary datatables (formal contexts or two-mode ...

Added: December 17, 2017

Визуальная аналитика в задаче трикластеризации данных социальных сетей

Kashnitsky Y., В кн. : Труды Международной конференции по физико-технической информатике CPT-2013, 12-19 мая 2013 г., Ларнака, Республика Кипр. : М., Протвино : Изд-во ИФТИ, 2013. С. 251-258.

Triclustering is an outgrowth of Formal Concept Analysis intented to detect groups of objects with similar properties (clusters) in a context of three sets of entities. In case of social network analysis, for instance, these sets might be users, their interests and events they take part in. Triclustering here can help to detect users with similar ...

Added: January 27, 2014

Concept-based Recommendations for Internet Advertisement

Ignatov D. I., Kuznetsov S., , in : CLA 2008. Proceedings of the Sixth International Conference on Concept Lattices and Their Applications. : Olomouc : Palacky University, 2008. P. 157-166.

The problem of detecting terms that can be interesting to the advertiser is considered. If a company has already bought some advertising terms which describe certain services, it is reasonable to find out the terms bought by competing companies. A part of them can be recommended as future advertising terms to the company. The goal ...

Added: December 9, 2012

Object-Attribute Biclustering for Elimination of Missing Genotypes in Ischemic Stroke Genome-Wide Data

Ignatov D. I., Khvorykh G. V., Khrunin A. V. et al., / Springer. Series LNCS "Lecture Notes in Computer Science". 2020.

Missing genotypes can affect the effcacy of machine learning approaches to identify the risk genetic variants of common diseases and traits. The problem occurs when genotypic data are collected from different experiments with different DNA microarrays, each being characterised by its pattern of uncalled (missing) genotypes. This can prevent the machine learning classifier from assigning ...

Added: November 10, 2020

Возможности формирования стратегии организации с использованием краудсорсинга

Dolzhenko R. A., Проблемы теории и практики управления 2014 № 4 С. 125-129

В статье рассмотрены возможности использования краудсорсинга при определении основных стратегических направлений развития компании. Определена суть краудсорсинга, выделены типы краудсорсинга. Рассмотрен опыт Сбербанка России в этой области, который первым среди отечественных компаний, привлёк сообщество заинтересованных работников к определению стратегических направлений развития Банка на период до 2018 года. ...

Added: October 21, 2014

Online recommender system for radio station hosting based on information fusion and adaptive tag-aware profiling

Ignatov D. I., Nikolenko S. I., Abaev T. et al., Expert Systems with Applications 2016 Vol. 55 P. 546-558

We present a new recommender system developed for the Russian interactive radio network FMhost. To the best of our knowledge, it is the first model and associated case study for recommending radio stations hosted by real DJs rather than automatically built streamed playlists. To address such problems as cold start, gray sheep, boosting of rankings, preference ...

Added: June 28, 2016

Anaphoric annotation and corpus-based anaphora resolution: An experiment

Alexeeva S. V., Protopopova E. V., Bodrova A. A. et al., Компьютерная лингвистика и интеллектуальные технологии 2014 P. 562-571

The paper describes the noun phase and anaphora annotation in OpenCorpora and compares it to that in other corpora. We discuss the choice of representative texts for anaphoric annotation and the basic principles of syntactic annotation. In case of noun phrase annotation we followed the scheme introduced earlier for morphological annotation: it was carried out ...

Added: October 8, 2014

Proceedings of International Conference on Conceptual Structures 2014

Springer, 2014

This book constitutes the proceedings of the 21st International Conference on Conceptual Structures, ICCS 2014, held in Iaşi, Romania, in July 2014. The 17 regular papers and 6 short papers presented in this volume were carefully reviewed and selected from 40 and 10 submissions, respectively. The topics covered are: conceptual structures, knowledge representation, reasoning, conceptual ...

Added: June 9, 2014

Краудсорсинговые технологии в маркетинге

Tsaplin E., Бушеленкова С. В., Проблемы теории и практики управления 2015 № 2 С. 127-132

This article discusses the concept of web 3.0 and the upcoming changes related to the use of this technology in the marketing field. The exploratory study presents the results of the research of possible development directions of crowdsourcing technologies, remote work, and management of unstructured distributed data in the open Internet. The article discusses in ...

Added: October 19, 2014

Краудсорсинг как инструмент доработки нормативных документов в организации: возможности и ограничения

Dolzhenko R. A., Известия высших учебных заведений. Серия: Экономика, финансы и управление производством 2015 Т. 23 № 1 С. 67-75

В статье рассмотрены возможности использования краудсорсинга для обсуждения и доработки отчёта о корпоративной социальной ответственности силами заинтересованной общественности. Определена суть краудсорсинга, описаны этапы краудсорсинга документов организации. Выделены преимущества его использования для обсуждения и доработки документа, как для организации, так и для участников. Рассмотрен опыт Сбербанка России в этой области, который первым среди отечественных компаний, на ...

Added: April 18, 2015

Некоторые аспекты оценки эффективности использования краудсорсинга в организации

Dolzhenko R. A., Экономический анализ: теория и практика 2014 № 36 С. 30-38

The article considers the possibility of evaluating the effectiveness of using the crowdsourcing in the organization. The nature of crowdsourcing and ingredients that make up effect of using this technology are considered. The performance and effectiveness of crowdsourcing activities are compared. The indicators for evaluating the effectiveness of crowdsourcing in the organization are presented. Author ...

Added: November 6, 2014

Extracting social networks from literary text with word embedding tools

Wohlgenannt G., Artemova E., Ilvovsky D., , in : Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH). : Osaka : [б.и.], 2016. Ch. 4. P. 18-26.

In this paper a social network is extracted from a literary text. The social network shows, how frequent the characters interact and how similar their social behavior is. Two types of similarity measures are used: the first applies co-occurrence statistics, while the second exploits cosine similarity on different types of word embedding vectors. The results ...

Added: March 6, 2017

Semi-supervised Tag Extraction in a Web Recommender System

Leksin V., Nikolenko S. I., , in : Proceedings of the 6th International Conference on Similarity Search and Applications (SISAP 2013), Lecture Notes in Computer Science. Vol. 8199.: Berlin, Heidelberg : Springer, 2013. P. 206-212.

An important characteristic feature of recommender systems for web pages is the abundance of textual information in and about the items being recommended (web pages). To improve recommendations and enhance user experience, we propose to use automatic tag (keyword) extraction for web pages entering the recommender system. We present a novel tag extraction algorithm that ...

Added: September 27, 2013

Индивидуальная «агентность» как элемент человеческого потенциала: виды, проявления и эффекты в корпоративном секторе. Научный дайджест №10 (27)

Sorokin P. S., Afanaseva I., Шмаевка В. К. et al., М. : Издательский дом НИУ ВШЭ, 2023

The issue of agency (enterprise, initiative) is one of the central ones for the corporate sector. The key factor determining the importance of this issue is the processes of ‘destructuration’, that is, the growth of variability in the forms of social organization in various spheres of public life. The authors identified three levels of proactive behavior ...

Added: November 16, 2023

Preliminary Results on Mixed Integer Programming for Searching Maximum Quasi-Bicliques and Large Dense Biclusters

Ignatov D. I., Ivanova P., Zamaletdinova A. et al., , in : Supplementary Proceedings ICFCA 2019 Conference and Workshops. Vol. 2378.: CEUR Workshop Proceedings, 2019. P. 28-32.

This short paper is related to the problem of finding maximum quasi-bicliques in a bipartite graph (bigraph). A quasi-biclique in a bigraph is its “almost” complete subgraph; here, we assume that the subgraph is a quasi-biclique if it lacks γ · 100% of the edges to become a biclique. The problem of finding the maximal ...

Added: October 31, 2019

Object-Attribute Biclustering for Elimination of Missing Genotypes in Ischemic Stroke Genome-Wide Data

Ignatov D. I., Khvorykh G., Khrunin A. et al., , in : Recent Trends in Analysis of Images, Social Networks and Texts. 9th International Conference, AIST 2020, Skolkovo, Moscow, Russia, October 15–16, 2020 Revised Supplementary Proceedings. Vol. 12602.: Springer, 2021. P. 185-204.

© 2021, Springer Nature Switzerland AG.Missing genotypes can affect the efficacy of machine learning approaches to identify the risk genetic variants of common diseases and traits. The problem occurs when genotypic data are collected from different experiments with different DNA microarrays, each being characterised by its pattern of uncalled (missing) genotypes. This can prevent the ...

Added: November 1, 2022

Triadic Formal Concept Analysis and triclustering: searching for optimal patterns

Ignatov D. I., Gnatyshak D. V., Sergei O. Kuznetsov et al., Machine Learning 2015 Vol. 101 No. 1 P. 271-302

This paper presents several definitions of “optimal patterns” in triadic data and results of experimental comparison of five triclustering algorithms on real-world and synthetic datasets. The evaluation is carried over such criteria as resource efficiency, noise tolerance and quality scores involving cardinality, density, coverage, and diversity of the patterns. An ideal triadic pattern is a totally dense ...

Added: April 15, 2015

Link Prediction Regression for Weighted Co-authorship Networks

Gerasimova O., Makarov I., , in : Advances in Computational Intelligence. IWANN 2019. : Berlin : Springer, 2019. P. 667-677.

In this paper, we study the problem of predicting quantity of collaborations in co-authorship network. We formulated our task in terms of link prediction problem on weighted co-authorship network, formed by authors writing papers in co-authorship represented by edges between authors in the network. Our task is formulated as regression for edge weights, for which ...

Added: July 29, 2019