Concept Relation Discovery and Innovation Enabling Technology (CORDIET)

J. Poelmans; P. Elzinga; A. Neznanov; S. Viaene; S. Kuznetsov; D. I. Ignatov; G. Dedene

Chapter

P. 53–62.

Poelmans J., Elzinga P., Neznanov A., Viaene S., Kuznetsov S., Ignatov D. I., Dedene G.

Concept Relation Discovery and Innovation Enabling Technology (CORDIET) is a toolbox for gaining new knowledge from unstructured text data. At the core of CORDIET is the C-K theory, which captures the essential elements of innovation. The tool uses Formal Concept Analysis (FCA), Emergent Self Organizing Maps (ESOM) and Hidden Markov Models (HMM) as the main artifacts in the analysis process. The user can define temporal, text mining and compound attributes. The text mining attributes are used to analyze the unstructured text in documents, while the temporal attributes use the documents' timestamps for analysis. The compound attributes are XML rules based on text mining and temporal attributes. The user can cluster objects with object-cluster rules and can split the data into pieces with segmentation rules. The artifacts are optimized for efficient data analysis; object labels in the FCA lattice and ESOM map contain a URL on which the user can click to open the selected document.
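
To make the role of text mining attributes and FCA more concrete, here is a minimal sketch of how documents could be turned into a formal context and how its formal concepts could be enumerated. It does not reproduce CORDIET's implementation: the toy corpus, the attribute definitions and all function names are hypothetical, and the naive subset-closure enumeration is only practical for very small attribute sets.

```python
from itertools import combinations

# Hypothetical toy corpus: document id -> text (not taken from the paper).
DOCS = {
    "d1": "suspect threatened victim with knife",
    "d2": "victim reported threat by phone",
    "d3": "suspect seen with knife near station",
}

# Hypothetical text mining attributes: attribute name -> trigger terms.
ATTRIBUTES = {
    "weapon": {"knife", "gun"},
    "threat": {"threat", "threatened"},
    "suspect": {"suspect"},
}


def build_context(docs, attributes):
    """Return {doc_id: set of attribute names whose terms occur in the text}."""
    context = {}
    for doc_id, text in docs.items():
        tokens = set(text.lower().split())
        context[doc_id] = {a for a, terms in attributes.items() if tokens & terms}
    return context


def extent(context, intent):
    """Objects (documents) that have every attribute in `intent`."""
    return frozenset(o for o, attrs in context.items() if intent <= attrs)


def intent_of(context, objects, all_attrs):
    """Attributes shared by every object in `objects`."""
    shared = set(all_attrs)
    for o in objects:
        shared &= context[o]
    return frozenset(shared)


def formal_concepts(context, all_attrs):
    """Naive enumeration: close every attribute subset (toy data only)."""
    concepts = set()
    attrs = sorted(all_attrs)
    for r in range(len(attrs) + 1):
        for subset in combinations(attrs, r):
            ext = extent(context, set(subset))
            concepts.add((ext, intent_of(context, ext, all_attrs)))
    return concepts


context = build_context(DOCS, ATTRIBUTES)
concepts = formal_concepts(context, ATTRIBUTES.keys())

# Print concepts from the most general (largest extent) to the most specific.
for ext, itt in sorted(concepts, key=lambda c: (-len(c[0]), sorted(c[1]))):
    print(sorted(ext), sorted(itt))
```

Each printed pair is a formal concept: a maximal set of documents together with the maximal set of attributes they all share. Ordering these pairs by inclusion of their document sets gives the concept lattice that a tool of this kind would visualize.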

Language: English

Keywords: text analysis; Formal Concept Analysis; text mining; knowledge discovery; rule mining; taxonomy building

In book

CDUD'11 – Concept Discovery in Unstructured Data Workshop co-located with the 13th International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing (RSFDGrC-2011), June 2011, Moscow, Russia

Issue 757. Moscow: Higher School of Economics Publishing House, 2011.

Human-centered text mining: A new software system

Kuznetsov S., Poelmans J., Elzinga P. et al., Lecture Notes in Computer Science 2012 Vol. 7377 LNAI P. 258–272.

In this paper we introduce a novel human-centered data mining software system which was designed to gain intelligence from unstructured textual data. The architecture takes its roots in several case studies which were a collaboration between the Amsterdam-Amstelland Police, GasthuisZusters Antwerpen (GZA) hospitals and KU Leuven. It is currently being implemented by bachelor and master ...

Added: February 7, 2013

Texterra: An Infrastructure for Text Analysis

Turdakov D., Astrakhantsev N. A., Nedumov Ya. R. et al., Proceedings of the Institute for System Programming of the RAS 2014 Vol. 26 P. 421–438.

The paper presents a framework for fast text analytics developed during the Texterra project. Texterra is a technology for multilingual text mining based on novel text processing methods that exploit knowledge extracted from user-generated content. It delivers a fast, scalable solution for text mining without expensive customization. Depending on use-cases Texterra could be utilized ...

Added: November 6, 2017

14th International Conference on Formal Concept Analysis - Supplementary Proceedings

University Rennes 1, 2017.

This volume is the supplementary volume of the 14th International Conference on Formal Concept Analysis (ICFCA 2017), held from June 13th to 16th 2017, at IRISA, Rennes. The ICFCA conference series is one of the major venues for researchers from the field of Formal Concept Analysis and related areas to present and discuss their recent ...

Added: June 19, 2017

Conceptual Structures for STEM Research and Education, 20th International Conference on Conceptual Structures

Berlin, Heidelberg: Springer, 2013.

This book constitutes the proceedings of the 20th International Conference on Conceptual Structures, ICCS 2013, held in Mumbai, India, in January 2013. The 22 full papers presented were carefully reviewed and selected from 43 submissions for inclusion in the book. The volume also contains 3 invited talks. ICCS focuses on the useful representation and analysis ...

Added: June 2, 2013

Mining Complex Data Generated by Collaborative Platforms

Ignatov D. I., Kaminskaya A. Y., Bezzubtseva A. A. et al., in: Promising Research Directions in Business Informatics: Proceedings of the XI International Conference. Nizhny Novgorod: Higher School of Economics in Nizhny Novgorod, 2012. P. 7–17.

In a crowdsourcing project several participants discuss and solve one common problem, propose their ideas, evaluate ideas of each other, etc. We propose the novel instrument CrowDM for analyzing data generated by collaborative platforms. The initial version of the system combines several innovative techniques for structured and unstructured data analysis. Formal Concept Analysis, multimodal clustering ...

Added: December 3, 2012

Analysing Online Social Network Data with Biclustering and Triclustering

Gnatyshak D. V., Ignatov D. I., Semenov A. et al., in: Concept Discovery in Unstructured Data. 2nd International Workshop, CDUD 2012, Leuven, Belgium, May 2012, Proceedings. Issue 871. Leuven: Katholieke Universiteit Leuven, 2012. P. 30–39.

In this paper we propose two novel methods for analyzing data collected from online social networks. In particular, we analyze data from Vkontakte, a Russian online social network. Using biclustering we extract groups of users with similar interests and find communities of users which belong to similar groups. With triclustering we reveal users’ interests ...

Added: November 20, 2012

Recommender system for crowdsourcing platform Witology

Ignatov D. I., Kaminskaya A. Y., Konstantinova N. et al., in: Proceedings of The 2014 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2014, 11-14 August 2014, Warsaw, Poland. Los Alamitos, Washington, Tokyo: IEEE Computer Society, 2014. P. 327–335.

This paper discusses the recommender models and methods for crowdsourcing platforms. These models are based on modern methods of data analysis of object-attribute data, such as Formal Concept Analysis and biclustering. In particular, the paper is focused on the solution of two tasks – idea and antagonists recommendation – on the example of crowdsourcing platform ...

Added: June 9, 2014

Gaining Insight in Social Networks with Biclustering and Triclustering

Gnatyshak D. V., Ignatov D. I., Semenov A. et al., in: Perspectives in Business Informatics Research. 11th International Conference, BIR 2012, Nizhny Novgorod, Russia, September 2012, Proceedings. Issue 128. Berlin, Heidelberg: Springer, 2012. P. 162–171.

We combine bi- and triclustering to analyse data collected from the Russian online social network Vkontakte. Using biclustering we extract groups of users with similar interests and find communities of users which belong to similar groups. With triclustering we reveal users' interests as tags and use them to describe Vkontakte groups. After this social tagging ...

Added: December 3, 2012

Social Media: What Do Their Users Write About, and to Whom? Some Approaches to Data Analysis

Kotyrlo E., Applied Econometrics (Prikladnaya Ekonometrika) 2017 No. 3 P. 74–99.

Studying users and segmenting them by their preferred topics of discussion and their networking is a unique opportunity offered by social networks. The paper summarizes a variety of approaches to social media analysis based on social network analysis and text mining. It is extended by the application of a concentration index and visualization of the ...

Added: October 20, 2017

Proceedings of the International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI at IJCAI 2015)

Buenos Aires: [s.n.], 2015.

The three preceding editions of the FCA4AI Workshop showed that many researchers working in Artificial Intelligence are deeply interested in a well-founded method for classification and mining such as Formal Concept Analysis (see http://www.fca4ai.hse.ru/). The first edition of FCA4AI was co-located with ECAI 2012 in Montpellier and published as http://ceur-ws.org/Vol-939/, the second edition was ...

Added: August 5, 2015

Formal concept analysis in knowledge processing: A survey on applications

Poelmans J., Ignatov D. I., Kuznetsov S. et al., Expert Systems with Applications 2013 Vol. 40 No. 16 P. 6538–6560.

This is the second part of a large survey paper in which we analyze recent literature on Formal Concept Analysis (FCA) and some closely related disciplines using FCA. We collected 1072 papers published between 2003 and 2011 mentioning terms related to Formal Concept Analysis in the title, abstract and keywords. We developed a knowledge browsing ...

Added: October 3, 2013

A Lattice-based Consensus Clustering Algorithm

Bocharov A. A., Gnatyshak D. V., Ignatov D. I. et al., in: CLA 2016: Proceedings of the Thirteenth International Conference on Concept Lattices and Their Applications. CEUR Workshop Proceedings. Vol. 1624. Moscow: Higher School of Economics, National Research University, 2016. P. 45–56.

We propose a new algorithm for consensus clustering, FCA-Consensus, based on Formal Concept Analysis. As the input, the algorithm takes T partitions of a certain set of objects obtained by the k-means algorithm after T runs from different initialisations. The resulting consensus partition is extracted from an antichain of the concept lattice built on a formal ...

Added: October 24, 2016

Concept Discovery in Unstructured Data. 2nd International Workshop, CDUD 2012, Leuven, Belgium, May 2012, Proceedings

Ignatov D. I., Kuznetsov S., Poelmans J., Leuven: Katholieke Universiteit Leuven, 2012.

Added: November 20, 2012

Moscow: Higher School of Economics Publishing House, 2011.

Concept discovery is a Knowledge Discovery in Databases (KDD) research field that uses human-centered techniques such as Formal Concept Analysis (FCA), Biclustering, Triclustering, Conceptual Graphs etc. for gaining insight into the underlying conceptual structure of the data. Traditional machine learning techniques focus mainly on structured data, whereas most data available resides in unstructured, often ...

Added: December 3, 2012

Text Mining Scientific Papers: A Survey on FCA-Based Information Retrieval Research

Poelmans J., Ignatov D. I., Viaene S. et al., in: Advances in Data Mining. Applications and Theoretical Aspects. 12th Industrial Conference, ICDM 2012, Berlin, Germany, July 13-20, 2012. Proceedings. Vol. 7377. Berlin, Heidelberg: Springer, 2012. P. 273–287.

Formal Concept Analysis (FCA) is an unsupervised clustering technique and many scientific papers are devoted to applying FCA in Information Retrieval (IR) research. We collected 103 papers published between 2003 and 2009 which mention FCA and information retrieval in the abstract, title or keywords. Using a prototype of our FCA-based toolset CORDIET, we converted the PDF files containing ...

Added: December 3, 2012

Diagnostic Test Approaches to Machine Learning and Commonsense Reasoning Systems

Naidenova X., Ignatov D. I., Hershey: IGI Global, 2012.

The consideration of symbolic machine learning algorithms as an entire class will make it possible, in the future, to generate algorithms, with the aid of some parameters, depending on the initial users’ requirements and the quality of solving targeted problems in domain applications. Diagnostic Test Approaches to Machine Learning and Commonsense Reasoning Systems surveys, analyzes, and ...

Added: December 3, 2012

CDUD 2012 - Concept Discovery in Unstructured Data

Leuven: Katholieke Universiteit Leuven, 2012.

Concept discovery is a subarea of Knowledge Discovery in Databases (KDD) where concept models, such as Formal Concept Analysis (FCA), multimodal clustering, conceptual graphs and others, are used for gaining insight into the underlying conceptual structure of data. Traditional machine learning techniques focus mainly on structured data given by object-attribute tables, whereas most data available nowadays are given in ...

Added: March 10, 2013

Fuzzy and rough formal concept analysis: a survey

Poelmans J., Ignatov D. I., Kuznetsov S. et al., International Journal of General Systems 2014 Vol. 43 No. 2 P. 105–134.

Formal Concept Analysis (FCA) is a mathematical technique that has been extensively applied to Boolean data in knowledge discovery, information retrieval, web mining and similar applications. In recent years, research on extending FCA theory to cope with imprecise and incomplete information has made significant progress. In this paper, we give a systematic overview of the ...

Added: June 9, 2014

Formal Concept Analysis in knowledge processing: A survey on models and techniques

Poelmans J., Kuznetsov S., Ignatov D. I. et al., Expert Systems with Applications 2013 Vol. 40 No. 16 P. 6601–6623.

This is the first part of a large survey paper in which we analyze recent literature on Formal Concept Analysis (FCA) and some closely related disciplines using FCA. We collected 1072 papers published between 2003 and 2011 mentioning terms related to Formal Concept Analysis in the title, abstract and keywords. We developed a knowledge browsing ...

Added: October 3, 2013

Supplementary Proceedings of the 4th International Conference on Analysis of Images, Social Networks and Texts (AIST'2015)

Aachen: CEUR Workshop Proceedings, 2015.

This volume contains proceedings of the fourth conference on Analysis of Images, Social Networks and Texts (AIST’2015). The first three conferences in 2012–2014 attracted a significant number of students, researchers, academics and engineers working on interdisciplinary data analysis of images, texts, and social networks. The broad scope of AIST makes it an event where ...

Added: October 9, 2015

Processing and Analysis of Russian Strategic Planning Programs

Alekseychuk N. N., Sarkisyan V., Emelyanov A. et al., in: Digital Transformation and Global Society. Fourth International Conference, DTGS 2019, St. Petersburg, Russia, June 19–21, 2019, Revised Selected Papers. Springer, 2019. P. 68–81.

In this paper, we present a project on the analysis of an extensive corpus of strategic planning documents, devoted to various aspects of the development of Russian regions. The main purposes of the project are: 1) to extract different aspects of goal setting and planning, 2) to form an ontology of goals and criteria of ...

Added: October 30, 2019

Formal Concept Analysis: From Theory to Practice

Ignatov D. I., in: Analysis of Images, Networks and Texts. Proceedings of the All-Russian Scientific Conference AIST'12. Models, Algorithms and Tools for Data Analysis; Results and Opportunities for the Analysis of Images, Networks and Texts. Yekaterinburg, March 16–18, 2012. Issue 1. Moscow: National Open University "INTUIT", 2012. P. 3–15.

The paper gives the basic definitions of Formal Concept Analysis (FCA), discusses its role in mathematics and computer science, and provides a brief overview of its main applications. ...

Added: January 30, 2013

Proceedings of International Conference on Conceptual Structures 2014

Springer, 2014.

This book constitutes the proceedings of the 21st International Conference on Conceptual Structures, ICCS 2014, held in Iaşi, Romania, in July 2014. The 17 regular papers and 6 short papers presented in this volume were carefully reviewed and selected from 40 and 10 submissions, respectively. The topics covered are: conceptual structures, knowledge representation, reasoning, conceptual ...

Added: June 9, 2014

Triadic Formal Concept Analysis and triclustering: searching for optimal patterns

Ignatov D. I., Gnatyshak D. V., Kuznetsov S. O. et al., Machine Learning 2015 Vol. 101 No. 1 P. 271–302.

This paper presents several definitions of “optimal patterns” in triadic data and results of experimental comparison of five triclustering algorithms on real-world and synthetic datasets. The evaluation is carried out over criteria such as resource efficiency, noise tolerance and quality scores involving cardinality, density, coverage, and diversity of the patterns. An ideal triadic pattern is a totally dense ...

Added: April 14, 2015
