Fast and modular regularized topic modelling 21st Conference of Open Innovations Association, FRUCT 2017; Helsinki; Finland; 6 November 2017 до 10 November 2017; Номер категорииCFP1767Z-ART; Код 134240. Val. 1.

Kochedykov D.; Apishev M.; Golitsyn L.

doi:10.23919/FRUCT.2017.8250181

Publications

?

Fast and modular regularized topic modelling 21st Conference of Open Innovations Association, FRUCT 2017; Helsinki; Finland; 6 November 2017 до 10 November 2017; Номер категорииCFP1767Z-ART; Код 134240

Vol. 1. IEEE Computer Society, 2017.

Vorontsov K. V., Kochedykov D., Apishev M., Golitsyn L.

Topic modelling is an area of text mining that has been actively developed in the last 15 years. A probabilistic topic model extracts a set of hidden topics from a collection of text documents. It defines each topic by a probability distribution over words and describes each document with a probability distribution over topics. In applications, there are often many requirements, such as, for example, problem-specific knowledge and additional data, to be taken into account. Therefore, it is natural for topic modelling to be considered a multiobjective optimization problem. However, historically, Bayesian learning became the most popular approach for topic modelling. In the Bayesian paradigm, all requirements are formalized in terms of a probabilistic generative process. This approach is not always convenient due to some limitations and technical difficulties. In this work, we develop a non-Bayesian multiobjective approach called the Additive Regularization of Topic Models (ARTM). It is based on regularized Maximum Likelihood Estimation (MLE), and we show that many of the well-known Bayesian topic models can be re-formulated in a much simpler way using the regularization point of view. We review some of the most important types of topic models: multimodal, multilingual, temporal, hierarchical, graph-based, and short-text. The ARTM framework enables easy combination of different types of models to create new models with the desired properties for applications. This modular 'lego-style' technology for topic modelling is implemented in the open-source library BigARTM. © 2017 FRUCT.

Research target: Computer Science

Priority areas: IT and mathematics

Language: English

DOI

Text on another site

Keywords: data mining Bayesian learning multiobjective optimization Regenerative process Multi-objective optimization problem Multiobjective approach Graphic methods Technical difficulties Probabilistic topic models

Fast and modular regularized topic modelling 21st Conference of Open Innovations Association, FRUCT 2017; Helsinki; Finland; 6 November 2017 до 10 November 2017; Номер категорииCFP1767Z-ART; Код 134240

Network Algorithms, Data Mining, and Applications. Springer Proceedings in Mathematics & Statistics

Springer, 2020

This proceedings presents the result of the 8th International Conference in Network Analysis, held at the Higher School of Economics, Moscow, in May 2018. The conference brought together scientists, engineers, and researchers from academia, industry, and government. Contributions in this book focus on the development of network algorithms for data mining and its applications. Researchers and ...

Added: December 10, 2019

Proceedings of the Fifthteenth International Conference on Concept Lattices and Their Applications

CEUR-WS.org, 2020

The CLA conference is an international forum for researchers, practitioners and students dedicated to the practice of Formal Concept Analysis (FCA) and areas closely related to it, including data analysis and mining, information retrieval, knowledge management, knowledge engineering, logic, algebra and lattice theory. The 15th of CLA, CLA 2020, was going to be held in Tallinn, Estonia ...

Added: October 30, 2020

Ответ на рецензию Ю.Ю. Петрунина «Астрология, нейронные сети и управление персоналом»

Yasnitsky L., Журнал формирующихся направлений науки 2015 Т. 3 № 7

The article presents selected excerpts of the debate, which the doctor of philosophical Sciences, Professor of Moscow state University Yu. Yu. Petrunin. ...

Added: February 23, 2016

Proceedings of IADIS European Conference on Data Mining 2008

Amsterdam : [б.и.], 2008

Added: December 7, 2018

Intelligent Systems and Applications

Cham : Springer, 2019

Intelligent Systems Conference (IntelliSys) 2018 is the fourth research conference in the series. This conference is a part of SAI conferences being held since 2013. The conference series has featured keynote talks, special sessions, poster presentation, tutorials, workshops, and contributed papers each year. The conference focus on areas of intelligent systems and artificial intelligence (AI) and ...

Added: August 29, 2018

AIST: International Conference on Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Kazan, Russia, July 17–19, 2019, Revised Selected Papers

Springer, 2020

This book constitutes the proceedings of the 8th International Conference on Analysis of Images, Social Networks and Texts, AIST 2019, held in Kazan, Russia, in July 2019. The 24 full papers and 10 short papers were carefully reviewed and selected from 134 submissions (of which 21 papers were rejected without being reviewed). The papers are organized ...

Added: February 9, 2020

Machine Learning and Data Mining in Pattern Recognition

Springer, 2014

This book constitutes the refereed proceedings of the 10th International Conference on Machine Learning and Data Mining in Pattern Recognition, MLDM 2014, held in St. Petersburg, Russia in July 2014. The 40 full papers presented were carefully reviewed and selected from 128 submissions. The topics range from theoretical topics for classification, clustering, association rule and ...

Added: September 30, 2014

Analysis of Images, Social Networks and Texts Third International Conference, AIST 2014, Yekaterinburg, Russia, April 10-12, 2014, Revised Selected Papers

Berlin : Springer, 2014

This book constitutes the proceedings of the Third International Conference on Analysis of Images, Social Networks and Texts, AIST 2014, held in Yekaterinburg, Russia, in April 2014. The 11 full and 10 short papers were carefully reviewed and selected from 74 submissions. They are presented together with 3 short industrial papers, 4 invited papers and ...

Added: November 13, 2014

Проектирование IoT-платформы для управления энергоресурсами интеллектуальных зданий

Kychkin A., Deryabin A. I., Vikentyeva O. et al., Прикладная информатика 2018 Т. 13 № 4 С. 29-41

The problem of designing a cyberphysical system used as a service for Smart buildings control using Internet technologies — Internet of Things (IoT) is considered. Such software platforms are part of the complex systems of the BEMS — Building Energy Management Systems and are an instrument for implementing energy savings in buildings. IoT servers and ...

Added: September 5, 2018

2013 IEEE 13th International Conference on Data Mining Workshops

Los Alamitos : IEEE Computer Society, 2013

The 13rd IEEE International Conference on Data Mining (IEEE ICDM 2013) has solicited workshops on topics related to new research directions and novel applications of data mining. The goal of the ICDM workshops program (IEEE ICDMW) is to identify grand challenges in data mining, to explore the possible paths to address these urgent problems, and ...

Added: December 26, 2013

Computational Linguistics and Intelligent Text Processing, Lecture Notes in Computer Science

Springer, 2015

16th International Conference, CICLing 2015, Cairo, Egypt, April 14-20, 2015, Proceedings, Part I ISBN: 978-3-319-18110-3 (Print) 978-3-319-18111-0 (Online) ...

Added: April 23, 2015

Язык сценариев как инструмент аналитической обработки в открытой системе автоматизированного анализа текста

Politsyna E., Балакирев Н. Е., Вестник Воронежского государственного университета. Серия: Системный анализ и информационные технологии 2013 № 1 С. 162-168

The article reveals the necessity of creating new user-level text analysis tools which should provide facilities for the open text analysis system for extending its functionality by users. The article shows details of the open text analysis system and used text analyses approaches which it is based on. A script language is suggested as an expandable tool for ...

Added: November 5, 2015

Proceedings of the First International Scientific Conference “Intelligent Information Technologies for Industry” (IITI’16)

Springer, 2016

This volume of Advances in Intelligent Systems and Computing contains papers presented in the main track of IITI 2016, the First International Conference on Intelligent Information Technologies for Industry held in May 16-21 in Sochi, Russia. The conference was jointly co-organized by Rostov State Transport University (Russia) and VŠB – Technical University of Ostrava (Czech ...

Added: June 3, 2016

2017 IEEE 17th International Conference on Data Mining (ICDM)

New Orleans : IEEE, 2017

Added: September 26, 2017

Formal Concept Analysis: 16th International Conference, ICFCA 2021, Strasbourg, France, June 29 – July 2, 2021, Proceedings

Springer, 2021

This book constitutes the proceedings of the 16th International Conference on Formal Concept Analysis, ICFCA 2021, held in Strasbourg, France, in June/July 2021. The 14 full papers and 5 short papers presented in this volume were carefully reviewed and selected from 32 submissions. The book also contains four invited contributions in full paper length. The research part ...

Added: July 10, 2021

Communications in Computer and Information Science, Vol. 542, Springer, 2015

Semenov A., Natekin A., Nikolenko S. I. et al., Springer, 2015

In online social networks, high level features of user behavior such as character traits can be predicted with data from user profiles and their connections. Recent publications use data from online social networks to detect people with depression propensity and diagnosis. In this study, we investigate the capabilities of previously published methods and metrics applied to the Russian online social ...

Added: December 21, 2015

Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing

Heidelberg : Springer, 2013

This paper comprises papers accepted for presentation at the 14th Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing (RSFDGRC) International Conference which was held as a major part of Joint Rough Set Symposium (JRS 2013) held at Halifax Canada during October 11-14, 2013. ...

Added: October 29, 2013

Proceedings of the 7th Spring/Summer Young Researchers’ Colloquium on Software Engineering, SYRCoSE 2013

Kazan : -, 2013

The issue contains the papers presented at the 7th Spring/Summer Young Researchers' Соllоquium оn Software Engineering (SYRCoSE 2013) held in Kazan, Russia on 30th and З1st оf Мay, 2013. Paper selection was based on a competitive peer review process being done by the program committee. Both regular and reseаrсh-in-рrogrеss papers were соnsidered ассeрtable for the ...

Added: June 8, 2013

The State and Perspective of Russian Studies in Artificial Intelligence (Based on the Proceedings of the 13th Russian Conference on Artificial Intelligence with International Participation)

Mikheyenkova M., Druzhinina E., Automatic Documentation and Mathematical Linguistics 2013 Vol. 47 No. 1 P. 36-43

The main directions of research in the field of artificial intelligence are presented on the basis of the Proceedings of the 13th Russian Conference on Artificial Intelligence with International Participation. ...

Added: September 28, 2013

Data Analytics and Management in Data Intensive Domains. 23rd International Conference, DAMDID/RCDL 2021, Moscow, Russia, October 26–29, 2021, Revised Selected Papers

Springer, 2022

“Data Analytics and Management in Data Intensive Domains” conference (DAMDID) is planned as a multidisciplinary forum of researchers and practitioners from various domains of science and research promoting cooperation and exchange of ideas in the area of data analysis and management in data intensive domains. Approaches to data analysis and management being developed in specific data intensive domains of X-informatics (such as X = astro, bio, chemo, geo, medicine, neuro, physics, ...

Added: August 30, 2021

CEUR Workshop Proceedings Volume 2416

CEUR Workshop Proceedings, 2019

This volume contains the papers presented at the session "Data Science" within the V International Conference on Information Technology and Nanotechnology (ITNT-2019). The conference was held in Samara, Russia, during May 21-24, 2019 (itnt-conf.org). The conference is a forum for leading researchers from all over the world aimed to discuss the latest advances in the ...

Added: September 13, 2019

On mining complex sequential data by means of FCA and pattern structures

Buzmakov A. V., Egho E., Jay N. et al., International Journal of General Systems 2016 Vol. 45 No. 2 P. 135-159

Nowadays data-sets are available in very complex and heterogeneous ways. Mining of such data collections is essential to support many real-world applications ranging from healthcare to marketing. In this work, we focus on the analysis of “complex” sequential data by means of interesting sequential patterns. We approach the problem using the elegant mathematical framework of ...

Added: February 25, 2016

Выявление академически неуспешных студентов на первом году обучения в университете на примере НИУ ВШЭ-Нижний Новгород

Shadrina E. V., Oshmarina O. E., Булычева П. А., Вестник Нижегородского университета им. Н.И. Лобачевского. Серия: Социальные науки 2016 № 2(42) С. 136-143

The article suggests the analysis of factors influencing academic success of the first year university students. Analysis was held with the help of statistics methods and methods of data mining. For the initial data the researcher takes information about students obtained by on-line system, which supports the educational process in HSE - LMS (Learning Management ...

Added: February 4, 2016

Bayesian Learning of Consumer Preferences for Residential Demand Response

Губко М. В., Kuznetsov S., Neznanov A. et al., IFAC-PapersOnLine 2016 Vol. 49 No. 32 P. 24-29

In coming years residential consumers will face real-time electricity tariffs with energy prices varying day to day, and effective energy saving will require automation - a recommender system, which learns consumer's preferences from her actions. A consumer chooses a scenario of home appliance use to balance her comfort level and the energy bill. We propose ...

Added: January 24, 2017