Full-text Search in Intermediate Data Storage of FCART

A. Neznanov; A. Parinov



The speed of full-text search directly affects the process of text analysis. A search engine builds a text index, which it then uses for fast full-text search. Solr and ElasticSearch are two popular search engines. A text analysis system must be able to search and index quickly at the same time. This paper describes the preprocessing workflow of the analysis system Formal Concept Analysis Research Toolbox (FCART) and an experiment in which social networking service data is searched and indexed simultaneously. The results of the experiment show which search engine is better suited as the core of the FCART search subsystem.
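To make the idea of a text index and of searching while indexing concrete, here is a minimal sketch in Python. It is not the FCART implementation and does not use the Solr or ElasticSearch APIs; the class `InvertedIndex`, the function `run_demo`, and the sample posts are hypothetical names and data introduced only for illustration. It assumes a simple in-memory inverted index guarded by a lock, with one thread adding documents while another runs queries.

```python
import re
import threading
from collections import defaultdict


class InvertedIndex:
    """Toy in-memory text index: maps each token to the ids of documents containing it."""

    def __init__(self):
        self._postings = defaultdict(set)
        # Guards _postings so that indexing and searching can run concurrently.
        self._lock = threading.Lock()

    @staticmethod
    def tokenize(text):
        # Lowercase and split on non-word characters; real engines use richer analyzers.
        return re.findall(r"\w+", text.lower())

    def add(self, doc_id, text):
        tokens = set(self.tokenize(text))
        with self._lock:
            for token in tokens:
                self._postings[token].add(doc_id)

    def search(self, query):
        """Return the sorted ids of documents that contain every query token."""
        tokens = self.tokenize(query)
        if not tokens:
            return []
        with self._lock:
            # Copy the posting sets under the lock; .get avoids creating empty entries.
            sets = [set(self._postings.get(t, ())) for t in tokens]
        return sorted(set.intersection(*sets))


def run_demo(n_docs=1000, n_queries=100):
    """Index hypothetical posts in one thread while querying from another."""
    index = InvertedIndex()
    posts = {i: f"post {i} about formal concept analysis" for i in range(n_docs)}
    hits_seen = []

    def writer():
        for doc_id, text in posts.items():
            index.add(doc_id, text)

    def reader():
        for _ in range(n_queries):
            hits_seen.append(len(index.search("concept analysis")))

    threads = [threading.Thread(target=writer), threading.Thread(target=reader)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return index, hits_seen
```

Calling `run_demo()` returns the filled index and the hit counts the reader observed while indexing was still in progress. A single lock keeps the sketch short; production search engines rely on more elaborate mechanisms to keep queries fast during indexing, and measuring that behavior is what the experiment described above is about.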

Language: English


Keywords: software; data mining; Formal Concept Analysis; social network analysis; knowledge extraction; big data

Publication based on the results of:

Mining Data with Complex Structure and Semantic Technologies (2016)

In book

RuZA 2015 Workshop. Proceedings of Russian and South African Workshop on Knowledge Discovery Techniques Based on Formal Concept Analysis (RuZA 2015). November 30 - December 5, 2015, Stellenbosch, South Africa

Vol. 1552. Aachen: CEUR Workshop Proceedings, 2015.
