A System for Automatic Processing of Russian-Language Texts

М. С. Дубов; Б. Г. Миркин; А. А. Шаль

Открытые системы. СУБД. 2014. No. 10. P. 15–17.

Dubov M., Mirkin B., Шаль А. А.

Automating text processing and analysis is currently a major trend in IT applications. At present, there is no unified approach to analyzing and visualizing large volumes of text data. Our system, LM Monitor (Latent Meaning Monitor), generates so-called reference graphs, which can be considered part of content analysis, a popular technique in sociology. Content analysis usually works with distributions of observations over categories, whereas LM Monitor analyzes related pairs of categories.
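The contrast between per-category distributions and related pairs of categories can be illustrated with a minimal sketch. This is not LM Monitor's algorithm, which the abstract does not describe. The toy documents, the category labels, and the function names `category_distribution` and `category_pair_counts` are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy input: each document is already mapped to the set of
# categories it mentions (how LM Monitor does this mapping is not stated).
documents = [
    {"economy", "politics"},
    {"economy", "sport"},
    {"economy", "politics", "sport"},
    {"culture"},
]

def category_distribution(docs):
    """Distribution view: how many documents mention each category."""
    counts = Counter()
    for cats in docs:
        counts.update(cats)
    return counts

def category_pair_counts(docs):
    """Pair view: how many documents mention both categories of a pair."""
    counts = Counter()
    for cats in docs:
        # sorted() gives each unordered pair one canonical (a, b) form
        counts.update(combinations(sorted(cats), 2))
    return counts

distribution = category_distribution(documents)
pairs = category_pair_counts(documents)
```

Pair counts of this kind could serve as edge weights in a graph over categories. Whether LM Monitor's reference graphs are built this way is not stated in the abstract.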

Research target: Computer Science

Priority areas: IT and mathematics

Language: Russian

Keywords: automated text processing; big data; unstructured text analysis; big data analytics; unstructured data analysis

Publication based on the results of:

Methods for visualizing textual information by constructing suffix trees, multifaceted classifications, and hierarchical ontologies: algorithms and software (2013)

DATA ANALYTICS 2014, The Third International Conference on Data Analytics

[s.n.], 2014.

Full texts of the Third International Conference on Data Analytics are presented. ...

Added: October 13, 2014

Perspectives of intellectual processing of large volumes of astronomical data using neural networks

Gorbunov A. A., Isaev E., Samodurov V., Journal of Physics: Conference Series 2018 Vol. 945 No. 1 P. 1–4

Astronomical observations produce vast amounts of data. The BSA (Big Scanning Antenna) of LPI, used to study impulse phenomena, logs 87.5 GB of data daily (32 TB per year). These data have important implications for both short- and long-term monitoring of various classes of radio sources (including radio transients of different nature), ...

Added: October 15, 2017

Storing and processing the social network graph

Polyakov I. V., Chepovskiy A., Chepovskiy A., Вестник Новосибирского государственного университета. Серия: Информационные технологии 2013 Vol. 11 No. 4 P. 77–83

This paper presents a special data structure for storing and operating on large social graphs. We mainly discuss searching for graph paths, obtaining subgraphs, and adding new edges and vertices. ...

Added: October 17, 2013

Cluster analysis of cardiological data

Зимина Е. Ю., Статистика и Экономика 2018 Vol. 15 No. 2 P. 30–37

The article reviews cluster analysis of medical data, using cardiac data as an example. Among the most effective and commonly used data mining methods applied to large amounts of information (for example, in mathematical economics) are clustering methods: the search for signs of similarity between objects in the study of the subject area ...

Added: May 29, 2018

Creating virtual Apache Spark clusters in cloud environments using orchestration systems

Борисенко О. Д., Пастухов Р. К., С.Д. Кузнецов, Труды Института системного программирования РАН 2016 Vol. 28 No. 6 P. 111–120

Apache Spark is a framework that provides fast computations on Big Data using the MapReduce model. Cloud environments make Big Data processing more flexible, since they allow virtual clusters to be created on demand. One of the most powerful open-source cloud environments is OpenStack. The main goal of this project is to provide the ability to create virtual ...

Added: January 25, 2018

Applying MapReduce to Conformance Checking

Shugurov I., Mitsyuk A. A., Proceedings of the Institute for System Programming of the RAS 2016 Vol. 28 No. 3 P. 103–122

Process mining is a relatively new research field that offers methods for analyzing and improving business processes based on their execution history (event logs). Conformance checking is one of the main sub-fields of process mining. Conformance checking algorithms aim to assess how well a given process model, typically represented by a Petri ...

Added: September 12, 2016

Proceedings of the XVIII International Conference DAMDID/RCDL’2016, October 11–14, 2016, Ershovo, Moscow Region, Russia

NRNU MEPhI, 2016.

In 2016 the International Conference “Data Analytics and Management in Data Intensive Domains” (DAMDID/RCDL’2016) was held on October 11–14 at the Holiday Center, Ershovo (Moscow Region). By tradition, the “Data Analytics and Management in Data Intensive Domains” conference (DAMDID) is planned as a multidisciplinary forum of researchers and practitioners from various domains of science and research, promoting ...

Added: January 26, 2017

2020 Global Smart Industry Conference (GloSIC)

IEEE, 2020.

Added: December 3, 2020

Array DBMS: Past, Present, and (Near) Future

Rodriges Zalipynis R. A., PROCEEDINGS OF THE VLDB ENDOWMENT 2021 Vol. 14 No. 12 P. 3186–3189

Array DBMSs strive to be the best systems for managing, processing, and even visualizing big N-d arrays. The last decade blossomed with R&D in array DBMS, making it a young and fast-evolving area. We present the first comprehensive tutorial on array DBMS R&D. We start from past impactful results that are still relevant today, then ...

Added: June 4, 2021

Modeling and optimization of educational processes, using a model of working with electronic educational resources as an example

Прокофьев Д. О., Starykh V., Информационные технологии 2015

This study investigates the main problems of automating and optimizing educational processes with the help of BPMS and Big Data. Questions concerning process modeling are raised, particularly those related to the integration of process-oriented and business analysis systems. The main goal of the study is to find a possible new way to implement the ideas of metadata ...

Added: October 9, 2015

Artificial Neural Networks in Pattern Recognition: 5th INNS IAPR TC 3 GIRPR Workshop, ANNPR 2012, Trento, Italy, September 2012, Proceedings

Berlin, Heidelberg: Springer, 2012.

Added: September 21, 2012

Proceedings 2018 Global Smart Industry Conference (GloSIC)

Chelyabinsk: IEEE, 2018.

The 2018 Global Smart Industry Conference is organized in order to exchange experience, promote discussion and presentation of research papers, and summarize results in development of innovative models, methods and technologies for the digital industry in universities, scientific and industrial associations of the Russian Federation as well as in foreign companies, and the experience of ...

Added: November 25, 2019

Big data in bioinformatics

Назипова Н. Н., Isaev E., Kornilov V. et al., Математическая биология и биоинформатика 2017 Vol. 12 No. 1 P. 102–119

Sequencing of the human genome began in 1994. It took 10 years of work by many research teams to obtain a draft sequence of human DNA. Modern sequencing technologies make it possible to obtain an individual's genome in a few days. The paper discusses the achievements of modern bioinformatics associated with the emergence of high-throughput sequencing platforms, which not only helped expand the capabilities of various areas of biology and other related ...

Added: March 3, 2017

2020 IEEE International Conference on Big Data (Big Data 2020)

IEEE, 2020.

The IEEE BigData conference series is sponsored by the IEEE Computer Society and attracts high-quality original research papers on various aspects of big data. This year, we received 535 full paper submissions from 2049 authors and co-authors of 58 countries. After a rigorous peer review process undertaken by the program committee members, 84 regular papers ...

Added: April 16, 2021

Basic data structures of the FCART decision support system

Parinov A., Научно-техническая информация. Серия 2: Информационные процессы и системы 2014

The article examines combinations of basic data structures for the local storage of the FCART decision support system and reports timing characteristics for large volumes of data. ...

Added: November 19, 2013

Service-Oriented Computing

Berlin, Heidelberg: Springer, 2013.

The proceedings of the 11th International Conference on Service-Oriented Computing (ICSOC 2013), held in Berlin, Germany, December 2–5, 2013, contain high-quality research papers that represent the latest results, ideas, and positions in the field of service-oriented computing. Since the first meeting more than ten years ago, ICSOC has grown to become the premier international forum ...

Added: March 21, 2014

Synthesis of an information system for managing the technical support subsystems of intelligent buildings

Vikentyeva O., Deryabin A. I., Shestakova L. V. et al., Вестник Московского государственного строительного университета 2017 Vol. 12 No. 10 P. 1191–1201

Subject: smart house maintenance requires taking into account a number of factors: resource conservation, reduction of operating costs, safety enhancement, and comfort of leisure and operation. Automation of engineering system networks such as illumination, climate control, security, and communication may be achieved with contemporary technologies (e.g. IoT, the Internet of Things). However, storing ...

Added: November 21, 2017

SIGMOD/PODS '21: Proceedings of the 2021 International Conference on Management of Data

NY: ACM, 2021.

The annual ACM SIGMOD/PODS Conference is a leading international forum for database researchers, practitioners, developers, and users to explore cutting-edge ideas and results, and to exchange techniques, tools, and experiences. The conference includes a fascinating technical program with research and industrial talks, tutorials, demos, and focused workshops. It also hosts a poster session to learn about innovative ...

Added: April 28, 2021

Big data and its applications in the electric power industry: from business analytics to virtual power plants

Krylov V., Крылов С. В., Moscow: Нобель Пресс, 2014.

Intended for students and specialists in information systems development, including systems for the electric power industry, for heads of enterprise IT departments, and for everyone who works on planning the development of the electric power industry or is simply interested in progress in this field. The book covers the area of data processing known as Big Data and describes its techniques and technologies. The main focus ...

Added: October 10, 2015

PROSPECTS OF TRANSFERRING THE LARGE VOLUMES OF RADIO ASTRONOMY DATA

Isaev E., Tarasov P. A., Odessa Astronomical Publications 2014 Vol. 27 No. 2 P. 72–73

Added: November 24, 2014

Architecture of a building network management system based on IoT devices

Vikentyeva O., Kychkin A., Deryabin A. I. et al., Датчики и системы 2018 No. 5 P. 32–38

This work considers the problem of designing the architecture of a network management system for a generic module of a modern automated building. To improve the efficiency of building operation given the large influx of data, the architecture of the network management system implements multicontour management of generic modules using cloud scenarios. Building operation ...

Added: July 19, 2018

A technique for automating completeness checks of technical reporting documentation

Klyshinskiy E., Kalachyov Y. B., Zhadnov V. V., Научно-техническая информация. Серия 2: Информационные процессы и системы 2014 No. 5 P. 11–15

A new method is considered for automating the assessment of whether a final report conforms to the technical specification during acceptance. The proposed method lets experts obtain a preliminary estimate of how well the report matches the technical specification. It extracts significant fragments of the technical specification, searches for the corresponding elements of the report, and checks the degree of coverage. Unlike, for example, the cosine similarity measure, the developed method gives a better separation of reports by the criterion ...

Added: June 30, 2014

Ontological situation models in computer-based assessment of foreign language knowledge

Demkin V. M., Sosnin A., Сусманова С. С., Онтология проектирования 2014 No. 3(13) P. 63–76

Discussed in the paper are modern approaches to the design of complicated intellectual computer systems assessing foreign language proficiency, e.g. checking students’ academic progress in a higher educational establishment. The paper provides insight into the means to develop ontology-based situation models in the tasks requiring that a person’s command of English be assessed, which is ...

Added: October 24, 2012

APPLICATION OF DEEP NEURAL NETWORKS TO THE CLASSIFICATION OF LARGE VOLUMES OF ASTRONOMICAL DATA

Gorbunov A. A., Isaev E., Samodurov V., Radio Physics and Radio Astronomy 2017 Vol. 22 No. 4 P. 270–275

Astronomical observations produce vast amounts of data. The BSA (Big Scanning Antenna) of LPI, used to study impulse phenomena, logs 87.5 GB of data daily (32 TB per year). Experts classified 83096 individual observations (over the study period from July 2012 to October 2013). Over 75% of the sample ...

Added: October 15, 2017