MuMMy: Multimodal Dataset supporting VLM-based Egyptology Research Assistant
P. 12875–12881.
Golyadkin M., Innokentiy Humonen, Rubanova V., Kalin D., Plevokas I., Nikolotov D., Utkov A., Sidelnikov N., Ivanov P., Bureeva E., Ekaterina Alexandrova, Makarov I.
We present MuMMy, the first multimodal dataset for developing research assistants that can interpret Egyptian hieroglyphic texts. It pairs images with Gardiner codes, transliteration, and English translation at two levels of granularity. We also evaluate several deep learning pipelines across OCR, transliteration, and translation tasks, revealing the complexity of the domain and the challenges posed by error accumulation.
In book
Association for Computing Machinery (ACM), 2025.
Sviridov I., Miftakhova A., Tereshchenko A. et al., in: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, 2025. Ch. 1353. P. 26625–26665.
Though Large Vision-Language Models (LVLMs) are being actively explored in medicine, their ability to conduct complex real-world telemedicine consultations combining accurate diagnosis with professional dialogue remains underexplored. This paper presents 3MDBench (Medical Multimodal Multi-agent Dialogue Benchmark), an open-source framework for simulating and evaluating LVLM-driven telemedical consultations. 3MDBench simulates patient variability through temperament-based Patient Agent and evaluates diagnostic accuracy and dialogue quality ...
Added: November 16, 2025
Golyadkin M., Humonen I., Plevokas Y. et al., in: SIGGRAPH Posters '25: Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference Posters. Vancouver, British Columbia, Canada. August 10–14, 2025. Association for Computing Machinery (ACM), 2025. Ch. 39.
We introduce a pipeline for interpreting Ancient Egyptian hieroglyphic texts that combines OCR, transliteration, and translation. Designed for low-resource data, our system improves accessibility for learners and efficiency for researchers. We evaluate its performance on a new, diverse dataset reflective of real-world conditions. ...
Added: November 8, 2025
Baimuratov I., Karpovich A., Lisanyuk E. et al., in: JCDL '24: Proceedings of the 24th ACM/IEEE Joint Conference on Digital Libraries. NY: Association for Computing Machinery (ACM), 2024. Ch. 6.
Peer review is a cornerstone of the academic editorial decision-making process, yet it faces significant challenges. Artificial intelligence can help address these challenges, but its use raises concerns about reliability and the potential for reproducing existing biases. In this research, we employ a formal argumentation-theoretic framework that allows for explicit analysis of arguments and their ...
Added: May 29, 2025
Sergeev A., Minchenkov V., Soldatov A. et al., ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING 2025 No. 1 P. 3344–3355
The automatic monitoring of manual assembly processes in production settings increasingly relies on advanced technologies, including computer vision models. These models are designed to detect and classify events such as the presence of components in an assembly area and the connection of these components. However, a significant challenge for detection and classification algorithms is their vulnerability ...
Added: April 2, 2025
Derkach D., Artemev M., IEEE, 2025.
Added: April 1, 2025
Dmitry Ryumin, Alexandr Axyonov, Elena Ryumina et al., Expert Systems with Applications 2024 Vol. 252 Article 124159
This article presents a research methodology for audio–visual speech recognition (AVSR) in driver assistive systems. These systems necessitate ongoing interaction with drivers while driving through voice control for safety reasons. The article introduces a novel audio–visual speech command recognition transformer (AVCRFormer) specifically designed for robust AVSR. We propose (i) a multimodal fusion strategy based on ...
Added: March 6, 2025
Copenhagen, Denmark: CEUR Workshop Proceedings, 2021.
The second workshop on Crowd Science is organized in conjunction with the 47th International Conference on Very Large Data Bases (VLDB 2021). This workshop is the second in a series of events that has the goal of helping crowdsourcing “transition” from art to science, and tackles the research challenges that we face to make crowdsourcing ...
Added: December 13, 2021
Pletenev S. A., in: Computational Linguistics and Intellectual Technologies: Proceedings of the Annual International Conference "Dialogue" (Moscow, June 16–19, 2021). Issue 20. Russian State University for the Humanities, 2021.
Added: December 13, 2021
Andrey Malinin, Gales M., in: Proceedings of the 9th International Conference on Learning Representations (ICLR 2021). ICLR, 2021. P. 1–31.
Added: November 1, 2021
Ryabinin M., Malinin A., Gales M., in: Advances in Neural Information Processing Systems 34 (NeurIPS 2021). Curran Associates, Inc., 2021. P. 6023–6035.
Added: October 31, 2021
Kikot S., Kurucz A., Podolskii V. V. et al., in: PODS'21: Proceedings of the 40th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems. NY: Association for Computing Machinery (ACM), 2021. P. 370–387.
Added: September 8, 2021
Zykov S. V., Alexandrov D., Maqsudjon Ismoilov et al., in: Intelligent Decision Technologies: Proceedings of the 13th KES-IDT 2021 Conference. Vol. 238. Singapore: Springer, 2021. P. 333–342.
The idea of code quality assessment has been well known for a long time; class connectivity metrics were proposed by the community several years ago but have not become generally applicable practice in industrial programming. The objective of the study, part of which we present in this paper, is to critically analyze the metrics available today: ...
Added: August 5, 2021
Association for Computational Linguistics, 2019.
This document describes the findings of the Third Workshop on Neural Generation and Translation, held in concert with the annual conference of the Empirical Methods in Natural Language Processing (EMNLP 2019). ...
Added: January 7, 2021
CEUR Workshop Proceedings, 2020.
The International Conference “Internet and Modern Society” (IMS-2020) was initially planned to take place in St. Petersburg, Russia. Due to the spread of COVID-19 and the ban on public events, the conference was held during 17-20 June 2020 in the format of online sessions with a discussion of papers and presentations uploaded in advance. The ...
Added: November 1, 2020
Zhukov L. E., Sukharev J., Popescul A., in: Proceedings of 14th International Conference on Data Mining (ICDM 2014). NY: IEEE Computer Society, 2014. P. 995–1000.
Record linkage, or entity resolution, is an important area of data mining. Name matching is a key component of systems for record linkage. Alternative spellings of the same name are a common occurrence in many applications. We use the largest collection of genealogy person records in the world together with user search query logs to ...
Added: March 18, 2015