Cross-lingual Named Entity List Search via Transliteration

Khakhmovich A.; Pavlova S.; Kirillova K.; N. Arefyev; Savilova E.

?

Cross-lingual Named Entity List Search via Transliteration

P. 4247–4255.

Khakhmovich A., Pavlova S., Kirillova K., Arefyev N., Savilova E.

Out-of-vocabulary words are still a challenge in cross-lingual Natural Language Processing tasks, for which transliteration from source to target language or script is one of the solutions. In this study, we collect a personal name dataset in 445 Wikidata languages (37 scripts), train Transformer-based multilingual transliteration models on 6 high- and 4 less-resourced languages, compare them with bilingual models from (Merhav and Ash, 2018) and determine that multilingual models perform better for less-resourced languages. We discover that intrinsic evaluation, i.e comparison to a single gold standard, might not be appropriate in the task of transliteration due to its high variability. For this reason, we propose using extrinsic evaluation of transliteration via the cross-lingual named entity list search task (e.g. personal name search in contacts list). Our code and datasets are publicly available online.

Language: English

Text on another site

Keywords: transliteration artificial neural network sequence transduction

Publication based on the results of:

Development of Mathematical Models and Methods for Recommender Systems and Natural Language Processing (2020)

In book

Proceedings of The 12th Language Resources and Evaluation Conference

Vol. 12. , European Language Resources Association (ELRA), 2020.

Elements of Artificial Intellect based on Neuron Analysis through Clinical Trials Statistics and Various Models of Machine Learning

Cohen Y., Dekel B., Krouk E., , in: Biophysical Methods for Diagnosing Human Tissue Anomalies.: Cambridge: Cambridge Scholars Publishing, 2024. Ch. 10 P. 290–310.

In Chapter 10, we go through the components of artificial intelligence, emphasizing neural analysis derived from clinical trial statistics and various machine learning models. The chapter briefly outlines the neural configuration of artificial intelligence. Building on this foundation, several machine learning techniques for the statistical pre-processing of experimental data are explored. Data is analyzed using ...

Added: December 12, 2024

Reconstruction of neuromorphic dynamics from a single scalar time series using variational autoencoder and neural network map

Pavel V. Kuptsov, Nataliya V. Stankevich, Chaos, Solitons and Fractals 2025 Vol. 191 Article 115818

This paper examines the reconstruction of a family of dynamical systems with neuromorphic behavior using a single scalar time series. A model of a physiological neuron based on the Hodgkin–Huxley formalism is considered. Single time series of one of its variables is shown to be enough to train a neural network that can operate as ...

Added: November 30, 2024

Сопоставление подходов к арабско-русской транскрипции и транслитерации

Orlov I., Ориенталистика 2023 Т. 6 № 5 С. 1053–1066

To date, in the Russian-speaking space there has been no unified approach to Arabic-Russian transcription and transliteration (the term “transmission” is used hereinafter). Different groups of authors and researchers use different solutions for certain transmission tasks. Some authors, due to the lack of clear recognized guidelines, do not adhere to specific rules even within the ...

Added: March 12, 2024

Об управлении химическим составом сырьевого материала и режимом плавки для обеспечения требуемых механических свойств стальных изделий серийного производства

Yasnitsky L., Мезенцев А. С., Прикладная математика и вопросы управления 2023 № 3 С. 109–126

A The goal of the work is to create a mathematical model suitable for operational control of the strength characteristics of the resulting steel product in the conditions of serial steelmaking. Existing approaches based on the results of testing prototypes obtained in laboratory conditions are not suitable for this purpose, since in the conditions of serial steelmaking, the ...

Added: February 7, 2024

Computer Vision

Meshcheryakov R., Kataev M., Pantiukhin D., , in: Integral Robot Technologies and Speech Behavior.: Newcastle upon Tyne: Cambridge Scholars Publishing, 2024. Ch. 4 P. 130–154.

For a robot that functions and performs its mission, it is important to receive information from various sources that are increasingly being called sensors. When drawing an analogy between a robot and a living being, the same comparison can be made for obtaining information about the world from various sources. Often, a computer vision system ...

Added: December 10, 2023

Integral Robot Technologies and Speech Behavior

Kharlamov A. A., Pantiukhin D., Borisov V. et al., Newcastle upon Tyne: Cambridge Scholars Publishing, 2024.

The monograph presents papers on the subject domain “Integral robot. Speech behavior”. These cover issues of a theoretical nature, including representation and processing of speech information in the human mind in the process of both text analysis and text generation, and specifically the need to use jointly working linguistic and extralinguistic models of the world, ...

Added: December 1, 2023

GDP responses to supply chain disruptions in a post-pandemic era: Combination of DL and ANN outputs based on Google Trends

Shahzad U., Mohammed K. S., Schneider N. et al., Technological Forecasting and Social Change 2023 Vol. 192 P. 1–15

With the recent Russian-Ukraine conflict, the frequency and intensity of disruptive shocks on major supply chains have risen, causing increasing food and energy security concerns for regulators. That is, the combination of newly available sophisticated deep learning tools with real-time series data may represent a fruitful policy direction because machines can identify patterns without being ...

Added: November 28, 2023

Comment on “Pushing the frontiers of density functionals by solving the fractional electron problem”

Gerasimov I., Losev T., Evgeny Yu. Epifanov et al., Science 2022 Vol. 377 No. 6606 Article eabq3385

Kirkpatrick et al. (Reports, 9 December 2021, p. 1385) trained a neural network–based DFT functional, DM21, on fractional-charge (FC) and fractional-spin (FS) systems, and they claim that it has outstanding accuracy for chemical systems exhibiting strong correlation. Here, we show that the ability of DM21 to generalize the behavior of such systems does not follow ...

Added: September 25, 2022

APPROACH TO INTELLIGENT MONITORING OF CYBER ATTACKS

Nazarov A., Pantiukhin D., Voronkov I. M. et al., Synchroinfo Journal 2020 No. 6 P. 2–9

The results of many years of research on the subject of intellectual counteraction to cyberattacks are presented. Cloud solutions for the synthesis of the monitoring cluster of cyberattacks are based on the latest achievements with the use of neuronfuzzy formalism. The main features of the synthesis of protection functions are determined and the features of ...

Added: November 2, 2021

Intelligent Quality Management System for Casting Gas Turbine Engine Blades

Goldobin M. A., Morozov A. A., Okonechnikov D. V. et al., , in: Proceedings - 2020 2nd International Conference on Control Systems, Mathematical Modeling, Automation and Energy Efficiency, SUMMA 2020.: IEEE, 2020. P. 696–700.

Added: September 29, 2021

Intel OpenVINO Toolkit for Computer Vision: Object Detection and Semantic Segmentation

Zunin V., , in: 2021 International Russian Automation Conference (RusAutoCon).: IEEE, 2021. P. 847–851.

Added: September 20, 2021

Real-time Object Detection with FPGA Using CenterNet

Solovyev R. A., Telpukhov D. V., Romanova I. I. et al., , in: Proceedings of the 2021 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering, ElConRus 2021.: IEEE, 2021. P. 2029–2034.

The paper proposes methodology for transferring architecture of modern neural network CenterNet to FPGA. CenterNet is a OneStage object detector that is used to detect and locate objects in images. Although this neural network has simple decoder, it shows good performance in terms of accuracy. Very high operation speed of the neural network hardware is ...

Added: August 8, 2021

Development of Telecommunication System Units in Parallel Neural Network Basis

Sakhnyuk P., , in: 1st International Conference on Innovative Informational and Engineering Technologies (IIET-2020) 28-29 May 2020, Stavropol, Russian FederationVol. 873.: Bristol: IOP Publishing, 2020. P. 012019-1–012019-8.

The article suggests the integration of a neural network as a parallel element base in a telecommunication system. In this case, the ability to learn or adapt to external conditions is applied as the main advantage. For telecommunication systems in conditions when it is possible, this ability will improve noise immunity, reliability, operability, etc. The ...

Added: January 26, 2021

Artificial neural network in predicting cancer based on infrared spectroscopy

Cohen Y., Zilberman A., Zion Dekel B. et al., , in: Intelligent Decision Technologies. Proceedings of the 12th KES International Conference on Intelligent Decision Technologies (KES-IDT 2020)Vol. 193.: Singapore: Springer, 2020. P. 141–153.

In this work, we present a Real-Time (RT), on-site, machine-learning-based methodology for identifying human cancers. The presented approach is reliable, effective, cost-effective, and non-invasive method, which is based on Fourier Transform Infrared (FTIR) spectroscopy—a vibrational method with the ability to detect changes as a result of molecular vibration bonds using Infrared (IR) radiation in human ...

Added: October 29, 2020

Morphological segmentation with sequence to sequence neural network

Arefyev, N.V., Gratsianova T. Y., Popov K., , in: Computational Linguistics and Intellectual Technologies. International Conference "Dialogue 2018" Proceedings.: M.: Conference Proceedings Editorial board, 2018. P. 85–95.

Morphological segmentation is an important task of natural language processing as it can significantly improve the processing of unfamiliar and rare words in different tasks that involve text data. In this paper we present datasets in English and Russian for learning and evaluating morphological segmentation algorithms, demonstrate the method based on the sequence to sequence ...

Added: October 9, 2020

О приоритете советской науки в области нейроинформатики

Yasnitsky L., Нейрокомпьютеры: разработка, применение 2019 Т. 21 № 1 С. 6–8

Based on the analysis of literary sources, it is shown that the author of the first algorithm for training multilayer neural networks, which brought artificial intelligence out of the crisis of the 1950s-1970s, was Professor Alexander Galushkin. ...

Added: November 15, 2019

Application of Natural Language Processing Algorithms to the Task of Automatic Classification of Russian Scientific Texts

Romanov A., Lomotin Konstantin, Kozlova Ekaterina, Data Science Journal 2019 Vol. 18 No. 1 P. 1–17

This work is devoted to the study of applicability of modern methods of machine learning to the task of automatic classification of scientific articles and abstracts. For this purpose, the study of such models of machine learning as artificial neural networks, random forest, logistic regression, and support vector machine (with taking into account such a ...

Added: August 25, 2019

Artificial Neural Networks Implementing Maximum Likelihood Estimator for Passive Radars

Shevgunov T., Efimov E., , in: Artificial Intelligence and Algorithms in Intelligent Systems* 2. Vol. 764: Proceedings of 7th Computer Science On-line Conference 2018.: Springer, 2019. P. 144–153.

This paper introduces the maximum likelihood estimator (MLE) based on artificial neural network (ANN) for a fast computation of the bearing that indicates the direction to the source of the electromagnetic wave received by a passive radar system equipped with an array antenna. Authors propose the cascade scheme for ANN training phase where the network ...

Added: February 5, 2019

Проектирование нейросетевых моделей для обнаружения вторжений с использованием общедоступных баз данных

Абашев М. А., Никитина Е. Ю., Суворова В. et al., Нейрокомпьютеры: разработка, применение 2018 № 8 С. 30–37

The article presents a method of constructing a neural network model for detecting and classifying network intrusions using information about attacks contained in the databases of KDD Cup 1999 Data and UNSW-NB-15. Various variants of neural networks with a full and reduced set of input parameters are constructed, a comparative analysis with similar models is ...

Added: January 10, 2019