Audio-Visual Speech Recognition In-The-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-Based Method

P. 8195–8199.

Axyonov Alexandr, Ryumin Dmitry, Ivanko D., Kashevnik A., Karpov A.

In book

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)

IEEE, 2024.

Spatiotemporal dynamics in a network of modified Morris–Lecar neurons with nonlinear magnetic flux diffusion

Сералан В., S L. L., Kunchala S. B. et al., European Physical Journal: Special Topics 2025 Vol. 234 P. 1073–1091

Added: 15 October 2025

Rhythm-based hierarchical predictive computations support acoustic–semantic transformation in speech processing

Догонашева О. А., Doelling K., Захаров Д. Г. et al., Nature Computational Science 2025 Vol. 5 P. 915–926

How humans manage to understand speech despite distortions has long attracted the attention of researchers. One leading hypothesis is that multiple endogenous brain rhythms form a computational context for predicting the structure and content of speech. However, it remains unclear how neural processes could implement the formation of such a rhythm-based context. In this work, we ...

Added: 2 September 2025

Causes in neuron diagrams, and testing causal reasoning in Large Language Models. A glimpse of the future of philosophy?

Вервурт Л. П., Journal for General Philosophy of Science 2025

Added: 26 August 2025

ISCA International Conference INTERSPEECH

International Speech Communication Association (ISCA), 2024.

Added: 6 March 2025

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)

IEEE, 2024.

Added: 6 March 2025

OCEAN-AI framework with EmoFormer cross-hemiface attention approach for personality traits assessment

Elena Ryumina, Markitantov M., Dmitry Ryumin et al., Expert Systems with Applications 2024 Vol. 239 P. 0

Added: 6 March 2025

Audio-visual speech recognition based on regulated transformer and spatio–temporal fusion strategy for driver assistive systems

Dmitry Ryumin, Alexandr Axyonov, Elena Ryumina et al., Expert Systems with Applications 2024 Vol. 252 Article 124159

Added: 6 March 2025

High-speed optical-waveguide integrated single-walled carbon nanotube bolometer

An P. P., V. V. Kovalyuk, Y. G. Gladush et al., Applied Physics Letters 2024 Vol. 125 No. 20 Article 201101

Added: 11 November 2024

2024 IEEE 18th International Conference on Application of Information and Communication Technologies (AICT 2024), 25–27 September 2024, Turin, Italy

Turin: Institute of Electrical and Electronics Engineers, 2024.

Added: 7 November 2024

Analyzing the Robustness of Vision & Language Models

Ширнин А. А., Andreev N., Potapova S. et al., IEEE/ACM Transactions on Speech and Language Processing 2024 Vol. 32 P. 2751–2763

We present an approach to evaluate the robustness of pre-trained vision and language (V&L) models to noise in input data. Given a source image/text, we perturb it using standard computer vision (CV) / natural language processing (NLP) techniques and feed it to a V&L model. To track performance changes, we explore the problem of visual ...

Added: 19 July 2024

2023 Seminar on Signal Processing

IEEE, 2023.

Added: 10 February 2024

10th International Conference, PReMI 2023, Kolkata, India, December 12–15, 2023, Proceedings. Pattern Recognition and Machine Intelligence. LNCS, volume 14301

Cham: Springer, 2023.

Added: 29 November 2023

ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4-10 June 2023

IEEE, 2023.

Added: 5 November 2023

InterSpeech 2023. Dublin, Ireland, 20-24 August 2023

International Speech Communication Association, 2023.

Added: 5 November 2023

2023 IEEE 17th International Conference on Application of Information and Communication Technologies (AICT)

Baku: IEEE, 2023.

Added: 4 November 2023

InterSpeech 2022

International Speech Communication Association, 2022.

Added: 31 October 2022

2022 IEEE 16th International Conference on Application of Information and Communication Technologies (AICT)

Washington: IEEE, 2022.

Added: 29 October 2022

Self-supervised recurrent depth estimation with attention mechanisms

Макаров И. А., Bakhanova M., Nikolenko S. et al., PeerJ Computer Science 2022 Vol. 8 Article e865

Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced ...

Added: 1 February 2022

Embedded ArUco: a novel approach for high precision UAV landing

Khazetdinov A., Zakiev A., Tsoy T. et al., in: 2022 International Siberian Conference on Control and Communications (SIBCON). IEEE, 2022. Ch. 9438855.

Added: 11 October 2021