Faster variational inducing input Gaussian process classification

Izmailov P.; D. Kropotov

doi:10.21469/22233792.3.1.02

Journal of machine learning and data analysis. 2017. Vol. 3. No. 1. P. 20–35.

Izmailov P., Kropotov D.

Background: Gaussian processes (GP) provide an elegant and effective approach to learning in kernel machines. This approach leads to a highly interpretable model and allows using the Bayesian framework for model adaptation and for incorporating prior knowledge about the problem. The GP framework has been successfully applied to regression, classification, and dimensionality reduction problems. Unfortunately, the standard methods for both GP regression and GP classification scale as O(n^3), where n is the size of the dataset, which makes them inapplicable to big data problems. A variety of methods have been proposed to overcome this limitation for both regression and classification. The most successful recent methods are based on the concept of inducing inputs. These methods reduce the computational complexity to O(nm^2), where m is the number of inducing inputs, with m typically much smaller than n. The present authors focus on classification. The current state-of-the-art method for this problem is based on stochastic optimization of an evidence lower bound (ELBO) that depends on O(m^2) parameters. For complex problems, the required number of inducing points m is fairly large, which makes the optimization in this method challenging.

Methods: The structure of the variational lower bound that appears in inducing input GP classification has been analyzed. First, it has been noted that, by using a quadratic approximation of several terms in this bound, it is possible to obtain analytical expressions for the optimal values of most of the optimization parameters, thus significantly reducing the dimension of the optimization space. Then, two methods have been provided for constructing the necessary quadratic approximations: one is based on the Jaakkola–Jordan bound for the logistic function, and the other is derived using a Taylor expansion.

Results: Two new variational lower bounds have been proposed for inducing input GP classification that depend on a reduced number of parameters. Then, several methods have been suggested for optimizing these bounds, and the resulting algorithms have been compared with the state-of-the-art approach based on stochastic optimization. Experiments on several classification datasets show that the new methods perform as well as or better than the existing one. Moreover, the new methods do not require any tunable parameters and work across a wide range of n and m values, thus significantly simplifying the training of GP classification models.
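The abstract names the Jaakkola–Jordan bound without stating it. As a minimal sketch, assuming the standard textbook form of that bound, log σ(x) ≥ log σ(ξ) + (x − ξ)/2 − λ(ξ)(x² − ξ²) with λ(ξ) = tanh(ξ/2)/(4ξ), the snippet below shows why it yields a quadratic approximation of the logistic term. The function names are illustrative and are not taken from the paper; this is not the authors' optimization algorithm, only the bound itself.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log of the logistic function sigma(x) = 1 / (1 + exp(-x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def jj_lambda(xi: float) -> float:
    """Coefficient lambda(xi) = tanh(xi / 2) / (4 * xi); its limit at xi = 0 is 1/8."""
    if abs(xi) < 1e-8:
        return 0.125
    return math.tanh(xi / 2.0) / (4.0 * xi)


def jj_lower_bound(x: float, xi: float) -> float:
    """Quadratic-in-x lower bound on log sigma(x); xi is the variational parameter."""
    return log_sigmoid(xi) + (x - xi) / 2.0 - jj_lambda(xi) * (x * x - xi * xi)


if __name__ == "__main__":
    # The bound is tight at x = xi and lies below log sigma(x) elsewhere.
    for x in (-3.0, -1.0, 0.0, 1.0, 3.0):
        print(x, log_sigmoid(x), jj_lower_bound(x, xi=1.0))
```

Because the bound is quadratic in x for a fixed ξ, an expectation of it under a Gaussian distribution over x can be computed in closed form, which is the kind of property the Methods paragraph relies on.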

Research target: Computer Science

Priority areas: IT and mathematics

Keywords: machine learning, optimization, variational inference, optimization algorithm, Gaussian processes

Total conditional complexity of certain objects

Vereshchagin N., Information and Computation 2026 Vol. 308 P. 1–12

The fine approach to measure information dependence is based on the total conditional complexity CT( y |x), which is defined as the minimal length of a total program that outputs y on the input x. It is known that the total conditional complexity can be much larger than the plain conditional complexity. Such strings x, y are defined ...

Added: February 14, 2026

How to predict bank defaults: the evolution of methods, models, and risk factors

Shchepeleva M., Столбов М. И., Экономика и математические методы 2026 Vol. 62 No. 1 P. 63–77

Predicting bank defaults is an important task for the entire economy. Early identification of troubled banks helps to prevent impending bank failures or minimize the losses associated with them. The paper discusses the state of the art of instrumental methods and data used for this purpose. The theoretical background, the evolution of methodological approaches used ...

Added: February 13, 2026

Diffusion models for synthetic tabular data generation

Hushchyn M., Telesheva E., Doklady Mathematics 2025 No. 527 P. 388–399

The problem of generating high-quality synthetic data is crucial for many data science tasks. A generated dataset can cut the costs on the augmentation of the existing data with additional instances, for example, in physics, or help with its privacy protection, for instance, in banking. However, generating a tabular dataset is challenging, as the data ...

Added: February 12, 2026

Real-Bogus Classification for ZTF Data Releases: Two Approaches

Semenikhin n., Kornilov M., Lavrukhina A. et al., Communications in Computer and Information Science 2026 Vol. 2641 P. 211–219

We considered two fundamentally different approaches to real-bogus classification within the Zwicky Transient Facility survey data. The first approach is based on neural networks that take sequences of object images as input. The second approach uses features extracted from light curves and classical machine learning methods. Several models for both approaches were tested. Quality metrics ...

Added: February 12, 2026

Problems of reliability of user ratings and reviews on marketplaces: a systems approach

Полежаева Я. В., Popov V., Бизнес-информатика 2025 Vol. 19 No. 24 P. 26–41

User ratings and reviews on marketplaces are subject to systematic distortions, creating serious risks for e-commerce participants and reducing the efficiency of market mechanisms. This study presents a comprehensive analysis of information distortion problems, covering the process from rating formation to its systematic accounting. The aim of the work is to systematize factors of information distortion on marketplaces and ...

Added: February 11, 2026

Development of a Language Model for Automated Classification of English-Language Scientific Articles by SRSTI Codes

Zunin V., Afonin A. I., Anoshin V. I. et al., Automatic Documentation and Mathematical Linguistics 2025 Vol. 5 No. 59 P. 287–293

The development of an artificial intelligence-based language model for classifying English-language scientific articles by SRSTI codes is described. This improves the processes of reviewing and indexing scientific publications. A pre-processed dataset of scientific articles was used for training and testing the models. An architecture for cascade classification was developed, and the performance of models with ...

Added: February 11, 2026

Generation of Synthesizable Verilog Code From Natural Language Specifications

Yashchenko D. S., Romanov A., Ziazetdinov A.A. et al., IEEE Access 2026 Vol. 14 P. 4990–5001

This study presents a method for generating synthesizable Verilog code for digital integrated circuits directly from natural-language specifications. The approach combines large language models with parameter-efficient fine-tuning (specifically, Low-Rank Adaptation and Quantized Low-Rank Adaptation) together with a specialized corpus of specification-code pairs that covers common design patterns and varying task complexity. The pipeline includes automated ...

Added: February 11, 2026

Application of MIMO technology in wideband millimeter range wireless communications systems

Tiraspolsky S.A., Ermolayev V. T., Flaksman A. G. et al., Radioelectronics and Communications Systems 2011 Vol. 54 P. 219–226

A concept of using MIMO technology in millimeter range wireless communications systems with orthogonal frequency division multiplexing is considered. The concept is based on dividing transmitting and receiving multi-element antenna arrays into separate sub-arrays with analogue radiation pattern shaping and on using two most powerful space sub-channels for information transmission. Sequence and structure of transmitted ...

Added: February 10, 2026

mmWave SVD-based beamformed MIMO communication systems

Sergey Tiraspolsky, Jeon B., Kim J. et al., Proceedings of the 7th IEEE conference on Consumer communications and networking (CCNC’2010) 2010 P. 834–838

This paper presents the concept of a data transmission protocol for millimeter wave (mmWave) wireless systems operating in a Non-Line-of-Sight environment. The concept is designed to provide effective and practical operation of a Multiple-Input Multiple-Output (MIMO) transmission mode that exploits a combination of Singular Value Decomposition (SVD) of the channel matrix and non-adaptive beamforming. The proposed protocol reduces complexity of ...

Added: February 10, 2026

Selective interference cancellation using Kalman filtering

Tiraspolsky S., Rubtsov A., Pudeyev A. et al., Proceedings of the 2006 3rd International Symposium on Wireless Communication Systems, IEEE 2006 P. 21–24

In the present paper we investigate a co-channel interference cancellation technique based on tracking only a limited number of the strongest interferers. Under the assumption of synchronous base station operation with overlapping but different training signals (pilots), Kalman filtering may be used for interfering channel estimation and further calculation of the interference correlation matrix. This correlation ...

Added: February 10, 2026

Mobile WiMAX - Deployment Scenarios Performance Analysis

Tiraspolsky S., Maltsev A., Rubtsov A. et al., Proceedings of the 2006 3rd International Symposium on Wireless Communication Systems, IEEE 2006 P. 353–357

In this paper, dynamic system level simulation methodology of mobile WiMAX (IEEE Std 802.16e) is described. The system level simulations scenarios (channel models, pathloss and shadow fading, sectorization, frequency reuse planning, system loading, etc) will be introduced. Evaluated performance of mobile WiMAX system such as signal-to-interference + noise ratio distributions, spectral efficiency and system outage ...

Added: February 10, 2026

Efficiency of the Grassmannian beamforming scheme in MIMO communication systems

Тираспольский С.А., Червяков А. В., Труды Научной конференции по радиофизике, ННГУ, 2004 2004 P. 169–171

Beamforming in MIMO (multiple-input multiple-output) systems, which simultaneously use several transceivers at both ends of the communication link, is a fairly simple way to increase throughput and raise the SNR at the receiving end. To achieve this, most previously proposed methods required the transmitter to know the channel matrix or part of its SVD decomposition, which places a significant load on ...

Added: February 10, 2026

High-resolution capability of adaptive antenna arrays for communication systems

S.A. Tiraspolsky, Serebryakov G. V., Журнал радиоэлектроники 2002 No. 7

In this paper we compare different geometric configurations of adaptive antenna arrays for communications with the purpose of estimating the directions of arrival (DOA) of several external signals. The investigated antenna configurations have four elements and an array size of eleven wavelengths. The best high-resolution algorithm and the best array configuration are determined by numerical simulations. ...

Added: February 10, 2026

Application of adaptive antenna arrays to increase the data transmission rate

С.А.Тираспольский, Ермолаев В. Т., Флаксман А. Г. et al., Труды Научной конференции по радиофизике, ННГУ, 2002 2002 P. 22–28

This paper considers the principle of information transmission and theoretically investigates the capacity of a MIMO system under a random radio-wave propagation channel; various algorithms for distributing transmitter power over parallel orthogonal spatial subchannels are discussed. ...

Added: February 10, 2026

Multiple adaptive recursive array for multipath environment

S. Tiraspolsky, Sellone F., Serebryakov G., Proceedings of the International Conference on Electromagnetics in Advanced Applications (ICEAA 01) 2001 P. 691–696

In a wireless communication system, signals sent into the channel interact with the environment in a very complex way. Thereby transmitted signals may be subject to many forms of degradation among which there are causes of multipath propagation: • Reflections due to obstacles with the size greater than a wavelength; • Refractions due to the ...

Added: February 10, 2026

Efficiency of linear signal processing in communication systems under a multipath ionospheric channel in the decameter band

Тираспольский С.А., Флаксман А. Г., Ермолаев В. Т. et al., Известия высших учебных заведений. Радиоэлектроника 2016 No. 1 P. 8–14

Decameter-band communication systems operating under a multipath ionospheric spatial channel are considered. Using physical-layer simulation, the main system characteristics (bit and block error probability, throughput) are investigated. It is shown that in a frequency-selective channel with a 3 kHz bandwidth, a linear equalization algorithm provides highly efficient suppression of intersymbol interference for all data rates except the highest. ...

Added: February 10, 2026

UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms

Belomestny D., Levin I., Naumov A. et al., Journal of Optimization Theory and Applications 2026 Vol. 208 Article 89

Policy evaluation is an important instrument for the comparison of different algorithms in Reinforcement Learning (RL). However, even a precise knowledge of the value function Vπ corresponding to a policy π does not provide reliable information on how far the policy π is from the optimal one. We present a novel model-free upper value iteration ...

Added: February 10, 2026

Fundamentals of Computer Graphics

Korolev D., St. Petersburg: Лань, 2026.

The textbook consists of four sections covering the physical foundations, analog-to-digital conversion of graphics, graphics and video compression, and input and output devices for graphical information; the book follows the structure and content of the theoretical part of the course. The main approach is to systematize school-level knowledge and form a coherent picture of working with graphics and video "from the inside". Various examples demonstrate elegant engineering solutions in ...

Added: February 7, 2026

Multimodal graph, surface, and language-based model for protein protein interaction prediction

Arteaga Moreano B. D., Poptsova M., Scientific Reports 2026 No. 16 Article 4772

Accurate prediction of protein-protein interactions (PPIs) is fundamental to understanding biological processes and disease mechanisms. While deep learning offers a powerful alternative to costly experimental methods, existing approaches often overlook critical protein-surface information and rely on simplistic feature fusion techniques, thereby limiting performance. To address this, we introduce GSMFormer-PPI, a novel multimodal framework that integrates ...

Added: February 4, 2026

Algorithmic complexity of theories with Kleene iteration

Kuznetsov S., Успехи математических наук 2026 Vol. 81 No. 1(487) P. 137–204

Kleene iteration (the Kleene star) is one of the most interesting algebraic operations arising in theoretical computer science. The study of structures with this operation, namely Kleene algebras and their extensions, begins with the classical notion of regular expressions, which define formal languages. Later, so-called action algebras (V. Pratt, 1991; D. Kozen, 1994), or Kleene algebras with divisions, were introduced. In these structures the Kleene star is combined with divisions compatible with the partial order (such ...

Added: February 4, 2026

SMMR: Sampling-Based MMR Reranking for Faster, More Diverse, and Balanced Recommendations and Retrieval

Liakhnovich K., Lashinin O., Babkin A. et al., Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval 2025 P. 2754–2758

Relevance and diversity are critical objectives in modern information retrieval (IR), particularly in recommender systems. Achieving a balance between relevance (exploitation) and diversity (exploration) optimizes user satisfaction and business goals such as catalog coverage and novelty. While existing post-processing reranking methods address this trade-off, they usually rely on greedy strategies, leading to suboptimal outcomes for ...

Added: February 3, 2026

30th International Conference on Applications of Natural Language to Information Systems, NLDB 2025, Kanazawa, Japan, July 4–6, 2025, Proceedings, Part I. Natural Language Processing and Information Systems. (LNCS, volume 15836)

Springer, 2025.

The two-volume set LNCS 15836 and 15837 constitutes the proceedings of the 30th International Conference on Applications of Natural Language to Information Systems, NLDB 2025, held in Kanazawa, Japan, during July 4–6, 2025. The 33 full papers, 19 short papers and 2 demo papers presented in this volume were carefully reviewed and selected from 120 submissions. ...

Added: February 3, 2026

Measuring Chemical LLM robustness to molecular representations: a SMILES variation-based framework

Tutubalina E., Храбров К., Ганеева В. et al., Journal of Cheminformatics 2025 No. 17 Article 164

The recent integration of natural language processing into chemistry has advanced drug discovery. Molecule representations in language models (LMs) are crucial to enhance chemical understanding. We explored the ability of models to match the same chemical structures despite their different representations. Recognizing the same substance in different representations is an important component of emulating the ...

Added: February 3, 2026

A Clustering Model for Stocks that Considers Hidden Dynamics and Price Trajectory

Morychev G., Sizykh D., Sizykh N., IEEE Access 2025 Vol. 13 P. 213194–213210

One of the main tools for analyzing large volumes of financial data is the use of clustering methods and models, which allow the identification of various patterns. This study examines the problem of clustering time series that reflect the behavior of prices, yields, modes, trends, and a number of related stock indicators. The relevance and ...

Added: February 3, 2026