Catalyst: Combining Co-training and Active Learning for Lifelong Classification

Ryndin M., Turdakov D. Y., Kuznetsov S. D.

doi:10.1109/ISPRAS51486.2020.00021

P. 96–101.

Modern supervised algorithms assume that the training dataset has the same distribution as the data to be processed. However, real data changes constantly. This leads to the gradual degradation of supervised machine learning algorithms in production systems and increases the cost of maintaining them. To solve this problem, we focus on domain adaptation of machine learning algorithms in a lifelong manner. We assume that real unlabelled data arrives continuously. For this setting, we propose a method for detecting changes in data distributions and for updating supervised algorithms. The idea behind the method is to process a portion of the data and create a new labelled dataset for training a supervised model. The trained model becomes part of the ensemble used to select a strategy for dealing with new examples: assign the label automatically using co-training, or manually with the aid of active learning. The method is independent of the specific model architecture and can be used with any modern supervised algorithm, including artificial neural networks. Our research also confirms two findings. First, adding a small portion of data with reliable labels to a self-labelled dataset improves the model's performance, even if this portion is too small to build a model from scratch. Second, accumulating domain knowledge by continuously adding newly trained models to the ensemble used for labelling reduces the amount of labelled data required while maintaining the high performance of the adapted model.
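
The routing step described in the abstract (label automatically via the ensemble, or ask a human) can be illustrated with a minimal Python sketch. The abstract does not state the actual selection criterion, so the majority-vote agreement threshold used here is an assumption, and the names `route_example`, `build_labelled_batch`, `min_agreement`, and `oracle` are hypothetical. It is not the authors' implementation.

```python
from collections import Counter
from typing import Callable, Hashable, List, Optional, Sequence, Tuple

# A classifier is modelled as any callable mapping an example to a label.
Classifier = Callable[[object], Hashable]


def route_example(
    x: object,
    ensemble: Sequence[Classifier],
    min_agreement: float = 0.8,
) -> Tuple[Optional[Hashable], str]:
    """Decide how to label one example.

    Assumption: the ensemble "agrees" when the share of models voting
    for the most common label reaches min_agreement. In that case the
    label is assigned automatically (co-training style); otherwise the
    example is sent for manual labelling (active learning style).
    """
    if not ensemble:
        # No accumulated models yet: everything goes to the annotator.
        return None, "manual"
    votes = Counter(model(x) for model in ensemble)
    label, count = votes.most_common(1)[0]
    if count / len(ensemble) >= min_agreement:
        return label, "auto"
    return None, "manual"


def build_labelled_batch(
    batch: Sequence[object],
    ensemble: Sequence[Classifier],
    oracle: Classifier,
    min_agreement: float = 0.8,
) -> Tuple[List[Tuple[object, Hashable]], int]:
    """Label a batch and count how many labels the oracle had to supply."""
    labelled = []
    n_manual = 0
    for x in batch:
        label, source = route_example(x, ensemble, min_agreement)
        if source == "manual":
            label = oracle(x)  # hypothetical human annotator
            n_manual += 1
        labelled.append((x, label))
    return labelled, n_manual


if __name__ == "__main__":
    # Toy ensemble: three threshold classifiers over numbers in [0, 1].
    ensemble = [
        lambda x: int(x > 0.4),
        lambda x: int(x > 0.5),
        lambda x: int(x > 0.6),
    ]
    labelled, n_manual = build_labelled_batch(
        [0.1, 0.45, 0.55, 0.9], ensemble, oracle=lambda x: int(x > 0.5),
        min_agreement=1.0,
    )
    print(labelled, n_manual)
```

In a lifelong loop of the kind the abstract describes, a model trained on the resulting batch would then be appended to `ensemble`, so that later batches are routed with more accumulated domain knowledge.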

Language: English

Keywords: active learning, lifelong learning, co-training, feature drift, concept drift

In book

Proceedings of the Ivannikov ISPRAS Open Conference, 2–3 December 2020 Moscow

Los Alamitos: IEEE Computer Society, 2020.

City Human Potential Ranking 2023. Methodology and Results of Comparative Analysis of 100 BRICS+ Development Leading Cities

Cyril Ilnicki, Raj A., del Rio M. D. et al., Mexico: Universidad panamericana, 2024.

The study, developed by an international consortium, presents the inaugural ranking of human potential in BRICS+ cities, based on UNESCO methodology. The ranking analyzes 100 development-leading cities from 32 BRICS+ countries according to their capacity to develop and realize human potential. The methodology evaluates two interrelated perspectives: the quality of population potential development and the ...

Added: November 26, 2025

Cross-Cultural Differences in the Motivation of Older People Toward Education and Training

Ilya A. Korshunov, Natalia N. Shirkova, Yun-Xiang Y., Population and Economics 2025 Vol. 9 No. 2 P. 31–53

In recent years, education has become one of the primary means of promoting social inclusion among older adults, a demographic group whose proportion in the global population is steadily increasing. However, the degree and nature of participation in educational activities among older individuals vary significantly across countries. Classical motivational theories, psychological frameworks, and theories of ...

Added: July 15, 2025

Generative AI-based Approach to Concept Drift Generation in Streaming Text Data

Belov B., Peter Panfilov, WSEAS Transactions on Information Science and Applications 2025 Vol. 22 P. 11–20

Real-time analysis of text streams is crucial for industrial and business processes and scenarios. It is expected to be one of the important future research topics in the text processing and understanding domain. Analysis of text data is based on the use of pre-trained machine learning/data mining (ML/DM) models that may demonstrate performance degradation over ...

Added: April 5, 2025

Engagement in Online Learning through the Lens of Adult Educational Experience

Gerasimova I., Urtenova P., Kulieva A., Вопросы образования 2024 No. 4 P. 85–111

The article aims to critically examine the role of engagement in the study of the educational experiences of adult learners in online learning. The authors highlight the existence of a gap between the nature of adult educational experiences and the lens through which these experiences are studied and evaluated. This lens is the learners' engagement. ...

Added: September 25, 2024

Action Research as a Way to Transform Teachers' Beliefs about the Use of Digital Services in the Classroom

Mikhailova A., Вопросы образования 2024 No. 2 P. 139–169

Action research, that is, “learning how to do something by doing it”, is a way to change teachers’ beliefs through practice. Fifteen teachers from three primary and secondary schools conducted their own action research. After preliminary training, the teachers independently designed and conducted lessons using digital tools to stimulate students’ active learning. During ...

Added: September 19, 2024

Enhancing the Life Chances and Social Participation of Young Adults through Workplace Learning

Kersh N., Natalia Zaichenko, Zaichenko L., in: The SAGE Handbook For Learning and Work. Sage, 2022. Ch. 28 P. 474–492.

The role of workplace and vocationally oriented learning in fostering the inclusion, employability and civic engagement of young adults has been increasingly recognized by policy, practice and research (European Commission, 1998, 2000, 2016; Jarvis, 2012; Saar et al., 2013). Various configurations of work-related learning have become important components of national and international strategies for Lifelong ...

Added: September 6, 2024

Creating New Mathematics by Schoolchildren

A. L. Semenov, Soprunov S. F., Ivanov-Pogodaev I. A., Doklady Mathematics 2023 Vol. 107 No. 1 P. S132–S136

The paper discusses an example of an educational project in modern mathematics in which school students create mathematics that is new to them. The mathematical results produced by the students in the theory of definability also have an “absolute” novelty, i.e., are the basis for professional publications. The described course was based on recent results of this ...

Added: March 14, 2024

Increasing large-scale beta-synchronization during the second day of pseudoword-movement associative learning

Razorenova A., Pavlova A., Nikolaeva A. Y. et al., International Journal of Psychophysiology 2023 Vol. 188 P. 77–78

We examined MEG β-power during two days of pseudoword-movement associative learning. We discovered learning-induced β-ERS in sensorimotor, frontal and posterior temporo-parietal regions. The β-ERS in the associative cortices did not change after sleep but the one in the posterior regions continued to rise during repetitive training on the second day. ...

Added: November 22, 2023

Increasing large-scale beta-synchronization during the second day of pseudoword-movement associative learning

Razorenova A., Pavlova A., Nikolaeva A. et al., in: International Organization of Psychophysiology (IOP) - Editorial: Proceedings of IOP2023 - The 21st World Congress of Psychophysiology. Vol. 188: Supplement. Elsevier, 2023. P. 77–78.

Added: October 27, 2023

Positive feedback loops lead to concept drift in machine learning systems

Anton Khritankov, Applied Intelligence 2023 Vol. 53 No. 19 P. 22648–22666

We have derived conditions when unintended feedback loops occur in supervised machine learning systems. In this paper, we study an important problem of discovering and measuring hidden feedback loops. Such feedback loops occur in web search, recommender systems, healthcare, predictive public policing and other systems. As a possible cause of echo chambers and filter bubbles, ...

Added: September 1, 2023

Constructing the Value of Online Continuing Professional Education Courses: The Case of Consumer Online Reviews on an Educational Platform

Дубинина Д. М., Манукян Э. Р., Марченко А. В. et al., Экономическая социология 2023 Vol. 24 No. 1 P. 106–132

Since 2016, the e-learning market in Russia has been developing rapidly, and the concept of lifelong learning has become increasingly popular in society. At the same time, the use of platforms as a new form of economic organization is growing. This leads to a contradiction between the standardization of services and the platform’s aim to retain consumers. This has ...

Added: May 25, 2023

Towards Computationally Feasible Deep Active Learning

Tsvigun A., Shelmanov A., Kuzmin G. et al., in: Findings of the Association for Computational Linguistics: NAACL 2022. Seattle: Association for Computational Linguistics, 2022. P. 1198–1218.

Active learning (AL) is a prominent technique for reducing the annotation effort required for training machine learning models. Deep learning offers a solution for several essential obstacles to deploying AL in practice but introduces many others. One such problem is the excessive computational resources required to train an acquisition model and estimate its uncertainty ...

Added: November 1, 2022

Key Aspects of Implementing Active Learning Methods in the "Fundamentals of Programming" Course within the Ideas of the International Project for Reforming Engineering Education

Videnin S., Раскина А. В., Виденина М. С. et al., Современные наукоемкие технологии 2020 No. 12-1 P. 145–149

The article presents the design concept of the "Fundamentals of programming" discipline in the context of the CDIO initiative and the relationship between the key sections of the discipline and the CDIO standards. It describes the methodological features of the pedagogical technologies applied in the course. The list of competencies to be formed in the educational ...

Added: October 12, 2022

Problem Solving and Formal, Nonformal, and Informal Learning of Russian Employees (Based on PIAAC Data)

Korshunov I., Sergei Lubnikov, Miroshnikov M. et al., Adult Education Quarterly 2023 Vol. 73 No. 2 P. 197–219

Using the PIAAC data, we show how exposure to various dimensions of nonformal and informal learning relates to the problem-solving capacity in a technology-rich environment for working adults. The sample included permanent staff from various economic sectors, self-employed individuals, and casual employees on fixed-term contracts (n=1248), aged between 16 and 65 years; 38% of participants were male. ...

Added: September 19, 2022

Exponential Savings in Agnostic Active Learning through Abstention

Puchkin N., Zhivotovskiy N., IEEE Transactions on Information Theory 2022 Vol. 68 No. 7 P. 4651–4665

We show that in pool-based active classification without assumptions on the underlying distribution, if the learner is given the power to abstain from some predictions by paying the price marginally smaller than the average loss 1/2 of a random guess, exponential savings in the number of label requests are possible whenever they are possible in ...

Added: May 30, 2022

Development of Continuing Technical Professional Education Abroad

Mozhaeva G., Краснова Г. А., Полушкина Е. А., Издательский Дом Томского государственного университета, 2017.

The book is devoted to the development of continuing technical professional education in foreign countries and associations such as the EU, APEC, and BRICS. It contains a system of indicators that characterize continuing professional education in Europe and are key to the formation of national strategies and practices of professional personnel development. It is intended for specialists in the field ...

Added: March 23, 2022

Mobility for Smart Cities and Regional Development - Challenges for Higher Education, V1

Springer, 2022.

Proceedings of the 24th International Conference on Interactive Collaborative Learning (ICL2021), Volume 1 ...

Added: February 18, 2022

Learning Loss for Active Learning in Depth Reconstruction Problem

Makarov I., Guschenko-Cheverda I., in: Proceedings of IEEE 21st International Symposium on Computational Intelligence and Informatics (CINTI'21), 18-20 Nov. 2021. NY: IEEE, 2021. P. 000115–000120.

Added: January 19, 2022

Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates

Shelmanov A., Puzyrev D. A., Kupriyanova L. et al., in: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics. Vol. 16. Association for Computational Linguistics, 2021. P. 1698–1712.

Annotating training data for sequence tagging of texts is usually very time-consuming. Recent advances in transfer learning for natural language processing in conjunction with active learning open the possibility to significantly reduce the necessary annotation budget. We are the first to thoroughly investigate this powerful combination for the sequence tagging task. We conduct an extensive ...

Added: September 23, 2021

Exponential savings in agnostic active learning through abstention

Puchkin N., Zhivotovskiy N., in: Proceedings of Machine Learning Research. Vol. 134: Conference on Learning Theory. PMLR, 2021. P. 3806–3832.

Added: September 8, 2021