• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 25, 2026
HSE Scientists Train Neural Network to 'Hear' Faults in Electric Motors
Researchers at the AI and Digital Science Institute of the HSE Faculty of Computer Science have developed a new method—the Signature-Guided Data Augmentation (SGDA) framework—that achieves 99% accuracy in motor fault detection and 86% accuracy in fault classification. The application of this approach can reduce industrial equipment repair costs, minimise downtime, and improve production safety. The study results have been published in Engineering Applications of Artificial Intelligence.
May 25, 2026
'The Humanities Serve as a Conscience'
Maria Mizernaia studies Soviet literature and the history of book publishing. In this interview for the HSE Young Scientists project, she discusses plans to publish a novel about besieged Leningrad, AI-provoked reflections on what it means to be human, and how novels can help satisfy our dopamine hunger.
May 25, 2026
Is It Possible to Predict a Citys Life Based on the Shape of Its Neighbourhoods?
Is it possible to predict, based on the configuration of streets and buildings, where a café will open or where traffic congestion will occur? Participants in the Spatial Analysis and Modelling of Urban Processes research and study group use open data and machine learning to identify universal patterns. Alexander Sheludkov and Eduard Somov discuss the purpose of comparing cities, the need for new forms of urban statistics, and how open data is transforming approaches to urban studies.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings

Ch. 110. P. 25110–25118.
Shabalin A., Meshchaninov V., Chimbulatov E., Vladislav Lapikov, Kim R., Grigory Bartosh, Molchanov D., Markov S., Vetrov D.

This paper presents the Text Encoding Diffusion Model (TEncDM), a novel approach to diffusion modeling that operates in the space of pre-trained language model encodings. In contrast to traditionally used embeddings, encodings integrate contextual information. In our approach, we also employ a transformer-based decoder, specifically designed to incorporate context in the token prediction process. We conduct a comprehensive examination of the influence of the encoder, decoder, noise scheduler, and self-conditioning on zero-shot generation. Furthermore, we compare TEncDM with previous approaches on three conditional text generation tasks: QQP, XSum, and Wiki-Auto. The results show that TEncDM exhibits superior performance compared to existing non-autoregressive diffusion models.

Language: English
Full text
DOI
Text on another site
Keywords: text generationtext diffusion models

In book

Proceedings of the 39th Annual AAAI Conference on Artificial Intelligence
Proceedings of the 39th Annual AAAI Conference on Artificial Intelligence
Vol. 39. Issue 23. , Washington, United States of America: AAAI Press, 2025.
Similar publications
Искусственный интеллект как симулякр смысла
Малинов С. А., Галактика медиа: журнал медиа исследований 2025 Т. 7 № 4 С. 154–173
In recent years, artificial intelligence (AI) has been actively integrated into everyday human life. Its popularity continues to grow steadily, and companies increasingly employ AI to optimize and accelerate workflows. Ordinary users leverage large language models (LLMs) and multimodal AI systems to perform a wide range of tasks, including generating texts, images, and videos; planning ...
Added: December 7, 2025
Оценивание студенческих работ в рамках обучения академическому письму на английском языке в контексте развития инструментов искусственного интеллекта
Bakulev A., В кн.: Профессионализм учителя иностранных языков и его реализация. Сборник статей по материалам научно-методического симпозиума с международным участием «Лемпертовские чтения – XXVII» 15-17 мая 2025 года.: Пятигорск: Издательство Пятигорского государственного университета, 2025. С. 270–279.
The paper focuses on assessing students’ written papers in the discipline “Academic Writing in English in the context of AI tools’ capabilities. AI tools, specifically large language models (LLMs) appear to be able to tackle and solve a wide range of educational and research tasks. Foreign language teaching is no exception: AI tools are utilized ...
Added: June 5, 2025
РАЗРАБОТКА СИСТЕМЫ ГЕНЕРАЦИИ ПОВСЕДНЕВНЫХ ДИАЛОГОВ НА РУССКОМ ЯЗЫКЕ: ПИЛОТНОЕ ИССЛЕДОВАНИЕ
Кругликова В. Г., В кн.: Анализ речи: теоретические и прикладные аспекты: сборник научных статей.: [б.и.], 2023.
The article presents a comparative analysis of various language models used to generate texts and evaluates their effectiveness for the task of generating conversational speech. There are such models as GPT-3, BERT, LSTM involved in the comparative analysis. This study is part of a project of developing a system for generating dialogues in Russian. The ...
Added: December 10, 2023
Using Generative Pretrained Transformer-3 Models for Russian News Clustering and Title Generation tasks
Tikhonova M., Pisarevskaya D., Shavrina T. et al., Komp'juternaja Lingvistika i Intellektual'nye Tehnologii 2021 Vol. 20 P. 1214–1223
The paper presents a methodology for news clustering and news headline generation based on the zero-shot approach and minimal tuning of the RuGPT-3 architecture (Generative Pretrained Transformer 3 for Russian). The solution is presented in a competition for news clustering, headline selection and generation. The following approaches are described: 1) zero-shot unsupervised classification based on pairwise news perplexity: the ...
Added: September 22, 2023
Sounds Wilde. Phonetically extended embeddings for author-stylized poetry generation
Tikhonov A., Yamshchikov I. P., , in: Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology.: Association for Computational Linguistics, 2018. P. 117–124.
This paper addresses author-stylized text generation. Using a version of a language model with extended phonetic and semantic embeddings for poetry generation we show that phonetics has comparable contribution to the overall model performance as the information on the target author. Phonetic information is shown to be important for English and Russian language. Humans tend ...
Added: April 7, 2021
Текстема как языковая единица и сверхтекстовый конструкт
Sosnin A., В кн.: Теория и практика лингвистического описания разговорной речи: Сборник материалов Всероссийской научной конференции "Скребневские чтения" 26 октября 2016 г.: Н. Новгород: НГЛУ им. Н.А. Добролюбова, 2016. С. 170–180.
The article examines text as the primary givenness for humanitarian and philological thought and raises the possibility of instituting the textual paradigm and the texteme as a language unit and textual type; it stresses that supertextual structures significantly influence how verbal syntagms are constructed. The article also suggests that the syntax of a text is ...
Added: January 28, 2017
Правовые тексты: особенности эндофорической референции
Vlasenko S. V., В кн.: Стереотипность и творчество в тексте: межвуз. сб. науч. тр.Вып. 15.: Пермь: Пермский государственный университет, 2011. С. 185–194.
This article features issues related to the inherent structure of legal texts in its relation to the text-internal mechanisms specific for these texts. In particular, the analysis is centered around a legal texts’ propensity to ‘avoid’ pronominal deictic elements, as well as other deixis ordinarily employed for text generation. Legal texts perception is deemed dependent ...
Added: February 3, 2015
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit