• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
July 2, 2026
Researchers Discover How Spelling Errors Slow Down Reading in Russian
Psycholinguists from the Centre for Language and Brain at HSE University–St Petersburg have shown that words that are frequently misspelled are processed more slowly by readers, even when presented with the correct spelling. The researchers confirmed this effect for the first time using Russian-language materials and found that response speed is most strongly linked to how confidently individuals can distinguish the correct spelling of a word from an incorrect one. The study has been published in The Mental Lexicon.
July 2, 2026
HSE Develops App for Assessing Phonological Processing in Children
Researchers at the HSE Centre for Language and Brain have developed a new digital tool for assessing children's phonological processing skills—the ZARYA (Sound Analysis of the Russian Language) test battery. It is the first standardised application in Russia designed to provide a fast and reliable assessment of children's ability to distinguish speech sounds, retain them in working memory, and perform phonemic analysis. The app runs on Android tablets and smartphones and is available for download from RuStore. Details of the test validation have been published in the Journal of Speech, Language, and Hearing Research.
July 1, 2026
Scientists Discover Why Europium 'Misbehaves'
Europium is a rare-earth metal responsible for the pure red glow in displays and other luminescent materials. For a long time, however, it refused to emit light when surrounded by certain organic molecules known as acylpyrazolone ligands. Chemists have now uncovered the reason: in europium complexes with these ligands, a 'black window' appears—a charge-transfer state in which the energy absorbed by the ligand is dissipated as heat rather than emitted as light. Understanding this mechanism opens the way to designing more efficient red-emitting materials for displays, fluorescent thermometers, and chemical sensors. The results have been published in Dalton Transactions.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings

Ch. 110. P. 25110–25118.
Shabalin A., Meshchaninov V., Chimbulatov E., Vladislav Lapikov, Kim R., Grigory Bartosh, Molchanov D., Markov S., Vetrov D.

This paper presents the Text Encoding Diffusion Model (TEncDM), a novel approach to diffusion modeling that operates in the space of pre-trained language model encodings. In contrast to traditionally used embeddings, encodings integrate contextual information. In our approach, we also employ a transformer-based decoder, specifically designed to incorporate context in the token prediction process. We conduct a comprehensive examination of the influence of the encoder, decoder, noise scheduler, and self-conditioning on zero-shot generation. Furthermore, we compare TEncDM with previous approaches on three conditional text generation tasks: QQP, XSum, and Wiki-Auto. The results show that TEncDM exhibits superior performance compared to existing non-autoregressive diffusion models.

Language: English
Full text
DOI
Text on another site
Keywords: text generationtext diffusion models

In book

Proceedings of the 39th Annual AAAI Conference on Artificial Intelligence
Proceedings of the 39th Annual AAAI Conference on Artificial Intelligence
Vol. 39. Issue 23. , Washington, United States of America: AAAI Press, 2025.
Similar publications
Влияние социально-демографических факторов на эмоциональное восприятие текста цифровой коммуникации: опыт экспериментального исследования
Герцен А. С., Виртуальная коммуникация и социальные сети 2026 Т. 5 С. 198–206
Emotional interpretation of digital texts requires the reconstruction of the author’s intent. According to Activity Theory, emotions are shaped by historical and cultural factors rather than human biology. Using the Geneva Emotion Wheel model, the authors studied responses to a digital communication text in order to measure the convergence between attributed emotions (those the reader ...
Added: July 2, 2026
Искусственный интеллект как симулякр смысла
Малинов С. А., Галактика медиа: журнал медиа исследований 2025 Т. 7 № 4 С. 154–173
In recent years, artificial intelligence (AI) has been actively integrated into everyday human life. Its popularity continues to grow steadily, and companies increasingly employ AI to optimize and accelerate workflows. Ordinary users leverage large language models (LLMs) and multimodal AI systems to perform a wide range of tasks, including generating texts, images, and videos; planning ...
Added: December 7, 2025
Оценивание студенческих работ в рамках обучения академическому письму на английском языке в контексте развития инструментов искусственного интеллекта
Bakulev A., В кн.: Профессионализм учителя иностранных языков и его реализация. Сборник статей по материалам научно-методического симпозиума с международным участием «Лемпертовские чтения – XXVII» 15-17 мая 2025 года.: Пятигорск: Издательство Пятигорского государственного университета, 2025. С. 270–279.
The paper focuses on assessing students’ written papers in the discipline “Academic Writing in English in the context of AI tools’ capabilities. AI tools, specifically large language models (LLMs) appear to be able to tackle and solve a wide range of educational and research tasks. Foreign language teaching is no exception: AI tools are utilized ...
Added: June 5, 2025
РАЗРАБОТКА СИСТЕМЫ ГЕНЕРАЦИИ ПОВСЕДНЕВНЫХ ДИАЛОГОВ НА РУССКОМ ЯЗЫКЕ: ПИЛОТНОЕ ИССЛЕДОВАНИЕ
Кругликова В. Г., В кн.: Анализ речи: теоретические и прикладные аспекты: сборник научных статей.: [б.и.], 2023.
The article presents a comparative analysis of various language models used to generate texts and evaluates their effectiveness for the task of generating conversational speech. There are such models as GPT-3, BERT, LSTM involved in the comparative analysis. This study is part of a project of developing a system for generating dialogues in Russian. The ...
Added: December 10, 2023
Using Generative Pretrained Transformer-3 Models for Russian News Clustering and Title Generation tasks
Tikhonova M., Pisarevskaya D., Shavrina T. et al., Komp'juternaja Lingvistika i Intellektual'nye Tehnologii 2021 Vol. 20 P. 1214–1223
The paper presents a methodology for news clustering and news headline generation based on the zero-shot approach and minimal tuning of the RuGPT-3 architecture (Generative Pretrained Transformer 3 for Russian). The solution is presented in a competition for news clustering, headline selection and generation. The following approaches are described: 1) zero-shot unsupervised classification based on pairwise news perplexity: the ...
Added: September 22, 2023
Sounds Wilde. Phonetically extended embeddings for author-stylized poetry generation
Tikhonov A., Yamshchikov I. P., , in: Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology.: Association for Computational Linguistics, 2018. P. 117–124.
This paper addresses author-stylized text generation. Using a version of a language model with extended phonetic and semantic embeddings for poetry generation we show that phonetics has comparable contribution to the overall model performance as the information on the target author. Phonetic information is shown to be important for English and Russian language. Humans tend ...
Added: April 7, 2021
Текстема как языковая единица и сверхтекстовый конструкт
Sosnin A., В кн.: Теория и практика лингвистического описания разговорной речи: Сборник материалов Всероссийской научной конференции "Скребневские чтения" 26 октября 2016 г.: Н. Новгород: НГЛУ им. Н.А. Добролюбова, 2016. С. 170–180.
The article examines text as the primary givenness for humanitarian and philological thought and raises the possibility of instituting the textual paradigm and the texteme as a language unit and textual type; it stresses that supertextual structures significantly influence how verbal syntagms are constructed. The article also suggests that the syntax of a text is ...
Added: January 28, 2017
Правовые тексты: особенности эндофорической референции
Vlasenko S. V., В кн.: Стереотипность и творчество в тексте: межвуз. сб. науч. тр.Вып. 15.: Пермь: Пермский государственный университет, 2011. С. 185–194.
This article features issues related to the inherent structure of legal texts in its relation to the text-internal mechanisms specific for these texts. In particular, the analysis is centered around a legal texts’ propensity to ‘avoid’ pronominal deictic elements, as well as other deixis ordinarily employed for text generation. Legal texts perception is deemed dependent ...
Added: February 3, 2015
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit