Book
Computational Linguistics and Intellectual Technologies: Papers from the Annual International Conference "Dialogue" (Bekasovo, 30 May–3 June 2012). In 2 volumes
The paper describes experiments on automatic single-word term extraction that combine various word features, mainly linguistic and statistical, using machine learning methods. Since single-word terms are much more difficult to recognize than multi-word terms, a broad range of word features was taken into account, including widely known measures (such as TF-IDF), some novel features, and proposed modifications of features usually applied to multi-word term extraction. A large collection of Russian texts in the banking domain served as the target collection for the experiments. Average Precision was chosen to evaluate the results of term extraction, and a manually created thesaurus of banking terminology was used to validate the extracted terms. The experiments showed that using multiple features significantly improves the results of automatic extraction of domain-specific terms. In these experiments, logistic regression proved to be the best machine learning method for single-word term extraction; the subset of word features significant for term extraction was also identified.
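
As an illustration of the evaluation measure named in the abstract, the following is a minimal sketch of Average Precision for a ranked list of candidate words checked against a gold set of terms. It is not the authors' code: the function name `average_precision`, the sample words, and the choice to normalise by the number of gold terms found in the list are all assumptions made for the example.

```python
def average_precision(ranked_candidates, gold_terms):
    """Average Precision of a ranked candidate list against a gold term set.

    Assumption: the sum of precision values at each hit is divided by the
    number of gold terms found in the list, not by the size of the gold set.
    """
    hits = 0
    precision_sum = 0.0
    for rank, word in enumerate(ranked_candidates, start=1):
        if word in gold_terms:
            hits += 1
            # Precision at this rank: share of true terms among the top `rank` words.
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


# Hypothetical data: candidates sorted by a classifier score, best first,
# and a tiny stand-in for a terminology thesaurus.
ranked = ["loan", "bank", "day", "deposit"]
gold = {"loan", "bank", "deposit", "mortgage"}
ap = average_precision(ranked, gold)
```

Here the true terms appear at ranks 1, 2 and 4, so the value is the mean of 1/1, 2/2 and 3/4. A ranking that places true terms higher yields a larger value, which is why the measure suits comparing feature combinations.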

Students' internet usage attracts the attention of many researchers in different countries. Differences in internet penetration across countries raise the question of how medium and culture interact in this process. In this paper we present an analysis based on a sample of 825 students from 18 Russian universities and discuss findings on the particularities of students' ICT usage. Against the background of these findings, which are based on data collected in 2008-2009 during the project "A cross-cultural study of the new learning culture formation in Germany and Russia", we discuss the problem of plagiarism in Russia, the availability of ICT features in Russian universities, an evaluation of the attractiveness of different categories of ICT usage, and gender specifics in the use of ICT.