?
Crowdsourcing morphological annotation
.
Bocharov V. V., Alexeeva S. V., Granovsky D. V., Protopopova E. V., Stepanova M. E., Surikov A. V.
Manually annotated corpora are very important and very expensive resources:
the annotation process requires a lot of time and skills. In Open-
Corpora project we are trying to involve into annotation works native speakers
with no special linguistic knowledge. In this paper we describe the way
we organize our processes in order to maintain high quality of annotation
and report on our preliminary results.
Language:
English
In book
Т. 1: Основная программа конференции. Вып. 12 (19). , М. : РГГУ, 2013
Alexeeva S. V., Protopopova E. V., Bodrova A. A. et al., Компьютерная лингвистика и интеллектуальные технологии 2014 P. 562-571
The paper describes the noun phase and anaphora annotation in OpenCorpora and compares it to that in other corpora. We discuss the choice of representative texts for anaphoric annotation and the basic principles of syntactic annotation. In case of noun phrase annotation we followed the scheme introduced earlier for morphological annotation: it was carried out ...
Added: October 8, 2014
Klyshinskiy E., Рысаков С. В., Новые информационные технологии в автоматизированных системах 2015 С. 555-563
Статья знакомит читателя со статистическими методами устранения морфологической неоднозначности. Описывается процесс насыщения, параметры методов и n-грамм. Большое внимание уделено методам снятия омонимии, в обзоре которых описания сопровождены практическими оценками и даны алгоритмы их работы. В конце приведено сравнение качества методов дизамбигуации, осуществлённое авторами. ...
Added: November 25, 2015
Edeleva J., Chrabaszcz A., Demareva V., The Quarterly Journal of Experimental Psychology 2020 Vol. 73 No. 8 P. 1173-1188
We report results from a self-paced silent reading study and a self-paced reading-aloud study examining ambiguous forms (heteronyms) of Russian animate and inanimate nouns which are differentiated in speech through word stress, e.g. uCHItelja.TEACHER.GEN/ACC.SG and uchiteLJA.TEACHERS.NOM.PL1. During reading, the absence of the auditory cue (word stress) to word identification results in morphologically ambiguous forms since ...
Added: November 25, 2019
Sorokin P. S., Afanaseva I., Шмаевка В. К. et al., М. : Издательский дом НИУ ВШЭ, 2023
The issue of agency (enterprise, initiative) is one of the central ones for the corporate sector. The key factor determining the importance of this issue is the processes of ‘destructuration’, that is, the growth of variability in the forms of social organization in various spheres of public life.
The authors identified three levels of proactive behavior ...
Added: November 16, 2023
Dolzhenko R. A., Проблемы теории и практики управления 2014 № 4 С. 125-129
В статье рассмотрены возможности использования краудсорсинга при определении основных стратегических направлений развития компании. Определена суть краудсорсинга, выделены типы краудсорсинга. Рассмотрен опыт Сбербанка России в этой области, который первым среди отечественных компаний, привлёк сообщество заинтересованных работников к определению стратегических направлений развития Банка на период до 2018 года. ...
Added: October 21, 2014
Iomdin B., Iomdin L., , in : Meaning Text Theory: Current Developments. Vol. . Issue 85.: Muenchen : Wiener Slawistischer Almanach, 2013.
The paper discusses the semantic interaction of the negation with certain types of verbal predicates in Russian, which involves, depending on the predicate type and its main valency structure, the emergence of new semantic valencies: the valency of the missing distance, the valency of the missing time span, and the valency of the missing quantity. ...
Added: August 20, 2014
Ozcan S., Boye D., Arsenyan J. et al., IEEE Transactions on Engineering Management 2022 Vol. 69 No. 6 P. 3023-3037
Crowdsourcing is a multidisciplinary research area that represents a rapidly expanding field where new applications are constantly emerging. Research in this area has investigated its use for citizen science in data gathering for research and crowdsourcing for industrial innovation. Previous studies have reviewed and categorized crowdsourcing research using qualitative methods. This has led to the ...
Added: September 12, 2022
Копыченко Г.С., Управленческие науки 2014
The article describes questions of the formation new centers of development of regions and country. These development centers in all Russian regions are cities as they concentrate the main socio-economic potential of the country, which determining their level of competitiveness. The most effective mechanism for management of cities’ competitiveness in the long term period is ...
Added: May 16, 2014
Lyashevskaya O., Ostyakova L., Сальников Е. А. et al., , in : Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 17 июня — 20 июня 2020 г.). Дополнительный том материалов. : M. : ., 2020. P. 1091-1108.
Orthographic and morphological heterogeneity of historical texts in pre-modern Slavic causes many difficulties in pos- and morphological tagging. Existing approaches to these tasks show state-of-the-art results without normalization, but they are still very sensitive to the properties of training data such as genre and origin. In this paper, we investigate to what extent the heterogeneity ...
Added: July 3, 2020
Корнилов А. М., Теоретическая экономика 2020 № 8(68) С. 32-38
The ongoing process of digital transformation the world economic system is currently experiencing has recently been perceived as a universal solution to all the problems of the world economy. Meanwhile its’ very conceptual basis is such that it promises in the near future rather than construction of a utopian “knowledge economy”, institutionalization of imitation develop ...
Added: February 19, 2021
Tsaplin E., Бушеленкова С. В., Проблемы теории и практики управления 2015 № 2 С. 127-132
This article discusses the concept of web 3.0 and the upcoming changes related to the use of this technology in the marketing field. The exploratory study presents the results of the research of possible development directions of crowdsourcing technologies, remote work, and management of unstructured distributed data in the open Internet. The article discusses in ...
Added: October 19, 2014
Patarakin Y., Ярмахов Б. Б., Burov V., Образовательные технологии и общество 2012 Т. 15 № 2 С. 517-535
We create collaborative environment for collaborative creation, improvement and promoting bills within public and legislative projects. Enacting a new law means that a community devises out new rules which help it to become more efficient. Below are the principles on which legislative collaboration is based: Public construction of a document aiming at complex cloud issues ...
Added: August 19, 2012
Iomdin B., Иомдин Л. Л., Труды института русского языка им. В.В. Виноградова 2020 № 25 С. 49-61
The paper discusses certain Russian verbs that have so far been considered as having the valency of content: otnekivat’sja ‘to deny,’ izvinjat’sja ‘to apologize,’ opravdyvat’sja ‘to make excuses,’ otgovarivat’sja ‘to dissuade,’ otbrexivat’sja ‘to rebute,’ otpirat’sja ‘to disown,’ vozmushchat’sja ‘to resent,’ vozrazhat’ ‘to object,’ sporit’ ‘to argue,’ etc. Wу hypothesize that in reality these valencies should ...
Added: November 6, 2020
Dolzhenko R. A., Известия высших учебных заведений. Серия: Экономика, финансы и управление производством 2015 Т. 23 № 1 С. 67-75
В статье рассмотрены возможности использования краудсорсинга для обсуждения и доработки отчёта о корпоративной социальной ответственности силами заинтересованной общественности. Определена суть краудсорсинга, описаны этапы краудсорсинга документов организации. Выделены преимущества его использования для обсуждения и доработки документа, как для организации, так и для участников. Рассмотрен опыт Сбербанка России в этой области, который первым среди отечественных компаний, на ...
Added: April 18, 2015
Dudchuk P., Hladky D., Klintsov V., , in : I-SEMANTICS'10. Proceedings of the 6th International Conference on Semantic Systems. : NY : ACM, 2010. Ch. 39. P. 246-247.
This project describes an application for creating ubiquitous hypertext on the Web, which enhances the user experience by allowing clipping and sharing the information. The goal of the application is to annotate text and link it to relevant content, especially from the Linked Open Data community and from the Ontos knowledge base. The paper describes ...
Added: October 19, 2014
Wohlgenannt G., Artemova E., Ilvovsky D., , in : Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH). : Osaka : [б.и.], 2016. Ch. 4. P. 18-26.
In this paper a social network is extracted from a literary text. The social network shows, how frequent the characters interact and how similar their social behavior is. Two types of similarity measures are used: the first applies co-occurrence statistics, while the second exploits cosine similarity on different types of word embedding vectors. The results ...
Added: March 6, 2017
Fornaciari T., Cagnina L., Россо П. et al., Language Resources and Evaluation 2020 Vol. 54 No. 4 P. 1019-1058
Identifying deceptive online reviews is a challenging tasks for Natural Language Processing (NLP). Collecting corpora for the task is difficult, because normally it is not possible to know whether reviews are genuine. A common workaround involves collecting (supposedly) truthful reviews online and adding them to a set of deceptive reviews obtained through crowdsourcing services. Models ...
Added: October 29, 2020
Springer, 2014
This book constitutes the refereed proceedings of the 10th International Conference on Machine Learning and Data Mining in Pattern Recognition, MLDM 2014, held in St. Petersburg, Russia in July 2014. The 40 full papers presented were carefully reviewed and selected from 128 submissions. The topics range from theoretical topics for classification, clustering, association rule and ...
Added: September 30, 2014
Frankish A., Uszczynska B., Ritchie G. et al., BMC Genomics 2015 Vol. 16 No. Suppl 8 P. S2
BACKGROUND:
A vast amount of DNA variation is being identified by increasingly large-scale exome and genome sequencing projects. To be useful, variants require accurate functional annotation and a wide range of tools are available to this end. McCarthy et al recently demonstrated the large differences in prediction of loss-of-function (LoF) variation when RefSeq and Ensembl transcripts ...
Added: February 20, 2017
Lancaster : Lancaster University Press, 2015
The main trends and achievements in corpus linguistics are presented in this collection os abstracts of plenaries, papers and posters presented at the 8th internation conference Corpus Linguistics - 2015 (Lancaster University, UCREL, July 2015) ...
Added: October 17, 2015
Ignatov D. I., Kaminskaya A. Y., Konstantinova N. et al., , in : Proceedings of The 2014 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2014, 11-14 August 2014 Warsaw, Poland. : Los Alamitos, Washington, Tokyo : IEEE Computer Society, 2014. P. 327-335.
This paper discusses the recommender models and methods for crowdsourcing platforms. These models are based on modern methods of data analysis of object-attribute data, such as Formal Concept Analysis and biclustering. In particular, the paper is focused on the solution of two tasks – idea and antagonists recommendation – on the example of crowdsourcing platform ...
Added: June 9, 2014
Dolzhenko R. A., Барнаул : Издательство Алтайского университета, 2014
В монографии проводится изучение краудсорсинга - новой формы взаимодействия организации, персонала и общества, получившей распространение в условиях формирования постиндустриального общества.
Рассмотренные в работе подходы к организации и мотивации участников краудсорсинговых проектов могут стать основой для внедрения краудсорсинга в практику отечественных компаний. ...
Added: November 6, 2014
Kharlamov A. A., , in : Neuroinformatics and Semantic Representations: Theory and Applications. : Cambridge Scholars Publishing, 2020. P. 156-167.
На основе представлений об обработке информации в мозге человека [1] реализована технология автоматической смысловой обработки текстов TextAnalyst, позволяющая выявить ключевые понятия текста в их взаимосвязях, реализовать реферирование текстов и их смысловое сравнение (классификацию). Реализованы продукты, использующие функциональность этой технологии: персональный – TextAnalyst, и библиотека COM модулей – TextAnalyst SDK. ...
Added: December 7, 2021
Dolzhenko R. A., Экономический анализ: теория и практика 2014 № 36 С. 30-38
The article considers the possibility of evaluating the effectiveness of using the crowdsourcing in the organization. The nature of crowdsourcing and ingredients that make up effect of using this technology are considered. The performance and effectiveness of crowdsourcing activities are compared. The indicators for evaluating the effectiveness of crowdsourcing in the organization are presented. Author ...
Added: November 6, 2014