Corpus-Based Text Retrieval and Adaptation for Learning System

N. Karpov

?

Corpus-Based Text Retrieval and Adaptation for Learning System

P. 60–65.

Karpov N.

The algorithm to adapt lexical complexity in the news article which can be used as materials for learning language presented in the paper. We consider words substitution retrieval according to wordnet-based and corpus-based semantic relatedness. Two corpus-based similarity measures empirically tested: Vector Space Model and Distributional Semantic Model. This language processing algorithm has created as a client-server application. It retrieves appropriate text from Web-resource. Next it performs adaptation procedure

Language: English

Full text

Text on another site

Keywords: information retrieval

Publication based on the results of:

Адаптация языкового материала НКРЯ для электронного учебника «Русский язык как иностранный» (2013)

In book

International Conference on Advances in Computing and Information Technology - ACIT 2014

Newark: Institute of Research engineers and Doctors, 2014.

International Conference on Educational Data Mining (EDM) 2011. Proceedings of the 4th International Conference on Educational Data Mining. Eindhoven, 6-8 July, 2011

Eindhoven: Eindhoven University of Technology, 2011.

The 4th International Conference on Educational Data Mining (EDM 2011) brings together researchers from computer science, education, psychology, psychometrics, and statistics to analyze large datasets to answer educational research questions. The conference, held in Eindhoven, The Netherlands, July 6-9, 2011, follows the three previous editions (Pittsburgh 2010, Cordoba 2009 and Montreal 2008), and a series ...

Added: November 8, 2012

Proc. 35th European Conference on Information Retrieval (ECIR 2013): Advances in Information Retrieval

Springer, 2013.

Added: November 18, 2013

Роль общей и специфической лексики при извлечении информации из текста на примере анализа события «Ввод новых технологий»

Klintsov V., Bonch-Osmolovskaya A. A., Kuznetsov I. et al., Вестник Новосибирского государственного университета. Серия: Информационные технологии 2012 Т. 10 № 4 С. 74–80

This paper discusses approaches to the selection of keywords, used for information extraction of event frames. In particular, the innovation event is associated with different lexical items in different areas of knowledge. The paper evaluated the contribution of general and specific vocabulary in the representation of the frame in a particular subject area. ...

Added: September 5, 2013

Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. European Conference, ECML PKDD 2024, Vilnius, Lithuania, September 9–13, 2024, Proceedings, Part X. LNCS, volume 14950

Cham: Springer, 2024.

This multi-volume set, LNAI 14941 to LNAI 14950, constitutes the refereed proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2024, held in Vilnius, Lithuania, in September 2024. ...

Added: November 22, 2024

Lecture Notes in Computer Science

Berlin: Springer, 2013.

This book constitutes the refereed proceedings of the 20th International Symposium on String Processing and Information Retrieval, SPIRE 2013, held in Jerusalem, Israel, in October 2013. The 18 full papers, 10 short papers were carefully reviewed and selected from 60 submissions. The program also featured 4 keynote speeches. The following topics are covered: fundamentals algorithms ...

Added: October 30, 2013

From Triconcepts to Triclusters

Ignatov D. I., Kuznetsov S., , in: Conceptual Structures: Leveraging Semantic Technologies. 17th International Conference on Conceptual Structures, ICCS 2009, Moscow, Russia, July 26-31, 2009, ProceedingsVol. 5662. Berlin, Heidelberg: Springer, 2009. P. 185–200.

A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use an approach based on computing (closed) sets of attributes having large support (large extent) as clusters of similar documents. The method is tested in a ...

Added: December 9, 2012

Bimodal Cross-Validation Approach for Recommender Systems Diagnostics

Ignatov D. I., Poelmans J., , in: Diagnostic Test Approaches to Machine Learning and Commonsense Reasoning Systems. Hershey: IGI Global, 2012. Ch. 8 P. 185–195.

Recommender systems are becoming an inseparable part of many modern Internet web sites and web shops. The quality of recommendations made may significantly influence the browsing experience of the user and revenues made by web site owners. Developers can choose between a variety of recommender algorithms; unfortunately no general scheme exists for evaluation of their ...

Added: December 3, 2012

DC ECIR 2012 -- Doctoral Consortium Doctoral Consortium is associated with the 35th European Conference on Infor- mation Retrieval (ECIR 2013) March 24, 2013, Moscow, Russia

M.: HSE, 2013.

Doctoral students were invited to the Doctoral Consortium held in conjunction with the main conference of ECIR 2013. The Doctoral Consortium aimed to provide a constructive setting for presentations and discussions of doctoral students’ research projects with senior researchers and other participating students. The two main goals of the Doctoral Consortium were: 1) to advise ...

Added: October 27, 2013

Text Mining Scientific Papers: A Survey on FCA-Based Information Retrieval Research

Poelmans J., Ignatov D. I., Viaene S. undefined. et al., , in: Advances in Data Mining. Applications and Theoretical Aspects. 12th Industrial Conference, ICDM 2012, Berlin, Germany, July 13-20, 2012. ProceedingsVol. 7377. Berlin, Heidelberg: Springer, 2012. P. 273–287.

Formal Concept Analysis (FCA) is an unsupervised clustering technique and many scientific papers are devoted to applying FCA in Information Retrieval (IR) research. We collected 103 papers published between 2003-2009 which mention FCA and information retrieval in the abstract, title or keywords. Using a prototype of our FCA-based toolset CORDIET, we converted the pdf-files containing ...

Added: December 3, 2012

Proceedings of the International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI at ECAI 2016)

M.: [б.и.], 2016.

The four preceding editions of the FCA4AI Workshop showed that many researchers working in Artificial Intelligence are deeply interested by a well-founded method for classi- fication and mining such as Formal Concept Analysis (see http://www.fca4ai.hse.ru/). The first edition of FCA4AI was co-located with ECAI 2012 in Montpellier, the second one with IJCAI 2013 in Beijing, ...

Added: October 6, 2016

Advances in Information Retrieval

Kuznetsov S., Serdyukov P., Segalovich I. et al., L.: Springer, 2013.

Higher School of Economics (HSE) and supported by the Information Retrieval Specialist Group at the British Computer Society (BCS–IRSG). The conference was held during March 24–27, 2013, in Moscow, Russia – the easternmost location in the history of the ECIR series. ECIR 2013 received a total of 287 submissions in three categories: 191 full papers, ...

Added: April 15, 2013

CEUR Workshop Proceedings (Proceedings of the International Conference "Internet and Modern Society" IMS-2020, 17-20 June 2020, ITMO University, St. Petersburg, Russia)

CEUR Workshop Proceedings, 2020.

The International Conference “Internet and Modern Society” (IMS-2020) was initially planned to take place in St. Petersburg, Russia. Due to the spread of COVID-19 and the ban on public events, the conference was held during 17-20 June 2020 in the format of online sessions with a discussion of papers and presentations uploaded in advance. The ...

Added: November 1, 2020

Proceedings of COLING 2012: Posters

Mumbai: The COLING 2012 Organizing Committee, 2012.

Added: April 17, 2013

Proceedings of the Workshop Formal Concept Analysis Meets Information Retrieval

M.: CEUR Workshop Proceedings, 2013.

Added: November 18, 2013

Proceedings of the International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI at ECAI 2014)

Prague: CEUR Workshop Proceedings, 2014.

The first and the second edition of the FCA4AI Workshop showed that many researchers working in Artificial Intelligence are indeed interested by a well-founded method for classi- fication and mining such as Formal Concept Analysis (see http://www.fca4ai.hse.ru/). The first edition of FCA4AI was co-located with ECAI 2012 in Montpellier and published as http://ceur-ws.org/Vol-939/ while the ...

Added: September 12, 2014

International Conference on Advances in Computing and Information Technology - ACIT 2014

Newark: Institute of Research engineers and Doctors, 2014.

Proceedings of International Conference on Advances in Computing and Information Technology 04-05 January, 2014 Bangkok, Thailand. ...

Added: December 9, 2013

Content Based Video Retrieval System for Distorted Video Queries

Boris Tseytlin, Makarov I., , in: Proceedings of the Conference on Modeling and Analysis of Complex Systems and Processes 2020 (MACSPro 2020)Vol. 2795. CEUR Workshop Proceedings, 2020. P. 99–107.

We consider the task of content-based video retrieval (CBVR) given a query video, which is expected to match if it is a distorted short subsequence of a reference video from a database. In this paper, we present a CBVR system architecture that is both robust and scalable. We use a modified rHash frame fingerprint generation ...

Added: January 4, 2021

Formal Concept Analysis Meets Information Retrieval 2013

Aachen: CEUR Workshop Proceedings, 2013.

Formal Concept Analysis (FCA) is a mathematically well-founded theory aimed at data analysis and classication, introduced and detailed in the book of Bernhard Ganter and Rudolf Wille, \Formal Concept Analysis", Springer 1999. The area came into being in the early 1980s and has since then spawned over 10000 scientic publications and a variety of practically ...

Added: October 10, 2013

Learning Alternative Name Spellings

Zhukov L. E., Sukharev J., Popescul A., Information Retrieval 2014

Name matching is a key component of systems for entity resolution or record linkage. Alternative spellings of the same names are a com- mon occurrence in many applications. We use the largest collection of genealogy person records in the world together with user search query logs to build name matching models. The procedure for building ...

Added: May 12, 2014