?
Semantic Proximity Establishment in the Tasks of Knowledge Extraction and Named Entities Recognition
The paper deals with the problem of establishing text segments containing the similar semantic units for the tasks of analytical text processing within the semantic technology platform. The methods and instruments presented in the paper provide the discovery of relevant content based on users' focused interests within a certain domain. The hybrid approach comprising linguistic rules and example-based learning techniques is employed. The legal and mass media texts are considered. In this paper a brief description of the NER task history is cited, the Pullenti-based engine is specified, the two-step Semantic Expansion Algorithm is presented, the Distributional Semantics methods for domain terms extraction are discussed as well as some technical challenges and the prospective directions of further research and development.