К построению инвентаря русских именных конструкций
The paper presents experimental results on automatic construction identification performed on the Russian National Corpus (RNC). For this purpose we developed a toolbox which allows to extract and process co-occurrence data from RNC samples. Russian nouns are chosen as target words. Lists of constructions were built for each target word. By constructions we mean frequent word combinations which include a target word and frequent lexical-semantic tags – context marker of certain meanings of a target word, as well as frequent lemmas representing the given lexical-semantic tags. E.g.: ВИД (kind, sort, type) + r:abstr t:sport: спорт (sport), футбол (football), биатлон (biathlon), etc. Extracted constructions are grouped according to their structure and lexical-semantic content. In conclusion we perform verification of experimental results which implies comparison of lists of constructions with lists of collocations, idioms, etc. registered in various linguistic resources (bigram search engines, dictionaries).