Найдено научных статей и публикаций: 3, для научной тематики: Thesaurus
1.
A. A. Krizhanovsky, A. V. Smirnov
- Journal of Computer and Systems Sciences International , 2013
An approach to the design of a system of automated construction of a general-purpose lexical ontology is proposed, and the architecture of such a system is described. Wiktionary is chosen as the online dictionary because it has a large database of words with translations into many languages. The str...
An approach to the design of a system of automated construction of a general-purpose lexical ontology is proposed, and the architecture of such a system is described. Wiktionary is chosen as the online dictionary because it has a large database of words with translations into many languages. The structure of the dictionary entry is considered using the English Wiktionary as an example. This structure is used to design a database for storing the retrieved information. Ontologies are an important part of knowledge management systems. Ontologies require the development of approaches and algorithms for their construction. Lexical ontologies are constructed, and the main features of two ontology databases based on the Russian and English Wiktionaries are compared. The dynamics of numerical parameters of Wiktionaries and general-purpose lexical ontologies for 2010–2012 constructed by the authors is analyzed.
A. A. Krizhanovsky, A. V. Smirnov. An approach to automated construction of a general-purpose lexical ontology based on Wiktionary. J. of Computer and Systems Sciences International, 2013, Vol. 52, No. 2, pp. 215–225.
2.
Крижановский Андрей
- (презентация в pdf) , 2010
Для хранения лексикографической информации Русского Викисловаря разработаны (1) правила (на основе регулярных выражений) извлечения текстовых данных, (2) структура базы данных для хранения данных, (3) программный интерфейс к этой базе данных. Созданный машинно-читаемый словарь был использован в эксп...
Для хранения лексикографической информации Русского Викисловаря разработаны (1) правила (на основе регулярных выражений) извлечения текстовых данных, (2) структура базы данных для хранения данных, (3) программный интерфейс к этой базе данных. Созданный машинно-читаемый словарь был использован в эксперименте для сравнения алгоритмов, вычисляющих семантическое расстояние на основе данных Русского Викисловаря и WordNet. Алгоритмы и метрики оценивались с помощью тестовой коллекции (из 353 пар английских слов), включающей оценку экспертов. Эксперимент показал, что предложенный метод позволяет вычислить семантическое расстояние между парой слов, в принципе, на любом из языков, представленных в Русском Викисловаре.
3.
A. A. Krizhanovsky, F. Lin
, 2009
A set of ontology matching algorithms (for finding correspondences between concepts) is based on a thesaurus that provides the source data for the semantic distance calculations. In this wiki era, new resources may spring up and improve this kind of semantic search. In the paper a solution of this t...
A set of ontology matching algorithms (for finding correspondences between concepts) is based on a thesaurus that provides the source data for the semantic distance calculations. In this wiki era, new resources may spring up and improve this kind of semantic search. In the paper a solution of this task based on Russian Wiktionary is compared to WordNet based algorithms. Metrics are estimated using the test collection, containing 353 English word pairs with a relatedness score assigned by human evaluators. The experiment shows that the proposed method is capable in principle of calculating a semantic distance between pair of words in any language presented in Russian Wiktionary. The calculation of Wiktionary based metric had required the development of the open-source Wiktionary parser software.
A. Krizhanovsky, F. Lin, Related terms search based on WordNet / Wiktionary and its application in Ontology Matching. In: RCDL 2009. September 17-21, Petrozavodsk, Russia. – pp. 363-369.