From Language Documentation Data to LLOD: A Case Study in Turkic Lemon Dictionaries

Authors: Chiarcos Christian, Walther Désirée, Ionov Maxim
Collection: Proceedings of the LDK 2017 Workshops: 1st Workshop on the OntoLex Model (OntoLex-2017), Shared Task on Translation Inference Across Dictionaries & Challenges for Wordnets
Series: CEUR Workshop Proceedings (CEUR-WS.org)
Volume: 1899
Year of publication: 2017
Place of publication: CEUR-WS.org
First page: 22
Last page: 32
Abstract: In this paper, we describe the Lemon-OntoLex modeling of dictionaries created within language documentation efforts. We focus on exemplary resources for two less-resourced languages from the Turkic language family, Chalkan and Tuvan. Both datasets have been converted into a Linked Data representation using the Lemon-OntoLex data model, with an extensible converter written in Python. We compare the conversion process for the two lexical resources, analyze the difficulties we encountered, and discuss the cases that caused the most common problems during conversion. Furthermore, we evaluate the quality of the converted dictionaries using specially designed SPARQL queries and by manually checking random samples of the data. Finally, we describe the future application of this data within a lexicographic-comparative workbench designed to facilitate language contact studies.
Added to the system by: Ionov Maxim Igorevich
