ИСТИНА |
Войти в систему Регистрация |
|
ИСТИНА ЦЭМИ РАН |
||
We provide a survey over the main strategies to harmonize and to integrate TEI/XML documents and Linked Open Data resources. As a highly popular community standard, the Text Encoding Initiative (TEI) provides a the most frequently adopted model for the semantic markup of text data in the Digital Humanities. Likewise, applications of Linked Open Data technologies and resources in Digital Humanities are manifold, and where commonly used LOD and RDF technology is employed, the scientific challenges involved are comparable to those in other areas of application. A scientific problem specific to Digital Humanities is, however, how these technologies can be related to the TEI as the current de facto standard for computational philology. While benefits of LOD technologies have long been recognized in the DH community, and lead to the formation of a LOD SIG in 2014, there is no agreement on possible technological bridges between TEI/XML and LOD technology. With this paper, we provide an overview over existing solutions and their characteristics, and contribute to the discussion of the further standardization — and possibly, revision — of these possibilities. We focus on in-line XML, because Web Annotation (Sanderson et al., 2017) already provides a convenient and established W3C standard for establishing LOD as a standoff layer over XML documents. В докладе мы приводим сравнение основных стратегий интеграции двух широко известных форматов представления лингвистических данных: TEI/XML и Linked Data. TEI — это наиболее широко используемая модель данных в области Digital Humanities, однако достаточно большое количество ресурсов существует в виде Linked Data, что приводит к необходимости интеграции этих двух миров. В докладе мы приводим обзор существующих стратегий, их достоинства и недостатки и предлагаем возможные пути к дальнейшему их развитию