Using TEI, CMDI and ISOcat in CLARIN-DK
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Standard
Using TEI, CMDI and ISOcat in CLARIN-DK. / Hansen, Dorte Haltrup; Offersgaard, Lene; Olsen, Sussi.
Proceedings of the Ninth International Conference on Language Resources and Evaluation: LREC 2014. Reykjavik, Iceland: European Language Resources Association, 2014. p. 613-618.
RIS
TY - GEN
T1 - Using TEI, CMDI and ISOcat in CLARIN-DK
AU - Hansen, Dorte Haltrup
AU - Offersgaard, Lene
AU - Olsen, Sussi
PY - 2014/5
Y1 - 2014/5
N2 - This paper presents the challenges and issues encountered in the conversion of TEI header metadata into the CMDI format. The work is carried out in the Danish research infrastructure, CLARIN-DK, in order to enable the exchange of language resources nationally as well as internationally, in particular with other partners of CLARIN ERIC. The paper describes the task of converting an existing TEI specification applied to all the text resources deposited in DK-CLARIN. During the task we have tried to reuse and share CMDI profiles and components in the CLARIN Component Registry, as well as linking the CMDI components and elements to the relevant data categories in the ISOcat Data Category Registry. The conversion of the existing metadata into the CMDI format turned out not to be a trivial task and the experience and insights gained from this work have resulted in a proposal for a work flow for future use. We also present a core TEI header metadata set.
KW - Faculty of Humanities
KW - CMDI
KW - TEI
KW - metadata
M3 - Article in proceedings
SP - 613
EP - 618
BT - Proceedings of the Ninth International Conference on Language Resources and Evaluation
PB - European Language Resources Association
CY - Reykjavik, Iceland
ER -
ID: 131792950