TY - GEN
T1 - Towards automatic wayang ontology construction using relation extraction from free text
AU - Sanabila, Hadaiq Rolis
AU - Manurung, Ruli
N1 - Publisher Copyright: © 2014 Association for Computational Linguistics.
PY - 2014
Y1 - 2014
N2 - This paper reports on our work to automatically construct and populate an ontology of wayang (Indonesian shadow puppet) mythology from free text using relation extraction and relation clustering. A reference ontology, containing concepts and properties within the wayang character domain, is used to evaluate the generated ontology. We examined the influence of corpus data variations, threshold value variations in the relation clustering process, and the use of entity pairs or entity pair types during the feature extraction stage. The constructed ontology is examined using three evaluation methods: cluster purity (CP), instance knowledge (IK), and relation concept (RC). Based on the evaluation results, the proposed method generates the best ontology when a consolidated corpus is used, the relation clustering threshold is set to 1, and entity pairs are used during feature extraction.
UR - http://www.scopus.com/inward/record.url?scp=85011255640&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85011255640
T3 - Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH 2014 at the 14th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2014
SP - 128
EP - 136
BT - Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH 2014 at the 14th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2014
A2 - Zervanou, Kalliopi
A2 - Vertan, Cristina
A2 - van den Bosch, Antal
A2 - Sporleder, Caroline
PB - Association for Computational Linguistics (ACL)
T2 - 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH 2014 at the 14th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2014
Y2 - 26 April 2014
ER -