TY - GEN
T1 - Named Entity Recognition for the Indonesian language
T2 - 8th International Conference on Discovery Science, DS 2005
AU - Budi, Indra
AU - Bressan, Stéphane
AU - Wahyudi, Gatot
AU - Hasibuan, Zainal Arifin
AU - Nazief, Bobby Achirul Awal
PY - 2005
Y1 - 2005
N2 - We present a novel named entity recognition approach for the Indonesian language. We call the new method InNER for Indonesian Named Entity Recognition. InNER is based on a set of rules capturing the contextual, morphological, and part of speech knowledge necessary in the process of recognizing named entities in Indonesian texts. The InNER strategy is one of knowledge engineering: the domain and language specific rules are designed by expert knowledge engineers. After showing in our previous work that mined association rules can effectively recognize named entities and outperform maximum entropy methods, we needed to evaluate the potential for improvement to the rule based approach when expert crafted knowledge is used. The results are conclusive: the InNER method yields recall and precision of up to 63.43% and 71.84%, respectively. Thus, it significantly outperforms not only maximum entropy methods but also the association rule based method we had previously designed.
AB - We present a novel named entity recognition approach for the Indonesian language. We call the new method InNER for Indonesian Named Entity Recognition. InNER is based on a set of rules capturing the contextual, morphological, and part of speech knowledge necessary in the process of recognizing named entities in Indonesian texts. The InNER strategy is one of knowledge engineering: the domain and language specific rules are designed by expert knowledge engineers. After showing in our previous work that mined association rules can effectively recognize named entities and outperform maximum entropy methods, we needed to evaluate the potential for improvement to the rule based approach when expert crafted knowledge is used. The results are conclusive: the InNER method yields recall and precision of up to 63.43% and 71.84%, respectively. Thus, it significantly outperforms not only maximum entropy methods but also the association rule based method we had previously designed.
UR - http://www.scopus.com/inward/record.url?scp=33745292768&partnerID=8YFLogxK
U2 - 10.1007/11563983_7
DO - 10.1007/11563983_7
M3 - Conference contribution
AN - SCOPUS:33745292768
SN - 3540292306
SN - 9783540292302
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 57
EP - 69
BT - Discovery Science - 8th International Conference, DS 2005, Proceedings
Y2 - 8 October 2005 through 11 October 2005
ER -