TY - GEN
T1 - Toward advice mining
T2 - 22nd ACM International Conference on Information and Knowledge Management, CIKM 2013
AU - Wicaksono, Alfan Farizki
AU - Myaeng, Sung Hyon
PY - 2013
Y1 - 2013
N2 - Web forums are platforms for personal communications on sharing information with others. Such information is often expressed in the form of advice. In this paper, we address the problem of advice-revealing text unit (ATU) extraction from online forums due to its usefulness in travel domain. We represent advice as a two-tuple comprising an advice-revealing sentence and its context sentences. To extract the advice-revealing sentences, we propose to define the task as a sequence labeling problem, using three different types of features: syntactic, contextual, and semantic features. To extract the context sentences, we propose to use a 2 Dimensional CRF (2D-CRF) model, which gives the best performance compared to traditional machine learning models. Finally, we present a solution to the integrated problem of extracting both advice-revealing sentences and their respective context sentences at the same time using our proposed models, i.e., Multiple Linear CRF (ML-CRF) and 2 Dimensional CRF Plus (2D-CRF+). The experimental results show that ML-CRF performs better than any other models studied in this paper for extracting advice-revealing sentences and context sentences.
AB - Web forums are platforms for personal communications on sharing information with others. Such information is often expressed in the form of advice. In this paper, we address the problem of advice-revealing text unit (ATU) extraction from online forums due to its usefulness in travel domain. We represent advice as a two-tuple comprising an advice-revealing sentence and its context sentences. To extract the advice-revealing sentences, we propose to define the task as a sequence labeling problem, using three different types of features: syntactic, contextual, and semantic features. To extract the context sentences, we propose to use a 2 Dimensional CRF (2D-CRF) model, which gives the best performance compared to traditional machine learning models. Finally, we present a solution to the integrated problem of extracting both advice-revealing sentences and their respective context sentences at the same time using our proposed models, i.e., Multiple Linear CRF (ML-CRF) and 2 Dimensional CRF Plus (2D-CRF+). The experimental results show that ML-CRF performs better than any other models studied in this paper for extracting advice-revealing sentences and context sentences.
KW - Advice mining
KW - Conditional random field
KW - Sequence labeling
UR - http://www.scopus.com/inward/record.url?scp=84889577686&partnerID=8YFLogxK
U2 - 10.1145/2505515.2505520
DO - 10.1145/2505515.2505520
M3 - Conference contribution
AN - SCOPUS:84889577686
SN - 9781450322638
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 2039
EP - 2048
BT - CIKM 2013 - Proceedings of the 22nd ACM International Conference on Information and Knowledge Management
Y2 - 27 October 2013 through 1 November 2013
ER -