TY - GEN
T1 - Semantic keyword selection for automatic video annotation
AU - Imran, Ali Shariq
AU - Rahadianti, Laksmita
AU - Cheikh, Faouzi Alaya
AU - Yayilgan, Sule Yildirim
PY - 2013
Y1 - 2013
N2 - Choosing descriptive keywords to best describe digital media content is crucial for many applications, especially those involving content-based indexing or retrieval. Traditionally such keywords are selected manually, which is labor intensive, restrictive to a limited set of words and inherently subjective to the annotator. Therefore, in this paper, we propose an automatic and objective keyword selection method for annotating video. We specifically used lecture videos and surrogate documents, e.g. transcripts, to extract potential candidate keywords. These potential keywords are then filtered based on a set of seed words to select fewer but more descriptive keywords. The seed words are extracted from the title of the video and subject category. We propose a new objective method to select top ranking keywords based on visual similarity and word sense disambiguation. To validate this approach, the selected keywords are compared to subjectively selected keywords obtained experimentally. Furthermore, the proposed ranking method is also compared to traditional term frequency inverse document frequency (TF-IDF) and state of the art latent dirichlet allocation (LDA) method. The obtained results show that the words selected by the proposed objective method correlate highly with those selected by viewers. In general, the proposed method performs better than TF-IDF and LDA.
AB - Choosing descriptive keywords to best describe digital media content is crucial for many applications, especially those involving content-based indexing or retrieval. Traditionally such keywords are selected manually, which is labor intensive, restrictive to a limited set of words and inherently subjective to the annotator. Therefore, in this paper, we propose an automatic and objective keyword selection method for annotating video. We specifically used lecture videos and surrogate documents, e.g. transcripts, to extract potential candidate keywords. These potential keywords are then filtered based on a set of seed words to select fewer but more descriptive keywords. The seed words are extracted from the title of the video and subject category. We propose a new objective method to select top ranking keywords based on visual similarity and word sense disambiguation. To validate this approach, the selected keywords are compared to subjectively selected keywords obtained experimentally. Furthermore, the proposed ranking method is also compared to traditional term frequency inverse document frequency (TF-IDF) and state of the art latent dirichlet allocation (LDA) method. The obtained results show that the words selected by the proposed objective method correlate highly with those selected by viewers. In general, the proposed method performs better than TF-IDF and LDA.
KW - Automatic
KW - Descriptive
KW - LVD-F
KW - Objective
KW - Semantic keyword selection
KW - TFIDF
KW - Video annotation
UR - http://www.scopus.com/inward/record.url?scp=84894213742&partnerID=8YFLogxK
U2 - 10.1109/SITIS.2013.49
DO - 10.1109/SITIS.2013.49
M3 - Conference contribution
AN - SCOPUS:84894213742
SN - 9781479932115
T3 - Proceedings - 2013 International Conference on Signal-Image Technology and Internet-Based Systems, SITIS 2013
SP - 241
EP - 246
BT - Proceedings - 2013 International Conference on Signal-Image Technology and Internet-Based Systems, SITIS 2013
T2 - 2013 9th International Conference on Signal-Image Technology and Internet-Based Systems, SITIS 2013
Y2 - 2 December 2013 through 5 December 2013
ER -