TY - GEN
T1 - Wikidata completeness profiling using ProWD
AU - Wisesa, Avicenna
AU - Darari, Fariz
AU - Krisnadhi, Adila
AU - Nutt, Werner
AU - Razniewski, Simon
PY - 2019/9/23
Y1 - 2019/9/23
N2 - Completeness is a crucial data quality aspect that deals with the question: do we have all the data we need? The lack of awareness on the completeness state of a knowledge graph (KG) may result in bias or even falsity for any decisions made based on the KG. Given a KG, one may be wondering how its completeness may vary across different topics. In this paper, we present ProWD, a framework and tool for profiling the completeness of Wikidata, a central KG on the (Semantic) Web that is open and free to use. ProWD measures the degree of completeness based on the Class-Facet-Attribute (CFA) profiles. A class denotes a collection of entities, which can be of multiple facets, allowing attribute completeness to be analyzed and compared, e.g., how does the completeness of the attribute "educated at" and "date of birth" compare between male, German computer scientists, and female, Indonesian computer scientists? ProWD generates summaries and visualizations for such analysis, giving insights into the KG completeness. ProWD is available online at∼\urlhttp://prowd.id.
AB - Completeness is a crucial data quality aspect that deals with the question: do we have all the data we need? The lack of awareness on the completeness state of a knowledge graph (KG) may result in bias or even falsity for any decisions made based on the KG. Given a KG, one may be wondering how its completeness may vary across different topics. In this paper, we present ProWD, a framework and tool for profiling the completeness of Wikidata, a central KG on the (Semantic) Web that is open and free to use. ProWD measures the degree of completeness based on the Class-Facet-Attribute (CFA) profiles. A class denotes a collection of entities, which can be of multiple facets, allowing attribute completeness to be analyzed and compared, e.g., how does the completeness of the attribute "educated at" and "date of birth" compare between male, German computer scientists, and female, Indonesian computer scientists? ProWD generates summaries and visualizations for such analysis, giving insights into the KG completeness. ProWD is available online at∼\urlhttp://prowd.id.
KW - Data completeness
KW - Data profiling
KW - Rdf
KW - Sparql
KW - Wikidata
UR - http://www.scopus.com/inward/record.url?scp=85077265181&partnerID=8YFLogxK
U2 - 10.1145/3360901.3364425
DO - 10.1145/3360901.3364425
M3 - Conference contribution
T3 - K-CAP 2019 - Proceedings of the 10th International Conference on Knowledge Capture
SP - 123
EP - 130
BT - K-CAP 2019 - Proceedings of the 10th International Conference on Knowledge Capture
PB - Association for Computing Machinery, Inc
T2 - 10th International Conference on Knowledge Capture, K-CAP 2019
Y2 - 19 November 2019 through 21 November 2019
ER -