TY - GEN
T1 - OntoBiodiv
T2 - 5th International Conference on Networking, Information Systems and Security, NISS 2022
AU - Saleh, Dadan Ridwan
AU - Kartika, Yulia Aris
AU - Akbar, Zaenal
AU - Krisnadhi, Adila Alfa
AU - Manik, Lindung Parningotan
N1 - Funding Information:
This work was supported by the Research Center for Informatics, National Research and Innovation Agency, Indonesia, in collaboration with the Faculty of Computer Science, Universitas Indonesia, through the By Research Degree Programme.
Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Biodiversity data can be produced from preserved specimens where multiple pieces of information (e.g., taxonomic identification) will be extracted from biological samples or materials. Another approach, observation-based, collects data digitally without actual biological samples or materials. The latter approach has produced much more data compared to the first one. However, with recent technological developments, the tangible samples or materials preserved by the first approach have become gold mines because they opened more opportunities for scientific discovery. For example, a new method for genomic investigation can be performed on specimens collected a decade ago. However, this new investigation will only be possible with preserved specimens. Therefore, it is necessary to shift the focus of biodiversity data collection to the specimens-oriented. Unfortunately, most of the current biodiversity data standards cover specimens minimally. This work proposes a schema to extend an existing biodiversity data standard (i.e., Darwin Core) where specimens are the core. The extension covers a variety of data properties of specimens, including the generalization of multiple kinds of information that can be obtained by extracting from specimens. Comparing the coverage ratio and matching scores with the existing one reveals the superiority of the proposed schema. The evaluation results show that the proposed schema covers up to 80% higher and has the utmost exact match scores for specimen-based biodiversity data. This work initiates our effort to reconnect biodiversity data to specimens.
AB - Biodiversity data can be produced from preserved specimens where multiple pieces of information (e.g., taxonomic identification) will be extracted from biological samples or materials. Another approach, observation-based, collects data digitally without actual biological samples or materials. The latter approach has produced much more data compared to the first one. However, with recent technological developments, the tangible samples or materials preserved by the first approach have become gold mines because they opened more opportunities for scientific discovery. For example, a new method for genomic investigation can be performed on specimens collected a decade ago. However, this new investigation will only be possible with preserved specimens. Therefore, it is necessary to shift the focus of biodiversity data collection to the specimens-oriented. Unfortunately, most of the current biodiversity data standards cover specimens minimally. This work proposes a schema to extend an existing biodiversity data standard (i.e., Darwin Core) where specimens are the core. The extension covers a variety of data properties of specimens, including the generalization of multiple kinds of information that can be obtained by extracting from specimens. Comparing the coverage ratio and matching scores with the existing one reveals the superiority of the proposed schema. The evaluation results show that the proposed schema covers up to 80% higher and has the utmost exact match scores for specimen-based biodiversity data. This work initiates our effort to reconnect biodiversity data to specimens.
KW - biodiversity data
KW - data standard
KW - ontology
KW - specimens
UR - http://www.scopus.com/inward/record.url?scp=85153408377&partnerID=8YFLogxK
U2 - 10.1109/NISS55057.2022.10085505
DO - 10.1109/NISS55057.2022.10085505
M3 - Conference contribution
AN - SCOPUS:85153408377
T3 - Proceedings of the 5th International Conference on Networking, Information Systems and Security: Envisage Intelligent Systems in 5G/6G-Based Interconnected Digital Worlds, NISS 2022
BT - Proceedings of the 5th International Conference on Networking, Information Systems and Security
A2 - Kwintiana, Bernadetta
A2 - Pardede, Hilman F.
A2 - Supianto, Ahmad Afif
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 30 March 2022 through 31 March 2022
ER -