Premium
NoSQL Data Model for Semi‐automatic Integration of Ethnomedicinal Plant Data from Multiple Sources
Author(s) -
Ningthoujam Sanjoy Singh,
Choudhury Manabendra Dutta,
Potsangbam Kumar Singh,
Chetia Pankaj,
Nahar Lutfun,
Sarker Satyajit D.,
Basar Norazah,
Talukdar Anupam Das
Publication year - 2014
Publication title -
phytochemical analysis
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.574
H-Index - 72
eISSN - 1099-1565
pISSN - 0958-0344
DOI - 10.1002/pca.2520
Subject(s) - nosql , computer science , schema (genetic algorithms) , data sharing , database , information retrieval , scalability , data model (gis) , data science , data mining , medicine , alternative medicine , pathology , artificial intelligence
ABSTRACT Introduction Sharing traditional knowledge with the scientific community could refine scientific approaches to phytochemical investigation and conservation of ethnomedicinal plants. As such, integration of traditional knowledge with scientific data using a single platform for sharing is greatly needed. However, ethnomedicinal data are available in heterogeneous formats, which depend on cultural aspects, survey methodology and focus of the study. Phytochemical and bioassay data are also available from many open sources in various standards and customised formats. Objective To design a flexible data model that could integrate both primary and curated ethnomedicinal plant data from multiple sources. Materials and methods The current model is based on MongoDB, one of the Not only Structured Query Language (NoSQL) databases. Although it does not contain schema, modifications were made so that the model could incorporate both standard and customised ethnomedicinal plant data format from different sources. Results The model presented can integrate both primary and secondary data related to ethnomedicinal plants. Accommodation of disparate data was accomplished by a feature of this database that supported a different set of fields for each document. It also allowed storage of similar data having different properties. Conclusion The model presented is scalable to a highly complex level with continuing maturation of the database, and is applicable for storing, retrieving and sharing ethnomedicinal plant data. It can also serve as a flexible alternative to a relational and normalised database. Copyright © 2014 John Wiley & Sons, Ltd.