Premium
Cloud architecture for plant phenotyping research
Author(s) -
Debauche Olivier,
Mahmoudi Sidi Ahmed,
De Cock Nicolas,
Mahmoudi Saïd,
Manneback Pierre,
Lebeau Frédéric
Publication year - 2020
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.5661
Subject(s) - bottleneck , cloud computing , architecture , computer science , robustness (evolution) , distributed computing , big data , database , computer architecture , data science , data mining , operating system , embedded system , biology , art , biochemistry , visual arts , gene
Summary Digital phenotyping is an emergent science mainly based on imagery techniques. The tremendous amount of data generated needs important cloud computing for their processing. The coupling of recent advance of distributed databases and cloud computing offers new possibilities of big data management and data sharing for the scientific research. In this paper, we present a solution combining a lambda architecture built around Apache Druid and a hosting platform leaning on Apache Mesos. Lambda architecture has already proved its performance and robustness. However, the capacity of ingesting and requesting of the database is essential and can constitute a bottleneck for the architecture, in particular, for in terms of availability and response time of data. We focused our experimentation on the response time of different databases to choose the most adapted for our phenotyping architecture. Apache Druid has shown its ability to respond to typical queries of phenotyping applications in times generally inferior to the second.