Automated classification of schools of the silver cyprinid Rastrineobola argentea in Lake Victoria acoustic survey data using random forests
Author(s) -
Roland Proud,
Richard Mangeni-Sande,
Robert Kayanda,
Martin J. Cox,
Chrisphine S. Nyamweya,
Collins Ongore,
Vianny Natugonza,
Inigo Everson,
Mboni Elison,
Laura Hobbs,
Benedicto Boniphace Kashindye,
Enock Mlaponi,
Anthony TaabuMunyaho,
Venny Mziri Mwainge,
Esther Kagoya,
Antonio Pegado,
Evarist Nduwayesu,
Andrew S. Brierley
Publication year - 2020
Publication title -
ices journal of marine science
Language(s) - English
Resource type - Journals
eISSN - 1095-9289
pISSN - 1054-3139
DOI - 10.1093/icesjms/fsaa052
Subject(s) - random forest , fishing , classifier (uml) , fishery , biomass (ecology) , environmental science , geography , hydrology (agriculture) , ecology , computer science , machine learning , geology , artificial intelligence , biology , geotechnical engineering
Biomass of the schooling fish Rastrineobola argentea (dagaa) is presently estimated in Lake Victoria by acoustic survey following the simple “rule” that dagaa is the source of most echo energy returned from the top third of the water column. Dagaa have, however, been caught in the bottom two-thirds, and other species occur towards the surface: a more robust discrimination technique is required. We explored the utility of a school-based random forest (RF) classifier applied to 120 kHz data from a lake-wide survey. Dagaa schools were first identified manually using expert opinion informed by fishing. These schools contained a lake-wide biomass of 0.68 million tonnes (MT). Only 43.4% of identified dagaa schools occurred in the top third of the water column, and 37.3% of all schools in the bottom two-thirds were classified as dagaa. School metrics (e.g. length, echo energy) for 49 081 manually classified dagaa and non-dagaa schools were used to build an RF school classifier. The best RF model had a classification test accuracy of 85.4%, driven largely by school length, and yielded a biomass of 0.71 MT, only c. 4% different from the manual estimate. The RF classifier offers an efficient method to generate a consistent dagaa biomass time series.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom