Methods of training set construction: Towards improving performance for automated mesozooplankton image classification systems

Chang,  Chun-Yi;Ho,  Pei-Chi;Sastri,  Akash R.;Lee,  Yu-Ching;Gong,  Gwo-Ching;Hsieh,  Chih-hao; Chang, C.-Y. and Ho, P.-C. and Sastri, A.R. and Lee, Y.-C. and Gong, G.-C. and Hsieh, C.-H.

標題:	Methods of training set construction: Towards improving performance for automated mesozooplankton image classification systems
作者:	Chang, Chun-Yi CHIH-HAO HSIEH Ho, Pei-Chi Sastri, Akash R. Lee, Yu-Ching Gong, Gwo-Ching Hsieh, Chih-hao
關鍵字:	Balanced training; Category-specific accuracy; Global training set; Hydrographic heterogeneity; Water mass-specific training set; ZooScan
公開日期:	2012
卷:	36
起(迄)頁:	19-28
來源出版物:	Continental Shelf Research
摘要:	The correspondence between variation in the physico-chemical properties of the water column and the taxonomic composition of zooplankton communities represents an important indicator of long-term and broad-scale change in marine systems. Evaluating and relating compositional change to various forms of perturbation demand routine taxonomic identification methods that can be applied rapidly and accurately. Traditional identification by human experts is accurate but very time-consuming. The application of automated image classificatio mitation. The objective of this study is to evaluate how specific aspects of training set construction for the ZooScan system influenced our ability to relate variation in zooplankton taxonomic composition to variation of hydrographic properties in the East China Sea. Specifically, we compared the relative utility of zooplankton classifiers trained with the following: (i) water mass-specific and global training sets; (ii) balanced versus imbalanced training sets. The classification performance (accuracy and precision) of water-mass specific classifiers tended to decline with environmental dissimilarity, suggesting water-mass specificity However, similar classification performance was also achieved by training our system with samples representing all hydrographic sub-regions (i.e. a global classifier). After examining category-specific accuracy, we found that equal performance arises because the accuracy was mainly determined by dominant taxa. This apparently high classification accuracy was at the expense of accurate classification of rare taxa. To explore the basis for such biased classification, we trained our global classifier with an equal amount of training data for each category (balanced training). We found that balanced training had higher accuracy at recognizing rare taxa but low accuracy at abundant taxa. The errors introduced in recognition still pose a major challenge for automatic classification systems. In order to fully automate analyses of zooplankton communities and relate variation in composition to hydrographic properties, the recognition power of the system requires further improvements. © 2012 Elsevier Ltd.
URI:	http://www.scopus.com/inward/record.url?eid=2-s2.0-84862819845&partnerID=MN8TOARS http://scholars.lib.ntu.edu.tw/handle/123456789/370706
DOI:	10.1016/j.csr.2012.01.005
SDG/關鍵字:	accuracy assessment; automation; community composition; hydrography; image classification; marine ecosystem; performance assessment; physicochemical property; taxonomy; training; water column; water mass; zooplankton; East China Sea; Pacific Ocean
顯示於：	海洋研究所

顯示文件完整紀錄

SCOPUS^TM
Citations

checked on 2023/12/27

WEB OF SCIENCE^TM
Citations

checked on 2024/2/13

Page view(s)

105

checked on 2024/4/20

Google Scholar^TM

檢查

Altmetric

TAIR相關文章

SCOPUSTM Citations

WEB OF SCIENCETM Citations

Page view(s)

Google ScholarTM

Altmetric

Altmetric

SCOPUS^TM
Citations

WEB OF SCIENCE^TM
Citations

Google Scholar^TM