TY - JOUR
T1 - MTS-CNN
T2 - Multi-task semantic segmentation-convolutional neural network for detecting crops and weeds
AU - Kim, Yu Hwan
AU - Park, Kang Ryoung
N1 - Publisher Copyright:
© 2022 The Author(s)
PY - 2022/8
Y1 - 2022/8
N2 - Research is being extensively conducted on using deep learning in the field of crop and weed segmentation based on images captured with a camera. However, the segmentation performance for various crops and weeds varies significantly, implying that certain classes of crops or weeds are not being detected properly. This problem may also occur in the loss calculations used in crop and weed segmentation. In previous studies, the cross-entropy loss (corresponding to a distribution loss) and dice loss (using spatial information) have been widely used. However, such losses lead to large discrepancies in crop and weed segmentation performance, as the correlations between crop and weed classes are not considered. In order to solve these problems, this study proposes multi-task semantic segmentation-convolutional neural network for detecting crops and weeds (MTS-CNN) using one-stage training. This approach adds the crop, weed, and both (crop and weed) losses to heighten the correlations between the crop and weed classes, and designs the model so that the object (crop and weed) region is trained intensively. In experiments conducted using three types of open databases - the BoniRob dataset, a crop/weed field image dataset (CWFID), and rice seedling and weed dataset - the mean intersection of union (MIOU) values of the segmentation for the crops and weeds in the MTS-CNN are 0.9164, 0.8372, and 0.8260, respectively. Thus, the results indicate higher accuracy from the proposed approach than from the state-of-the-art methods.
AB - Research is being extensively conducted on using deep learning in the field of crop and weed segmentation based on images captured with a camera. However, the segmentation performance for various crops and weeds varies significantly, implying that certain classes of crops or weeds are not being detected properly. This problem may also occur in the loss calculations used in crop and weed segmentation. In previous studies, the cross-entropy loss (corresponding to a distribution loss) and dice loss (using spatial information) have been widely used. However, such losses lead to large discrepancies in crop and weed segmentation performance, as the correlations between crop and weed classes are not considered. In order to solve these problems, this study proposes multi-task semantic segmentation-convolutional neural network for detecting crops and weeds (MTS-CNN) using one-stage training. This approach adds the crop, weed, and both (crop and weed) losses to heighten the correlations between the crop and weed classes, and designs the model so that the object (crop and weed) region is trained intensively. In experiments conducted using three types of open databases - the BoniRob dataset, a crop/weed field image dataset (CWFID), and rice seedling and weed dataset - the mean intersection of union (MIOU) values of the segmentation for the crops and weeds in the MTS-CNN are 0.9164, 0.8372, and 0.8260, respectively. Thus, the results indicate higher accuracy from the proposed approach than from the state-of-the-art methods.
KW - Combined cross entropy and dice losses based on class imbalance weight
KW - MTS-CNN
KW - Object loss
KW - Semantic segmentation for crop and weed
UR - http://www.scopus.com/inward/record.url?scp=85132782632&partnerID=8YFLogxK
U2 - 10.1016/j.compag.2022.107146
DO - 10.1016/j.compag.2022.107146
M3 - Article
AN - SCOPUS:85132782632
SN - 0168-1699
VL - 199
JO - Computers and Electronics in Agriculture
JF - Computers and Electronics in Agriculture
M1 - 107146
ER -