TY - GEN
T1 - Digestive Organ Recognition in Video Capsule Endoscopy Based on Temporal Segmentation Network
AU - Shin, Yejee
AU - Eo, Taejoon
AU - Rha, Hyeongseop
AU - Oh, Dong Jun
AU - Son, Geonhui
AU - An, Jiwoong
AU - Kim, You Jin
AU - Hwang, Dosik
AU - Lim, Yun Jeong
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - The interpretation of video capsule endoscopy (VCE) usually takes more than an hour, which can be a tedious process for clinicians. To shorten the reading time of VCE, algorithms that automatically detect lesions in the small bowel are being actively developed, however, it is still necessary for clinicians to manually mark anatomic transition points in VCE. Therefore, anatomical temporal segmentation must first be performed automatically at the full-length VCE level for the fully automated reading. This study aims to develop an automated organ recognition method in VCE based on a temporal segmentation network. For temporal locating and classifying organs including the stomach, small bowel, and colon in long untrimmed videos, we use MS-TCN++ model containing temporal convolution layers. To improve temporal segmentation performance, a hybrid model of two state-of-the-art feature extraction models (i.e., TimeSformer and I3D) is used. Extensive experiments showed the effectiveness of the proposed method in capturing long-range dependencies and recognizing temporal segments of organs. For training and validation of the proposed model, the dataset of 200 patients (100 normal and 100 abnormal VCE) was used. For the test set of 40 patients (20 normal and 20 abnormal VCE), the proposed method showed accuracy of 96.15, F1-score@{50,75,90} of {96.17, 93.61, 86.80}, and segmental edit distance of 95.83 in the three-class classification of organs including the stomach, small bowel, and colon in the full-length VCE.
AB - The interpretation of video capsule endoscopy (VCE) usually takes more than an hour, which can be a tedious process for clinicians. To shorten the reading time of VCE, algorithms that automatically detect lesions in the small bowel are being actively developed, however, it is still necessary for clinicians to manually mark anatomic transition points in VCE. Therefore, anatomical temporal segmentation must first be performed automatically at the full-length VCE level for the fully automated reading. This study aims to develop an automated organ recognition method in VCE based on a temporal segmentation network. For temporal locating and classifying organs including the stomach, small bowel, and colon in long untrimmed videos, we use MS-TCN++ model containing temporal convolution layers. To improve temporal segmentation performance, a hybrid model of two state-of-the-art feature extraction models (i.e., TimeSformer and I3D) is used. Extensive experiments showed the effectiveness of the proposed method in capturing long-range dependencies and recognizing temporal segments of organs. For training and validation of the proposed model, the dataset of 200 patients (100 normal and 100 abnormal VCE) was used. For the test set of 40 patients (20 normal and 20 abnormal VCE), the proposed method showed accuracy of 96.15, F1-score@{50,75,90} of {96.17, 93.61, 86.80}, and segmental edit distance of 95.83 in the three-class classification of organs including the stomach, small bowel, and colon in the full-length VCE.
KW - Organ recognition
KW - Temporal convolutional networks
KW - Temporal segmentation
KW - Video Capsule Endoscopy
UR - http://www.scopus.com/inward/record.url?scp=85139073357&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-16449-1_14
DO - 10.1007/978-3-031-16449-1_14
M3 - Conference contribution
AN - SCOPUS:85139073357
SN - 9783031164484
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 136
EP - 146
BT - Medical Image Computing and Computer Assisted Intervention – MICCAI 2022 - 25th International Conference, Proceedings
A2 - Wang, Linwei
A2 - Dou, Qi
A2 - Fletcher, P. Thomas
A2 - Speidel, Stefanie
A2 - Li, Shuo
PB - Springer Science and Business Media Deutschland GmbH
T2 - 25th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2022
Y2 - 18 September 2022 through 22 September 2022
ER -