TY - JOUR
T1 - Feature Extraction With Genetic Programming for Root Cause Identification in Manufacturing With Interpretable Machine Learning
AU - Lee, Chan Gyu
AU - Jun, Sungbum
N1 - Publisher Copyright:
© 1997-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - For fault detection (FD) in manufacturing, various machine learning (ML) models have been widely applied to minimize human intervention and improve detection performance. Even though ML models, such as neural networks (NNs), have been shown to identify faults effectively, root cause identification (RCI) is becoming more difficult due to their black-box structures and the tradeoff between accuracy and interpretability. In order to improve performance while maintaining interpretability, we propose a new framework named Feature Extraction for finding Root causes for Manufacturing Applications with Tree-based algorithms (FERMAT), which enhances the performance of height-limited decision trees (DTs) (C4.5) through dimensionally aware genetic programming for feature extraction. Especially in FERMAT, only interpretable features are extracted to prevent DTs from delivering uninterpretable expressions to practitioners. In the present study, FERMAT’s applicability to RCI was verified with both manufacturing and nonmanufacturing datasets with different imbalance ratios. The experimental results showed that FERMAT outperformed the other single-tree-based models by extracting good features and delivered performance comparable to the black-box models.
AB - For fault detection (FD) in manufacturing, various machine learning (ML) models have been widely applied to minimize human intervention and improve detection performance. Even though ML models, such as neural networks (NNs), have been shown to identify faults effectively, root cause identification (RCI) is becoming more difficult due to their black-box structures and the tradeoff between accuracy and interpretability. In order to improve performance while maintaining interpretability, we propose a new framework named Feature Extraction for finding Root causes for Manufacturing Applications with Tree-based algorithms (FERMAT), which enhances the performance of height-limited decision trees (DTs) (C4.5) through dimensionally aware genetic programming for feature extraction. Especially in FERMAT, only interpretable features are extracted to prevent DTs from delivering uninterpretable expressions to practitioners. In the present study, FERMAT’s applicability to RCI was verified with both manufacturing and nonmanufacturing datasets with different imbalance ratios. The experimental results showed that FERMAT outperformed the other single-tree-based models by extracting good features and delivered performance comparable to the black-box models.
KW - Decision tree (DT)
KW - dimensional awareness
KW - feature extraction
KW - genetic programming (GP)
KW - interpretability
KW - machine learning (ML)
UR - https://www.scopus.com/pages/publications/85190734713
U2 - 10.1109/TEVC.2024.3388725
DO - 10.1109/TEVC.2024.3388725
M3 - Article
AN - SCOPUS:85190734713
SN - 1089-778X
VL - 29
SP - 1029
EP - 1040
JO - IEEE Transactions on Evolutionary Computation
JF - IEEE Transactions on Evolutionary Computation
IS - 4
ER -