TY - JOUR
T1 - Cancer subtype classification and modeling by pathway attention and propagation
AU - Lee, Sangseon
AU - Lim, Sangsoo
AU - Lee, Taeheon
AU - Sung, Inyoung
AU - Kim, Sun
N1 - Publisher Copyright:
© 2020 The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: [email protected].
PY - 2020/3/31
Y1 - 2020/3/31
N2 - Motivation: Biological pathway is an important curated knowledge of biological processes. Thus, cancer subtype classification based on pathways will be very useful to understand differences in biological mechanisms among cancer subtypes. However, pathways include only a fraction of the entire gene set, only one-third of human genes in KEGG, and pathways are fragmented. For this reason, there are few computational methods to use pathways for cancer subtype classification. Results: We present an explainable deep-learning model with attention mechanism and network propagation for cancer subtype classification. Each pathway is modeled by a graph convolutional network. Then, a multi-attention-based ensemble model combines several hundreds of pathways in an explainable manner. Lastly, network propagation on pathway-gene network explains why gene expression profiles in subtypes are different. In experiments with five TCGA cancer datasets, our method achieved very good classification accuracies and, additionally, identified subtype-specific pathways and biological functions.
AB - Motivation: Biological pathway is an important curated knowledge of biological processes. Thus, cancer subtype classification based on pathways will be very useful to understand differences in biological mechanisms among cancer subtypes. However, pathways include only a fraction of the entire gene set, only one-third of human genes in KEGG, and pathways are fragmented. For this reason, there are few computational methods to use pathways for cancer subtype classification. Results: We present an explainable deep-learning model with attention mechanism and network propagation for cancer subtype classification. Each pathway is modeled by a graph convolutional network. Then, a multi-attention-based ensemble model combines several hundreds of pathways in an explainable manner. Lastly, network propagation on pathway-gene network explains why gene expression profiles in subtypes are different. In experiments with five TCGA cancer datasets, our method achieved very good classification accuracies and, additionally, identified subtype-specific pathways and biological functions.
UR - https://www.scopus.com/pages/publications/85087321274
U2 - 10.1093/bioinformatics/btaa203
DO - 10.1093/bioinformatics/btaa203
M3 - Article
C2 - 32207514
AN - SCOPUS:85087321274
SN - 1367-4803
VL - 36
SP - 3818
EP - 3824
JO - Bioinformatics
JF - Bioinformatics
IS - 12
ER -