TY - JOUR
T1 - Enhancer prediction with histone modification marks using a hybrid neural network model
AU - Lim, A.
AU - Lim, Sangsoo
AU - Kim, Sun
N1 - Publisher Copyright:
© 2019 Elsevier Inc.
PY - 2019/8/15
Y1 - 2019/8/15
N2 - Enhancer is a DNA sequence of a genome that controls transcription of downstream target genes. Enhancers are known to be associated with certain epigenetic signatures. Machine learning tools, such as CSI-ANN, ChromHMM, and RFECS, were developed for predicting enhancers using various epigenetic features. However, predictions by different tools vary widely and quite a significant portion of enhancer predictions does not agree. Thus, computational methods for enhancer prediction should be further developed. In this paper, a hybrid neural network called Enhancer-CRNN, a convolutional neural network (CNN) followed by a recurrent neural network (RNN), was developed and they were used to predict enhancer regions with histone modification marks as input. The CNN in our model is to reflect local characteristics and the RNN is to learn sequential dependencies among the histone marks. Hybridization of both neural networks outperformed existing prediction tools in experiments with GM12878, H1hesc, HeLaS3, and HepG2 cell lines. On average, 13–17 percent of the enhancers predicted by our method were cell type-specific. With the trained model, optimized virtual input histone marks was generated to provide a deeper insight into how histone modification marks can represent enhancer regions in which histone marks indicate active or repressed enhancers. In summary, our model produced accurate annotation of enhancers with detailed information on how histone profiles contribute to the presence of putative enhancers.
AB - Enhancer is a DNA sequence of a genome that controls transcription of downstream target genes. Enhancers are known to be associated with certain epigenetic signatures. Machine learning tools, such as CSI-ANN, ChromHMM, and RFECS, were developed for predicting enhancers using various epigenetic features. However, predictions by different tools vary widely and quite a significant portion of enhancer predictions does not agree. Thus, computational methods for enhancer prediction should be further developed. In this paper, a hybrid neural network called Enhancer-CRNN, a convolutional neural network (CNN) followed by a recurrent neural network (RNN), was developed and they were used to predict enhancer regions with histone modification marks as input. The CNN in our model is to reflect local characteristics and the RNN is to learn sequential dependencies among the histone marks. Hybridization of both neural networks outperformed existing prediction tools in experiments with GM12878, H1hesc, HeLaS3, and HepG2 cell lines. On average, 13–17 percent of the enhancers predicted by our method were cell type-specific. With the trained model, optimized virtual input histone marks was generated to provide a deeper insight into how histone modification marks can represent enhancer regions in which histone marks indicate active or repressed enhancers. In summary, our model produced accurate annotation of enhancers with detailed information on how histone profiles contribute to the presence of putative enhancers.
KW - Convolutional neural network
KW - Enhancer
KW - Histone modification mark
KW - Recurrent neural network
UR - http://www.scopus.com/inward/record.url?scp=85063315703&partnerID=8YFLogxK
U2 - 10.1016/j.ymeth.2019.03.014
DO - 10.1016/j.ymeth.2019.03.014
M3 - Article
C2 - 30905748
AN - SCOPUS:85063315703
SN - 1046-2023
VL - 166
SP - 48
EP - 56
JO - Methods
JF - Methods
ER -