DeepmiRNATar: A deep learning-based model for miRNA targets prediction
Abstract
MicroRNAs (miRNAs) play a crucial role in regulating fundamental biological processes such as the cell cycle, differentiation, and apoptosis by directly interacting with multiple genes (mRNAs). This regulatory mechanism has a profound impact on cellular function and the overall physiological condition of an organism. However, the prediction of miRNA-mRNA interactions encounters computational challenges in the field of biology due to the diverse sequences and complex data patterns. To overcome these obstacles, this research effort introduced DeepmiRNATar, a tool designed to precisely pinpoint miRNA targets, offering essential assistance in the realm of disease management. DeepmiRNATar leverages the Word2vec-based DeepLncLoc approach for encoding miRNA sequence characteristics and utilizes the DNABERT pre-trained model for in-depth semantic comprehension of target sequences. Through the integration of TextCNN, BiLSTM, and SpatialConv Attention mechanisms, the model scrutinizes structural features, temporal relationships, and overall interactions within the sequences. Following a series of experimental assessments, DeepmiRNATar attained an impressive AUC of 99.15% on the evaluation dataset, on par with the current leading prediction methodologies. Notably, the precision-recall curve, sensitivity, and F-measure values reached 99.18%, 97.43%, and 95.47%, respectively. Compared to existing miRNA target prediction models, DeepmiRNATar demonstrates a notable enhancement in overall predictive accuracy. The successful creation and experimental validation of the DeepmiRNATar model signify a significant advancement in miRNA target identification technology.
References
1. Alshalalfa M, Alhajj R. Using context-specific effect of miRNAs to identify functional associations between miRNAs and gene signatures. BMC Bioinformatics. 2013;14 Suppl 12(Suppl 12): S1.
2. Taguchi YH. Inference of Target Gene Regulation via miRNAs during Cell Senescence by Using the MiRaGE Server. Aging Dis. 2012;3(4):301-6.
3. Plaisance-Bonstaff K, Renne R. Viral miRNAs. Antiviral RNAi: Concepts, Methods, and Applications. 2011:43-66.
4. Shaikh MAJ, Altamimi ASA, Afzal M, Gupta G, Singla N, Gilhotra R, et al. Unraveling the impact of miR-21 on apoptosis regulation in glioblastoma. Pathol Res Pract. 2024; 254:155121.
5. Ng MY, Yu C, Chen S-H, Liao YW, Lin T. Er:YAG Laser Alleviates Inflammaging in Diabetes-Associated Periodontitis via Activation CTBP1-AS2/miR-155/SIRT1 Axis. International Journal of Molecular Sciences. 2024;25.
6. Enright AJ, John B, Gaul U, Tuschl T, Sander C, Marks DS. MicroRNA targets in Drosophila. Genome Biol. 2003;5(1):R1.
7. Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005;120(1):15-20.
8. Kertesz M, Iovino N, Unnerstall U, Gaul U, Segal E. The role of site accessibility in microRNA target recognition. Nat Genet. 2007;39(10):1278-84.
9. Sturm M, Hackenberg M, Langenberger D, Frishman D. TargetSpy: a supervised machine learning approach for microRNA target prediction. BMC Bioinformatics. 2010; 11: 292.
10. Betel D, Koppal A, Agius P, Sander C, Leslie C. Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites. Genome Biology. 2010;11(8): R90.
11. Alipanahi B, Delong A, Weirauch MT, Frey BJ. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat Biotechnol. 2015;33(8):831-8.
12. Shuang C, Maozu G, Chunyu W, Xiaoyan L, Yang L, Xuejian W. MiRTDL: A Deep Learning Approach for miRNA Target Prediction. IEEE/ACM Trans Comput Biol Bioinform. 2016;13(6):1161-9.
13. Lee B, Baek J, Park S, Yoon S, editors. deepTarget: end-to-end learning framework for microRNA target prediction using deep recurrent neural networks. Proceedings of the 7th ACM international conference on bioinformatics, computational biology, and health informatics; 2016.
14. Wen M, Cong P, Zhang Z, Lu H, Li T. DeepMirTar: a deep-learning approach for predicting human miRNA targets. Bioinformatics. 2018;34(22):3781-7.
15. Pla A, Zhong X, Rayner S. miRAW: A deep learning-based approach to predict microRNA targets by analyzing whole microRNA transcripts. PLoS computational biology. 2018;14(7): e1006185.
16. Gu T, Zhao X, Barbazuk WB, Lee JH. miTAR: a hybrid deep learning-based approach for predicting miRNA targets. BMC Bioinformatics. 2021;22(1):96.
17. Yang T, Wang Y, He Y. TEC-miTarget: enhancing microRNA target prediction based on deep learning of ribonucleic acid sequences. BMC bioinformatics. 2024;25(1):159.
18. Gallego A, Melé M, Balcells I, García-Ramallo E, Torruella-Loran I, Fernández-Bellon H, et al. Functional Implications of Human-Specific Changes in Great Ape microRNAs. PLoS One. 2016;11(4): e0154194.
19. Moore MJ, Scheel TK, Luna JM, Park CY, Fak JJ, Nishiuchi E, et al. miRNA–target chimeras reveal miRNA 3′-end pairing as a major determinant of Argonaute target specificity. Nature communications. 2015;6(1):8864.
20. Kim D, Sung YM, Park J, Kim S, Kim J, Park J, et al. General rules for functional microRNA targeting. Nat Genet. 2016;48(12):1517-26.
21. Vlachos IS, Paraskevopoulou MD, Karagkouni D, Georgakilas G, Vergoulis T, Kanellos I, et al. DIANA-TarBase v7. 0: indexing more than half a million experimentally supported miRNA: mRNA interactions. Nucleic acids research. 2015;43(D1): D153-D9.
22. Chou C-H, Chang N-W, Shrestha S, Hsu S-D, Lin Y-L, Lee W-H, et al. miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database. Nucleic acids research. 2016;44(D1): D239-D47.
23. Hafner M, Landthaler M, Burger L, Khorshid M, Hausser J, Berninger P, et al. Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell. 2010;141(1):129-41.
24. Ding J, Li X, Hu H. TarPmiR: a new approach for microRNA target site prediction. Bioinformatics. 2016;32(18):2768-75.
25. Wang X. Improving microRNA target prediction by modeling with unambiguously identified microRNA-target pairs from CLIP-ligation studies. Bioinformatics. 2016;32(9):1316-22.
26. Zeng M, Wu Y, Lu C, Zhang F, Wu F-X, Li M. DeepLncLoc: a deep learning framework for long non-coding RNA subcellular localization prediction based on subsequence embedding. Briefings in Bioinformatics. 2022;23(1): bbab360.
27. Edgar R. Syncmers are more sensitive than minimizers for selecting conserved k‑mers in biological sequences. PeerJ. 2021;9: e10805.
28. Marçais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011;27(6):764-70.
29. Ji Y, Zhou Z, Liu H, Davuluri RV. DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome. Bioinformatics. 2021;37(15):2112-20.
30. Chen Y. Convolutional neural network for sentence classification: University of Waterloo; 2015.
31. Subakan C, Ravanelli M, Cornell S, Bronzi M, Zhong J, editors. Attention is all you need in speech separation. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2021: IEEE.
32. Shen S, Yao Z, Gholami A, Mahoney M, Keutzer K, editors. Powernorm: Rethinking batch normalization in transformers. International conference on machine learning; 2020: PMLR.
33. Duan H, Chen B, Wang W, Luo H. Identification of GNG7 as a novel biomarker and potential therapeutic target for gastric cancer via bioinformatic analysis and in vitro experiments. Aging (Albany NY). 2023;15(5):1445.
34. Singh D, Khan MA, Siddique HR. Role of epigenetic drugs in sensitizing cancers to anticancer therapies: emerging trends and clinical advancements. Epigenomics. 2023;15(8):517-37.
35. Pan X, Zhang W, Wang L, Guo H, Zheng M, Wu H, et al. KLF12 transcriptionally regulates PD‐L1 expression in non‐small cell lung cancer. Molecular Oncology. 2023;17(12):2659-74.
36. Kozomara A, Griffiths-Jones S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic acids research. 2014;42(D1): D68-D73.
Copyright (c) 2024 Huimin Peng, Chenyu Li, Ying Lu, Dazhou Li
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright on all articles published in this journal is retained by the author(s), while the author(s) grant the publisher as the original publisher to publish the article.
Articles published in this journal are licensed under a Creative Commons Attribution 4.0 International, which means they can be shared, adapted and distributed provided that the original published version is cited.