Patterns of Gene Expression Profiles Associated with Colorectal Cancer in Colorectal Mucosa by Using Machine Learning Methods
- PMID: 37957897
- DOI: 10.2174/0113862073266300231026103844
Patterns of Gene Expression Profiles Associated with Colorectal Cancer in Colorectal Mucosa by Using Machine Learning Methods
Abstract
Background: Colorectal cancer (CRC) has a very high incidence and lethality rate and is one of the most dangerous cancer types. Timely diagnosis can effectively reduce the incidence of colorectal cancer. Changes in para-cancerous tissues may serve as an early signal for tumorigenesis. Comparison of the differences in gene expression between para-cancerous and normal mucosa can help in the diagnosis of CRC and understanding the mechanisms of development.
Objectives: This study aimed to identify specific genes at the level of gene expression, which are expressed in normal mucosa and may be predictive of CRC risk.
Methods: A machine learning approach was used to analyze transcriptomic data in 459 samples of normal colonic mucosal tissue from 322 CRC cases and 137 non-CRC, in which each sample contained 28,706 gene expression levels. The genes were ranked using four ranking methods based on importance estimation (LASSO, LightGBM, MCFS, and mRMR) and four classification algorithms (decision tree [DT], K-nearest neighbor [KNN], random forest [RF], and support vector machine [SVM]) were combined with incremental feature selection [IFS] methods to construct a prediction model with excellent performance.
Result: The top-ranked genes, namely, HOXD12, CDH1, and S100A12, were associated with tumorigenesis based on previous studies.
Conclusion: This study summarized four sets of quantitative classification rules based on the DT algorithm, providing clues for understanding the microenvironmental changes caused by CRC. According to the rules, the effect of CRC on normal mucosa can be determined.
Keywords: Colorectal cancer; biomarker; feature selection.; gene expression; machine learning; mucosa.
Copyright© Bentham Science Publishers; For any queries, please email at [email protected].
Similar articles
-
Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers.Int J Mol Sci. 2023 Jul 6;24(13):11133. doi: 10.3390/ijms241311133. Int J Mol Sci. 2023. PMID: 37446311 Free PMC article.
-
Transcriptome profiling by combined machine learning and statistical R analysis identifies TMEM236 as a potential novel diagnostic biomarker for colorectal cancer.Sci Rep. 2021 Jul 12;11(1):14304. doi: 10.1038/s41598-021-92692-0. Sci Rep. 2021. PMID: 34253750 Free PMC article.
-
Enhanced non-invasive machine learning approach for early colorectal cancer detection: Predictive modeling and validation in a Jordanian cohort.Comput Biol Med. 2025 Jun;191:110184. doi: 10.1016/j.compbiomed.2025.110184. Epub 2025 Apr 17. Comput Biol Med. 2025. PMID: 40249989
-
Whale optimized mixed kernel function of support vector machine for colorectal cancer diagnosis.J Biomed Inform. 2019 Apr;92:103124. doi: 10.1016/j.jbi.2019.103124. Epub 2019 Feb 20. J Biomed Inform. 2019. PMID: 30796977
-
Analysis of gene expression profiles of lung cancer subtypes with machine learning algorithms.Biochim Biophys Acta Mol Basis Dis. 2020 Aug 1;1866(8):165822. doi: 10.1016/j.bbadis.2020.165822. Epub 2020 Apr 28. Biochim Biophys Acta Mol Basis Dis. 2020. PMID: 32360590
Cited by
-
Machine learning approaches reveal methylation signatures associated with pediatric acute myeloid leukemia recurrence.Sci Rep. 2025 May 6;15(1):15815. doi: 10.1038/s41598-025-99258-4. Sci Rep. 2025. PMID: 40328883 Free PMC article.
-
CMAGN: circRNA-miRNA association prediction based on graph attention auto-encoder and network consistency projection.BMC Bioinformatics. 2024 Oct 24;25(1):336. doi: 10.1186/s12859-024-05959-4. BMC Bioinformatics. 2024. PMID: 39449126 Free PMC article.
-
Machine Learning Identifies Key Gene Markers Related to Fetal Retina Development at Single-Cell Transcription Level.Invest Ophthalmol Vis Sci. 2025 Jun 2;66(6):60. doi: 10.1167/iovs.66.6.60. Invest Ophthalmol Vis Sci. 2025. PMID: 40531615 Free PMC article.
-
Herb-disease association prediction model based on network consistency projection.Sci Rep. 2025 Jan 27;15(1):3328. doi: 10.1038/s41598-025-87521-7. Sci Rep. 2025. PMID: 39865145 Free PMC article.
-
Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types.Life (Basel). 2024 Aug 19;14(8):1032. doi: 10.3390/life14081032. Life (Basel). 2024. PMID: 39202774 Free PMC article.
References
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
Research Materials
Miscellaneous