Abstract
Background: Globally, the most common form of arrhythmias is atrial fibrillation (AF), which causes severe morbidity, mortality, and socioeconomic burden. The application of machine learning algorithms in combination with weighted gene co-expression network analysis (WGCNA) can be used to screen genes, therefore, we aimed to screen for potential biomarkers associated with AF development using this integrated bioinformatics approach.Methods: On the basis of the AF endocardium gene expression profiles GSE79768 and GSE115574 from the Gene Expression Omnibus database, differentially expressed genes (DEGs) between AF and sinus rhythm samples were identified. DEGs enrichment analysis and transcription factor screening were then performed. Hub genes for AF were screened using WGCNA and machine learning algorithms, and the diagnostic accuracy was assessed by the receiver operating characteristic (ROC) curves. GSE41177 was used as the validation set for verification. Subsequently, we identified the specific signaling pathways in which the key biomarkers were involved, using gene set enrichment analysis and reverse prediction of mRNA–miRNA interaction pairs. Finally, we explored the associations between the hub genes and immune microenvironment and immune regulation.Results: Fifty-seven DEGs were identified, and the two hub genes, hypoxia inducible factor 1 subunit alpha inhibitor (HIF1AN) and mitochondrial inner membrane protein MPV17 (MPV17), were screened using WGCNA combined with machine learning algorithms. The areas under the receiver operating characteristic curves for MPV17 and HIF1AN validated that two genes predicted AF development, and the differential expression of the hub genes was verified in the external validation dataset. Enrichment analysis showed that MPV17 and HIF1AN affect mitochondrial dysfunction, oxidative stress, gap junctions, and other signaling pathway functions. Immune cell infiltration and immunomodulatory correlation analyses showed that MPV17 and HIF1AN are strongly correlated with the content of immune cells and significantly correlated with HLA expression.Conclusion: The identification of hub genes associated with AF using WGCNA combined with machine learning algorithms and their correlation with immune cells and immune gene expression can elucidate the molecular mechanisms underlying AF occurrence. This may further identify more accurate and effective biomarkers and therapeutic targets for the diagnosis and treatment of AF.