A New Method to Identify Affected Pathway Subnetworks and Clusters in Colon Cancer
Abstract
Nowadays new technological developments that play an important role in the production of big data have brought about the interpretation, sharing and storage of data related to complex diseases. Combining multi-omic data in different molecular levels is potentially important for understanding the biological origin of complex diseases. One of these complex diseases is cancer of different types, which has one of the highest causes of death worldwide. The integration of multiple omic data in the framework of a comprehensive analysis and identification of relevant pathways contribute to the development of therapeutic approaches related to disease. In this study, RNA and methylation data (genes and p values) of colon adenocarcinoma were obtained from TCGA data portal and combined with Fisher's method. While protein subnetworks affected by the disease were identified by using subnetwork algorithm, pathways related to the disease and genes associated with these pathways were determined by functional enrichment analysis. Using gene-pathway relationship matrix, kappa scores of pathways were determined by similarity calculation. In this way, the pathways were clustered according to the hierarchically optimal number, as a result, the most important pathway clusters and related genes that are effective in disease formation identified.