TY - JOUR
T1 - A method for identifying protein complexes with the features of joint co-localization and joint co-expression in static PPI networks
AU - Zhang, Jinxiong
AU - Zhong, Cheng
AU - Huang, Yiran
AU - Lin, Hai Xiang
AU - Wang, Mian
N1 - Accepted author manuscript
PY - 2019
Y1 - 2019
N2 - Identifying protein complexes in static protein-protein interaction (PPI) networks is essential for understanding the underlying mechanism of biological processes. Proteins in a complex are co-localized at the same place and co-expressed at the same time. We propose a novel method to identify protein complexes with the features of joint co-localization and joint co-expression in static PPI networks. To achieve this goal, we define a joint localization vector to construct a joint co-localization criterion of a protein group, and define a joint gene expression to construct a joint co-expression criterion of a gene group. Moreover, the functional similarity of proteins in a complex is an important characteristic. Thus, we use the CC-based, MF-based, and BP-based protein similarities to devise functional similarity criterion to determine whether a protein is functionally similar to a protein cluster. Based on the core-attachment structure and following to seed expanding strategy, we use four types of biological data including PPI data with reliability score, protein localization data, gene expression data, and gene ontology annotations, to identify protein complexes. The experimental results on yeast data show that comparing with existing methods our proposed method can efficiently and exactly identify more protein complexes, especially more protein complexes of sizes from 2 to 6. Furthermore, the enrichment analysis demonstrates that the protein complexes identified by our method have significant biological meaning.
AB - Identifying protein complexes in static protein-protein interaction (PPI) networks is essential for understanding the underlying mechanism of biological processes. Proteins in a complex are co-localized at the same place and co-expressed at the same time. We propose a novel method to identify protein complexes with the features of joint co-localization and joint co-expression in static PPI networks. To achieve this goal, we define a joint localization vector to construct a joint co-localization criterion of a protein group, and define a joint gene expression to construct a joint co-expression criterion of a gene group. Moreover, the functional similarity of proteins in a complex is an important characteristic. Thus, we use the CC-based, MF-based, and BP-based protein similarities to devise functional similarity criterion to determine whether a protein is functionally similar to a protein cluster. Based on the core-attachment structure and following to seed expanding strategy, we use four types of biological data including PPI data with reliability score, protein localization data, gene expression data, and gene ontology annotations, to identify protein complexes. The experimental results on yeast data show that comparing with existing methods our proposed method can efficiently and exactly identify more protein complexes, especially more protein complexes of sizes from 2 to 6. Furthermore, the enrichment analysis demonstrates that the protein complexes identified by our method have significant biological meaning.
KW - Core-attachment structure
KW - Joint co-expression
KW - Joint co-localization
KW - Protein complexes
KW - Seed expanding strategy
KW - Static PPI networks
UR - http://www.scopus.com/inward/record.url?scp=85069942248&partnerID=8YFLogxK
U2 - 10.1016/j.compbiomed.2019.103333
DO - 10.1016/j.compbiomed.2019.103333
M3 - Article
AN - SCOPUS:85069942248
SN - 0010-4825
VL - 111
SP - 1
EP - 19
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
M1 - 103333
ER -